Hierarchical multiple-factor analysis for classifying genotypes based on phenotypic and genetic data

Franco, J and Crossa, J and Desphande, S (2010) Hierarchical multiple-factor analysis for classifying genotypes based on phenotypic and genetic data. Crop Science, 50 (1). pp. 105-117.

[img] PDF - Published Version
Restricted to ICRISAT users only

Download (863kB) | Request a copy


A numerical classifi cation problem encountered by breeders and gene-bank curators is how to partition the original heterogeneous population of genotypes into non-overlapping homogeneous subpopulations. The measure of distance that may be defi ned depends on the type of variables measured (i.e., continuous and/or discrete). The key points are whether and how a distance may be defi ned using all types of variables to achieve effective classifi cation. The objective of this research was to propose an approach that combines the use of hierarchical multiple-factor analysis (HMFA) and the two-stage Ward Modifi ed Location Model (Ward-MLM) classifi cation strategy that allows (i) combining different types of phenotypic and genetic data simultaneously; (ii) balancing out the effects of the different phenotypic, genetic, continuous, and discrete variables; and (iii) measuring the contribution of each original variable to the new principal axes (PAs). Of the two strategies applied for developing PA scores to be used for clustering genotypes, the strategy that used the fi rst few PA scores to which phenotypic and genetic variables each contributed 50% (i.e., a balanced contribution) formed better groups than those formed by the strategy that used a large number of PA scores explaining 95% of total variability. Phenotypic variables account for much variability in the initial PA; then their contributions decrease. The importance of genetic variables increases in later PAs. Results showed that various phenotypic and genetic variables made important contributions to the new PA. The HMFA uses all phenotypic and genetic variables simultaneously and, in conjunction with the Ward-MLM method, it offers an effective unifying approach for the classifi cation of breeding genotypes into homogeneous groups and for the formation of core subsets for genetic resource conservation.

Item Type: Article
Agro Tags: <b>Agrotags</b> - ratoons | genetics | phenotypes | genotypes | genetic variation | environment | social groups | laboratory equipment | crops | imports <br><b>Fishtags</b> - NOT-AVAILABLE<br><b>Geopoliticaltags</b> - principe | maine | usa | mexico | uruguay
Subjects: Others > Genetics and Genomics
Depositing User: Users 6 not found.
Date Deposited: 07 Jul 2011 05:26
Last Modified: 01 Sep 2011 12:43
URI: http://oar.icrisat.org/id/eprint/90
Official URL: http://dx.doi.org/10.2135/cropsci2009.01.0053
Funders: Suri Sehgal Foundation
Acknowledgement: The authors thank Drs. Rajan Sharma, R.P. Thakur, and C.T. Hash of ICRISAT for the sorghum data set. The sorghum data were generated in a research project sponsored by the Sehgal Foundation Endowment Fund
View Statistics

Actions (login required)

View Item View Item