G-DIRT: a web server for identification and removal of duplicate germplasms based on identity-by-state analysis using single nucleotide polymorphism genotyping data

Sahu, T K and Singh, A K and Mittal, S and Jha, S K and Kumar, S and Jacob, S R and Singh, K (2022) G-DIRT: a web server for identification and removal of duplicate germplasms based on identity-by-state analysis using single nucleotide polymorphism genotyping data. Briefings in Bioinformatics, 23 (5). ISSN 1477-4054

Full text not available from this repository. (Request a copy)

Abstract

Maintaining duplicate germplasms in genebanks hampers effective conservation and utilization of genebank resources. The redundant germplasm adds to the cost of germplasm conservation by requiring a large proportion of the genebank financial resources towards conservation rather than enriching the diversity. Besides, genome-wide-association analysis using an association panel with over-represented germplasms can be biased resulting in spurious marker-trait associations. The conventional methods of germplasm duplicate removal using passport information suffer from incomplete or missing passport information and data handling errors at various stages of germplasm enrichment. This limitation is less likely in the case of genotypic data. Therefore, we developed a web-based tool, Germplasm Duplicate Identification and Removal Tool (G-DIRT), which allows germplasm duplicate identification based on identity-by-state analysis using single-nucleotide polymorphism genotyping information along with pre-processing of genotypic data. A homozygous genotypic difference threshold of 0.1% for germplasm duplicates has been determined using tetraploid wheat genotypic data with 94.97% of accuracy. Based on the genotypic difference, the tool also builds a dendrogram that can visually depict the relationship between genotypes. To overcome the constraint of high-dimensional genotypic data, an offline version of G-DIRT in the interface of R has also been developed. The G-DIRT is expected to help genebank curators, breeders and other researchers across the world in identifying germplasm duplicates from the global genebank collections by only using the easily sharable genotypic data instead of physically exchanging the seeds or propagating materials. The web server will complement the existing methods of germplasm duplicate identification based on passport or phenotypic information being freely accessible at

Item Type: Article
Divisions: Genebank
CRP: UNSPECIFIED
Uncontrolled Keywords: GWAS, identity-by-state, duplicate identification, genotype, germplasm conservation, genebank
Subjects: Others > Germplasm
Others > Gene Bank
Depositing User: Mr Nagaraju T
Date Deposited: 09 Nov 2023 11:12
Last Modified: 09 Nov 2023 11:12
URI: http://oar.icrisat.org/id/eprint/12286
Official URL: https://academic.oup.com/bib/article-abstract/23/5...
Projects: UNSPECIFIED
Funders: UNSPECIFIED
Acknowledgement: UNSPECIFIED
Links:
View Statistics

Actions (login required)

View Item View Item