Acta Veterinaria et Zootechnica Sinica ›› 2024, Vol. 55 ›› Issue (6): 2431-2440.doi: 10.11843/j.issn.0366-6964.2024.06.015

• Animal Genetics and Breeding • Previous Articles     Next Articles

Methods of Genotype Feature Extraction Affecting the Prediction Accuracy of Genomic Selection

Huaxuan WU(), Zhiqiang DU*()   

  1. College of Animal Science and Technology, Yangtze University, Jingzhou 434025, China
  • Received:2023-11-08 Online:2024-06-23 Published:2024-06-28
  • Contact: Zhiqiang DU E-mail:2021710855@yangtzeu.edu.cn;zhqdu@yangtzeu.edu.cn

Abstract:

The purpose of this study was to explore and evaluate 6 different methods for extracting genotype feature of single nucleotide polymorphisms (SNP). Six methods were analyzed and compared: principal component analysis (PCA), gene-principal component analysis (gene-PCA), SNP-Pearson correlation coefficient (SNP-PCC), linkage disequilibrium (LD), and genome-wide association study (GWAS) and random sampling (RS). The prediction accuracy of GEBV in 2 sets of data (Beijing duck, 542 samples, SNP loci 39 932; Duroc pig, 2 549 samples, SNP loci 230 884) and 3 sets of phenotypes (Beijing duck body length, Duroc pig backfat thickness and teat number) was evaluated. Results showed that SNP-PCC combined with 5 GS methods (GBLUP, BayesA, BayesB, BayesC, and Bayesian Lasso) achieved relatively reliable prediction accuracy for the Pecking duck body length phenotype and achieved the highest average prediction accuracy in pig backfat thickness and teat number phenotypes (increased by 5%, reaching 32.3%), and significantly improved computational efficiency (on average 5-7 times faster). In summary, this study found that selecting appropriate feature extraction methods can effectively improve the accuracy and computational efficiency of GS prediction, laying the foundation for in-depth research on the impact of different feature extraction methods on GS prediction accuracy, and providing reference for their application in breeding practice.

Key words: genomic selection, feature extraction, prediction accuracy

CLC Number: