Acta Veterinaria et Zootechnica Sinica ›› 2025, Vol. 56 ›› Issue (10): 4963-4972.doi: 10.11843/j.issn.0366-6964.2025.10.017

• Animal Genetics and Breeding • Previous Articles     Next Articles

Advantages of Using Population-specific Reference Genome for SNP Calling in Chinese Indicine Cattle

LI Aixin1(), LI Ziyang1, CHEN Wenjie1, TIAN Yuyang1, LEI Chuzhao1, LI Zhigang2,*(), CHEN Ningbo1,*()   

  1. 1. College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China
    2. Pingdingshan Livestock Husbandry Development Center, Pingdingshan 467000, China
  • Received:2025-03-06 Online:2025-10-23 Published:2025-11-01
  • Contact: LI Zhigang, CHEN Ningbo E-mail:lax2561957992@163.com;long0014@163.com;ningbochen@nwafu.edu.cn

Abstract:

This study aimed to use the population-specific reference genome of Chinese indicine cattle and systematically evaluate its advantages in identifying SNPs. This study focused on whole-genome resequencing data from 60 Chinese indicine cattle samples to identify and compare SNPs via both the European taurine cattle reference genome (ARS-UCD1.2) and the population-specific reference genome of Chinese indicine cattle (Leiqiong breed: ASM3988116v1). For the multiallelic SNPs detected in ARS-UCD1.2, genomic coordinate mapping chain files between the genomes was established, converting these SNPs into biallelic SNPs on the basis of the coordinates of the population-specific reference genome, followed by functional annotation. Variants detected in Chinese indicine cattle populations via the ASM3988116v1 population-specific reference genome presented significant advantages over the ARS-UCD1.2 genome: 1) This approach could perform a more comprehensive identification of intronic and untranslated region variants, increasing the detection sensitivity for low-frequency and rare variants; 2) It could reduce false positives in variant identification caused by reference bias; 3) It facilitated the conversion of some multiallelic SNPs, previously filtered out owing to genomic reference bias, into biallelic SNPs. These SNPs were annotated to 8 352 genes, including important genes related to the growth, development, and environmental adaptability of indicine cattle, such as muscle development (CTNNA1), immunity (SIL1), blood circulation (VPS13A), muscle development and photoperiod regulation (EYA3), etc. The specific reference genome for Chinese cattle can increase variant detection sensitivity, reduce reference genome bias, and uncover more functionally significant loci. This provides a high-confidence data foundation for population genetics research and precision breeding, offering important theoretical and practical implications.

Key words: population-specific reference genome, SNP, reference bias, biallelic/multi-allelic SNPs, functional genes

CLC Number: