畜牧兽医学报 ›› 2020, Vol. 51 ›› Issue (2): 205-216.doi: 10.11843/j.issn.0366-6964.2020.02.001
王琦, 朱迪, 王宇哲, 吴杰, 胡晓湘*, 赵毅强*
收稿日期:
2019-07-10
出版日期:
2020-02-23
发布日期:
2020-02-22
通讯作者:
胡晓湘,主要从事生物化学与分子生物学研究,E-mail:hxx@cau.edu.cn;赵毅强,主要从事生物信息学研究,E-mail:yiqiangz@cau.edu.cn
作者简介:
王琦(1995-),女,河南许昌人,博士生,主要从事猪基因组育种研究,E-mail:SZ20183020184@cau.edu.cn;朱迪(1997-),男,山东济宁人,博士生,主要从事家畜功能基因组研究,E-mail:zhudicau@163.com
基金资助:
WANG Qi, ZHU Di, WANG Yuzhe, WU Jie, HU Xiaoxiang*, ZHAO Yiqiang*
Received:
2019-07-10
Online:
2020-02-23
Published:
2020-02-22
摘要: 单核苷酸多态性(single nucleotide polymorphism,SNP)是遗传学研究中重要的材料。近年来,全基因组SNP标记开发方法的发展使得研究者们能够以较低成本获得丰富的基因组标记,大大推动了基因组水平的相关研究。基因组预测从已知基因型数据和表型数据的个体建立训练模型,对未知表型的个体进行基因型和表型预测,在育种领域具有重要意义。全基因组SNP的分型策略结合基因组预测方法,构成了动物基因组选择的前沿。本文从这两个方面进行综述,以期为从事分子遗传学,尤其是复杂性状研究的研究者们提供参考。
中图分类号:
王琦, 朱迪, 王宇哲, 吴杰, 胡晓湘, 赵毅强. 全基因组SNP分型策略及基因组预测方法的研究进展[J]. 畜牧兽医学报, 2020, 51(2): 205-216.
WANG Qi, ZHU Di, WANG Yuzhe, WU Jie, HU Xiaoxiang, ZHAO Yiqiang. Research Progress of Genomic-wide SNP Genotyping and Genomic Prediction Methods[J]. Acta Veterinaria et Zootechnica Sinica, 2020, 51(2): 205-216.
[1] | WALDMANN P.Approximate Bayesian neural networks in genomic prediction[J].Genet Sel Evol,2018,50:70. |
[2] | DAVIES R W,FLINT J,MYERS S,et al.Rapid genotype imputation from sequence without reference panels[J].Nat Genet, 2016,48(8):965-969. |
[3] | PASANIUC B,ROHLAND N,MCLAREN P J,et al.Extremely low-coverage sequencing and imputation increases power for genome-wide association studies[J].Nat Genet,2012,44(6):631-635. |
[4] | FRAZER K A,BALLINGER D G,COX D R,et al.A second generation human haplotype map of over 3.1 million SNPs[J]. Nature,2007,449(7164):851-861. |
[5] | STEEMERS F J,GUNDERSON K L.Whole genome genotyping technologies on the BeadArrayTM platform[J].Biotechnol J,2007,2(1):41-49. |
[6] | QIAO X,SU R,WANG Y,et al.Genome-wide target enrichment-aided chip design:a 66 K SNP chip for cashmere goat[J].Sci Rep,2017,7(1):8621. |
[7] | GU X R,FENG C G,MA L,et al.Genome-wide association study of body weight in chicken F2 resource population[J].PLoS One,2011,6(7):e21872. |
[8] | ZHU C Y,FAN H Y,YUAN Z H,et al.Detection of selection signatures on the X chromosome in three sheep breeds[J].Int J Mol Sci,2015,16(9):20360-20374. |
[9] | OSTERSEN T,CHRISTENSEN O F,HENRYON M,et al.Deregressed EBV as the response variable yield more reliable genomic predictions than traditional EBV in pure-bred pigs[J].Genet Sel Evol,2011,43:38. |
[10] | DIDION J P,YANG H,SHEPPARD K,et al.Discovery of novel variants in genotyping arrays improves genotype retention and reduces ascertainment bias[J].BMC Genomics,2012,13:34. |
[11] | 王宇哲.利用简化基因组测序精细定位鸡生长性状功能基因[D].北京:中国农业大学,2017.WNG Y Z.Fine mapping the QTL for growth traits in chicken by improves genotyping by sequencing[D].Beijing:China Agricultural University,2017.(in Chinese) |
[12] | AVISE J C,LANSMAN R A,SHADE R O.The use of restriction endonucleases to measure mitochondrial dna sequence relatedness in natural populations.I.population structure and evolution in the genus peromyscus[J].Genomics, 1979, 92(1):279-295. |
[13] | BROWN W M.Polymorphism in mitochondrial DNA of humans as revealed by restriction endonuclease analysis[J].Proc Natl Acad Sci U S A,1980,77(6):3605-3609. |
[14] | VAN TASSELL C P,SMITH T P L,MATUKUMALLI L K,et al.SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries[J].Nat Methods,2008,5(3):247-252. |
[15] | WIEDMANN R T,SMITH T P,NONNEMAN D J.SNP discovery in swine by reduced representation and high throughput pyrosequencing[J]. BMC Genet,2008,9:81. |
[16] | MILLER M R,DUNHAM J P,AMORES A,et al.Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers[J].Genome Res,2007,17(2):240-248. |
[17] | ELSHIRE R J,GLAUBITZ J C,SUN Q,et al.A robust,simple genotyping-by-sequencing (GBS) approach for high diversity species[J].PLoS One,2011,6(5):e19379. |
[18] | ANDREWS K R,GOOD J M,MILLER M R,et al.Harnessing the power of RADseq for ecological and evolutionary genomics[J].Nat Rev Genet,2016,17(2):81-92. |
[19] | HE J F,ZHAO X Q,LAROCHE A,et al.Genotyping-by-sequencing (GBS),an ultimate marker-assisted selection (MAS) tool to accelerate plant breeding[J].Front Plant Sci,2014,5:484. |
[20] | WANG Y Z,CAO X M,ZHAO Y Q,et al.Optimized double-digest genotyping by sequencing (ddGBS) method with high-density SNP markers and high genotyping accuracy for chickens[J].PLoS One,2017,12(6):e0179073. |
[21] | TAN C,WU Z F,REN J L,et al.Genome-wide association study and accuracy of genomic prediction for teat number in Duroc pigs using genotyping-by-sequencing[J].Genet Sel Evol,2017,49:35. |
[22] | GAUTIER M,GHARBI K,CEZARD T,et al.The effect of RAD allele dropout on the estimation of genetic variation within and between populations[J].Mol Ecol,2013,22(11):3165-3178. |
[23] | SCHWEYEN H,ROZENBERG A,LEESE F.Detection and removal of PCR duplicates in population genomic ddRAD studies by addition of a degenerate base region (DBR) in sequencing adapters[J].Boil Bull,2014,227(2):146-160. |
[24] | TIN M M Y,RHEINDT F E,CROS E,et al.Degenerate adaptor sequences for detecting PCR duplicates in reduced representation sequencing data improve genotype calling accuracy[J].Mol Ecol Resour,2015,15(2):329-336. |
[25] | TOONEN R J,PURITZ J B,FORSMAN Z H,et al.ezRAD:a simplified method for genomic genotyping in non-model organisms[J].PeerJ,2013,1:e203. |
[26] | LITI G,CARTER D M,MOSES A M,et al.Population genomics of domestic and wild yeasts[J].Nature,2009, 458(7236):337-341. |
[27] | NIELSEN R,PAUL J S,ALBRECHTSEN A,et al.Genotype and SNP calling from next-generation sequencing data[J].Nat Rev Genet,2011,12(6):443-451. |
[28] | HOWIE B,FUCHSBERGER C,STEPHENS M,et al.Fast and accurate genotype imputation in genome-wide association studies through pre-phasing[J].Nat Genet,2012,44(8):955-959. |
[29] | DELANEAU O,ZAGURY J F,MARCHINI J.Improved whole-chromosome phasing for disease and population genetic studies[J].Nat Methods,2013,10(1):5-6. |
[30] | BROWNING B L,BROWNING S R.Genotype imputation with millions of reference samples[J].Am J Hum Genet, 2016, 98(1):116-126. |
[31] | HOWIE B N,DONNELLY P,MARCHINI J.A flexible and accurate genotype imputation method for the next generation of genome-wide association studies[J].PLoS Genet,2009,5(6):e1000529. |
[32] | HUANG X H,WEI X H,SANG T,et al.Genome-wide association studies of 14 agronomic traits in rice landraces[J].Nat Genet,2010,42(11):961-967. |
[33] | LIU S Y,HUANG S J,CHEN F,et al.Genomic analyses from non-invasive prenatal testing reveal genetic associations,patterns of viral infections,and chinese population history[J].Cell,2018,175(2):347-359.e14. |
[34] | NICOD J,DAVIES R W,CAI N,et al.Genome-wide association of multiple complex traits in outbred mice by ultra-low-coverage sequencing[J].Nat Genet,2016,48(8):912-918. |
[35] | YANG R F,GUO X L,ZHU D,et al.Genome-wide association analyses of multiple traits in Duroc pigs using low-coverage whole-genome sequencing strategy[J].Preprint Server Biol,2019,doi:10.1101/754671. |
[36] | ROS-FREIXEDES R,BATTAGIN M,JOHNSSON M,et al.Impact of index hopping and bias towards the reference allele on accuracy of genotype calls from low-coverage sequencing[J].Genet Sel Evol,2018,50:64. |
[37] | TIBSHIRANI R.Regression shrinkage and selection via the lasso[J].J Roy Stat B,1996,58(1):267-288. |
[38] | VASQUEZ M M,HU C C,ROE D J,et al.Least absolute shrinkage and selection operator type methods for the identification of serum biomarkers of overweight and obesity:simulation and application[J].BMC Med Res Methodol,2016,16(1):154. |
[39] | VASQUEZ M M,HU C C,ROE D J,et al.Measurement error correction in the least absolute shrinkage and selection operator model when validation data are available[J].Stat Methods Med Res,2019,28(3):670-680. |
[40] | ARBET J,MCGUE M,CHATTERJEE S,et al.Resampling-based tests for Lasso in genome-wide association studies[J].BMC Genet,2017,18:70. |
[41] | DE VLAMING R,GROENEN P J F.The current and future use of ridge regression for prediction in quantitative genetics[J]. Biomed Res Int,2015,2015:143712. |
[42] | VANRADEN P M.Efficient methods to compute genomic predictions[J].J Dairy Sci,2008,91(11):4414-4423. |
[43] | GARCÍA-RUIZ A,COLE J B,VANRADEN P M,et al.Changes in genetic selection differentials and generation intervals in US Holstein dairy cattle as a result of genomic selection[J].Proc Natl Acad Sci U S A,2016,113(28):E3995-E4004. |
[44] | TENG J Y,GAO N,ZHANG H B,et al.Performance of whole genome prediction for growth traits in a crossbred chicken population[J].Poult Sci,2019,98(5):1968-1975. |
[45] | WANG X,XU Y,HU Z L,et al.Genomic selection methods for crop improvement:current status and prospects[J].Crop J,2018,6(4):330-340. |
[46] | AGUILAR I,MISZTAL I,JOHNSON D L,et al.Hot topic:a unified approach to utilize phenotypic,full pedigree,and genomic information for genetic evaluation of Holstein final score[J].J Dairy Sci,2010,93(2):743-752. |
[47] | FRAGOMENI B O,LOURENCO D A L,LEGARRA A,et al.Alternative SNP weighting for single-step genomic best linear unbiased predictor evaluation of stature in US Holsteins in the presence of selected sequence variants[J].J Dairy Sci,2019,102(11):10012-10019. |
[48] | TEISSIER M,LARROQUE H,ROBERT-GRANIÉ C.Weighted single-step genomic BLUP improves accuracy of genomic breeding values for protein content in French dairy goats:a quantitative trait influenced by a major gene[J].Genet Sel Evol,2018,50:31. |
[49] | MEUWISSEN T H,HAYES B J,GODDARD M E.Prediction of total genetic value using genome-wide dense marker maps[J].Genetics,2001,157(4):1819-1829. |
[50] | HABIER D,FERNANDO R L,KIZILKAYA K,et al.Extension of the bayesian alphabet for genomic selection[J].BMC Bioinformatics,2011,12:186. |
[51] | CHEN K,LU Y T,ZHAO H Y,et al.Predicting the change of exon splicing caused by genetic variant using support vector regression[J].Hum Mutat,2019,40(9):1235-1242. |
[52] | HO T K.Random decision forests[C]//Proceedings of 3rd International Conference on Document Analysis and Recognition.Montreal:IEEE,1995. |
[53] | BRENNAN R S,HEALY T M,BRYANT H J,et al.Integrative population and physiological genomics reveals mechanisms of adaptation in killifish[J].Mol Biol Evol,2018,35(11):2639-2653. |
[54] | MCCULLOCH W S,PITTS W.A logical calculus of the ideas immanent in nervous activity.1943[J].Bull Math Biol,1990,52(1-2):99-115. |
[55] | WU J,ZHAO Y Q.Machine learning technology in the application of genome analysis:a systematic review[J].Gene,2019, 705:149-156. |
[56] | MA W L,QIU Z X,SONG J,et al.A deep convolutional neural network approach for predicting phenotypes from genotypes[J].Plnata,2018,248(5):1307-1318. |
[57] | ZOU J,HUSS M,ABID A,et al.A primer on deep learning in genomics[J].Nat Genet,2019,51(1):12-18. |
[1] | 路畅, 董磊, 张万锋, 高鹏飞, 郭晓红, 蔡春波, 曹果清, 李步高. 基于全基因组重测序对晋汾白猪单核苷酸多态性位点鉴定和筛选[J]. 畜牧兽医学报, 2023, 54(7): 2761-2771. |
[2] | 刘继强, 郝晓东, 武丽娜, 廖诗莹, 冯羿方, 弥世荣, 刘燊, 刘建, 张龙超. 全基因组SNP分型技术在畜禽遗传育种研究中的应用[J]. 畜牧兽医学报, 2022, 53(12): 4123-4137. |
[3] | 任钰为, 王峰, 王成, 张艳, 孙瑞萍, 刘海隆, 乔传民, 邢漫萍, 黄丽丽, 曹宗喜, 晁哲. 五指山猪和杜洛克猪全基因组选择信号差异分析[J]. 畜牧兽医学报, 2022, 53(12): 4172-4182. |
[4] | 李宏伟, 徐凌洋, 王泽昭, 蔡文涛, 朱波, 陈燕, 高雪, 张路培, 高会江, 李俊雅. 基于单倍型肉牛屠宰性状全基因组关联分析研究[J]. 畜牧兽医学报, 2022, 53(12): 4232-4243. |
[5] | 许静漪, 徐昊祺, 胡丽蓉, 张帆, 高清, 罗汉鹏, 张海亮, 师睿, 李想, 刘林, 郭刚, 王雅春. MET基因多态性与中国荷斯坦牛繁殖和泌乳性状的关联分析[J]. 畜牧兽医学报, 2022, 53(11): 3769-3785. |
[6] | 陶璇, 何志平, 梁艳, 杨雪梅, 雷云峰, 王言, 胡紫惠, 廖坤, 肖贤勋, 格桑, 马贵云, 吉绒丹增, 敖翔, 吕学斌, 顾以韧. 不同地方猪重要经济性状关联位点SNP芯片分型及群体遗传结构研究[J]. 畜牧兽医学报, 2022, 53(10): 3358-3367. |
[7] | 宋兴超, 刘琳玲, 潘虹军, 赵家平, 贾赟, 杨福合, 徐超. 水貂酪氨酸酶(TYR)基因克隆、SNPs筛查及其皮肤组织mRNA差异表达分析[J]. 畜牧兽医学报, 2021, 52(1): 66-76. |
[8] | 张文建, 马广伟, 张潇飞, 褚衍凯, 刘静, 杨华, 闫晓红, 李玉茂, 李辉, 王宁. 羊毛品质性状相关基因CCNY的验证和分析[J]. 畜牧兽医学报, 2017, 48(8): 1389-1400. |
[9] | 杨文志,张明月,王冠楠,赵宇鹏,张巍巍,李世杰. 2个印记的基因间lncRNAs位于牛Dlk1-Dio3印记区域[J]. 畜牧兽医学报, 2016, 47(9): 1848-1852. |
[10] | 宋兴超,徐超,刘宗岳,岳志刚,丛波,刘琳玲,杨福合. 美洲水貂刺鼠信号蛋白基因SNPs检测及其与毛色表型的关联分析[J]. 畜牧兽医学报, 2016, 47(4): 723-732. |
[11] | 洪潜龙,张颖,曹春伟,王宪龙,赵建国. 巴马小型猪OCA2基因序列、组织表达分析及载体构建研究[J]. 畜牧兽医学报, 2016, 47(3): 439-448. |
[12] | 赵谦,浦亚斌,关伟军,赵倩君,马月辉. 猪重要性状全基因组关联分析的研究进展[J]. 畜牧兽医学报, 2015, 46(6): 873-881. |
[13] | 王兴平,罗仍卓么,李峰,李德才,都鑫. 湘西黄牛Myf5和Pax7基因的SNPs检测及其与体尺性状的关联分析[J]. 畜牧兽医学报, 2014, 45(9): 1531-1537. |
[14] | 朱波,吴洋,齐欣,牛红,张静静,王延晖,陈燕,张路培,高会江,高雪,李俊雅,孙少华. 利用混合线性模型和BayesCPi进行QTL-MAS2011公共数据集的全基因组关联分析[J]. 畜牧兽医学报, 2014, 45(5): 692-698. |
[15] | 丁月云,朱卫华,薛玮纬,孟云,黄龙,张晓东,殷宗俊. 安徽地方猪TLR4基因第3外显子的SNP分析[J]. 畜牧兽医学报, 2014, 45(11): 1767-1774. |
阅读次数 | ||||||
全文 |
|
|||||
摘要 |
|
|||||