遗传 ›› 2026, Vol. 48 ›› Issue (5): 506-521.doi: 10.16288/j.yczz.25-335
收稿日期:2025-12-19
修回日期:2026-01-14
出版日期:2026-03-11
发布日期:2026-03-11
通讯作者:
边黎明,博士,教授,研究方向:林木遗传改良。E-mail: Lmbian@njfu.edu.cn;作者简介:蒋雨萱,硕士研究生,专业方向:林木遗传育种。E-mail: 806939913@qq.com
基金资助:
Yuxuan Jiang1(
), Liming Bian1(
), Shiliang Zhou1, Charles Chen4, Yousry A. El-Kassaby2, Zhiqiang Chen1(
), Harry X. Wu1,3
Received:2025-12-19
Revised:2026-01-14
Published:2026-03-11
Online:2026-03-11
Supported by:摘要:
湿地松(Pinus elliottii)基因组庞大且重复序列占比高,现有低密度单核苷酸多态性(single nucleotide polymorphism,SNP)芯片在标记覆盖与连锁信息刻画方面存在局限。为提高群体遗传分析的标记密度及基因组亲缘关系矩阵(genomic relationship matrix,GRM)估计精度,本研究以50株母本约10×全基因组重测序数据构建参考面板,并对715株半同胞子代的51K芯片分型数据进行全基因组填充。芯片原有位点采用遮盖实验评估填充准确性,参考面板扩增且芯片未覆盖的新增位点采用子代重测序外部验证评估一致性并筛选,遮盖验证一致性稳定在约95.5%,参考面板扩增位点经阈值筛选后保留高可信位点,构建了覆盖715个体的高密度基因型矩阵,包含120,650,180个SNP。局部连锁不平衡(linkage disequilibrium,LD)热图对比显示,增密后LD信号更连续且块结构更清晰;以第4号染色体上10.22~10.33 Mb区段为例,高LD位点对比例由14.5%增至27.6%。基于增密数据构建的基因组关系矩阵与芯片矩阵在非对角线元素上保持高度一致,Pearson相关系数约为0.984;距离分层分析进一步显示,距芯片标记500 kb以内的填充位点构建的矩阵一致性更高,而更远距离窗口的一致性逐步下降,提示远距离填充位点带来的改进相对有限。综上,本研究建立的参考面板驱动填充、验证与整合流程可为湿地松及相关针叶树的全基因组关联分析与基因组选择研究提供高密度基因型数据基础。
蒋雨萱, 边黎明, 周师亮, 陈志强. 基因型填充提高湿地松SNP密度与遗传分析准确性[J]. 遗传, 2026, 48(5): 506-521.
Yuxuan Jiang, Liming Bian, Shiliang Zhou, Charles Chen, Yousry A. El-Kassaby, Zhiqiang Chen, Harry X. Wu. Genotype imputation improves SNP density and genetic analysis accuracy in slash pine[J]. Hereditas(Beijing), 2026, 48(5): 506-521.
| [1] |
Ding XY, Diao S, Luan QF, Wu HX, Zhang YN, Jiang JM. A transcriptome-based association study of growth, wood quality, and oleoresin traits in a slash pine breeding population. PLoS Genet, 2022, 18(2): e1010017.
pmid: 35108269 |
| [2] |
Bernhardsson C, Zan YJ, Chen ZQ, Ingvarsson PK, Wu HX. Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst.) by whole genome resequencing and its transferability to other spruce species. Mol Ecol Resour, 2021, 21(3): 880-896.
pmid: 33179386 |
| [3] |
Estravis Barcala M, van der Valk T, Chen ZQ, Funda T, Chaudhary R, Klingberg A, Fundova I, Suontama M, Hallingbäck H, Bernhardsson C, Nystedt B, Ingvarsson PK, Sherwood E, Street N, Gyllensten U, Nilsson O, Wu HX. Whole-genome resequencing facilitates the development of a 50K single nucleotide polymorphism genotyping array for Scots pine (Pinus sylvestris L.) and its transferability to other pine species. Plant J, 2024, 117(3): 944-955.
pmid: 37947292 |
| [4] |
Caballero M, Lauer E, Bennett J, Zaman S, McEvoy S, Acosta J, Jackson C, Townsend L, Eckert A, Whetten RW, Loopstra C, Holliday J, Mandal M, Wegrzyn JL, Isik F. Toward genomic selection in Pinus taeda: integrating resources to support array design in a complex conifer genome. Appl Plant Sci, 2021, 9(6): e11439.
pmid: 34268018 |
| [5] | Diao S, Ding XY, Luan QF, Chen ZQ, Wu HX, Li XC, Zhang YN, Sun JM, Wu YD, Zou LH, Jiang JM. Development of 51K liquid-phased probe array for loblolly and slash pines and its application to GWAS of slash pine breeding population. Ind Crops Prod, 2024, 216: 118777. |
| [6] |
Zhang TY, Sun XM, Li J, Cui K. Application and development prospect of genomic selection breeding in coniferous trees. Planta, 2025, 262(6): 133.
pmid: 41144003 |
| [7] |
Li JH, Mazur CA, Berisa T, Pickrell JK. Low-pass sequencing increases the power of GWAS and decreases measurement error of polygenic risk scores compared to genotyping arrays. Genome Res, 2021, 31(4): 529-537.
pmid: 33536225 |
| [8] | Ros-Freixedes R. The contribution of whole-genome sequence data to genome-wide association studies in livestock: outcomes and perspectives. Livest Sci, 2024, 281: 105430. |
| [9] |
Rubinacci S, Ribeiro DM, Hofmeister RJ, Delaneau O. Efficient phasing and imputation of low-coverage sequencing data using large reference panels. Nat Genet, 2021, 53(1): 120-126.
pmid: 33414550 |
| [10] |
McCarthy S, Das S, Kretzschmar W, Delaneau O, Wood AR, Teumer A, Kang HM, Fuchsberger C, Danecek P, Sharp K, Luo Y, Sidore C, Kwong A, Timpson N, Koskinen S, Vrieze S, Scott LJ, Zhang H, Mahajan A, Veldink J, Peters U, Pato C, van Duijn CM, Gillies CE, Gandin I, Mezzavilla M, Gilly A, Cocca M, Traglia M, Angius A, Barrett JC, Boomsma D, Branham K, Breen G, Brummett CM, Busonero F, Campbell H, Chan A, Chen S, Chew E, Collins FS, Corbin LJ, Smith GD, Dedoussis G, Dorr M, Farmaki AE, Ferrucci L, Forer L, Fraser RM, Gabriel S, Levy S, Groop L, Harrison T, Hattersley A, Holmen OL, Hveem K, Kretzler M, Lee JC, McGue M, Meitinger T, Melzer D, Min JL, Mohlke KL, Vincent JB, Nauck M, Nickerson D, Palotie A, Pato M, Pirastu N, McInnis M, Richards JB, Sala C, Salomaa V, Schlessinger D, Schoenherr S, Slagboom PE, Small K, Spector T, Stambolian D, Tuke M, Tuomilehto J, Van den Berg LH, Van Rheenen W, Volker U, Wijmenga C, Toniolo D, Zeggini E, Gasparini P, Sampson MG, Wilson JF, Frayling T, de Bakker PIW, Swertz MA, McCarroll S, Kooperberg C, Dekker A, Altshuler D, Willer C, Iacono W, Ripatti S, Soranzo N, Walter K, Swaroop A, Cucca F, Anderson CA, Myers RM, Boehnke M, McCarthy MI, Durbin R, Haplotype Reference Consortium. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet, 2016, 48(10): 1279-1283.
pmid: 27548312 |
| [11] |
Das S, Forer L, Schönherr S, Sidore C, Locke AE, Kwong A, Vrieze SI, Chew EY, Levy S, McGue M, Schlessinger D, Stambolian D, Loh PR, Iacono WG, Swaroop A, Scott LJ, Cucca F, Kronenberg F, Boehnke M, Abecasis GR, Fuchsberger C. Next-generation genotype imputation service and methods. Nat Genet, 2016, 48(10): 1284-1287.
pmid: 27571263 |
| [12] |
Minamikawa MF, Kunihisa M, Moriya S, Shimizu T, Inamori M, Iwata H. Genomic prediction and genome-wide association study using combined genotypic data from different genotyping systems: application to apple fruit quality traits. Hortic Res, 2024, 11(7): uhae131.
pmid: 38979105 |
| [13] |
Chen LF, Yang SP, Araya S, Quigley C, Taliercio E, Mian R, Specht JE, Diers BW, Song QJ. Genotype imputation for soybean nested association mapping population to improve precision of QTL detection. Theor Appl Genet, 2022, 135(5): 1797-1810.
pmid: 35275252 |
| [14] |
Tan HZ, Stuart KC, Vi T, Whibley A, Bailey S, Brekke P, Santure AW. High imputation accuracy can be achieved using a small reference panel in a natural population with low genetic diversity. Mol Ecol Resour, 2025, 25(8): e70024.
pmid: 40797301 |
| [15] |
Zheng WG, Ma WL, Chen ZL, Wang C, Sun T, Dong WJ, Zhang WJ, Zhang S, Tang ZL, Li K, Zhao YX, Liu YW. DPImpute: a genotype imputation framework for ultra- low coverage whole-genome sequencing and its application in genomic selection. Adv Sci (Weinh), 2025, 12(16): e2412482.
pmid: 40013759 |
| [16] |
Gao YJ, Yang ZQ, Yang WQ, Yang YB, Gong J, Yang QY, Niu XH. Plant-ImputeDB: an integrated multiple plant reference panel database for genotype imputation. Nucleic Acids Res, 2021, 49(D1): D1480-D1488.
pmid: 33137192 |
| [17] |
Jensen SE, Charles JR, Muleta K, Bradbury PJ, Casstevens T, Deshpande SP, Gore MA, Gupta R, Ilut DC, Johnson L, Lozano R, Miller Z, Ramu P, Rathore A, Romay MC, Upadhyaya HD, Varshney RK, Morris GP, Pressoir G, Buckler ES, Ramstein GP. A sorghum practical haplotype graph facilitates genome-wide imputation and cost-effective genomic prediction. Plant Genome, 2020, 13(1): e20009.
pmid: 33016627 |
| [18] |
Mora-Márquez F, Nuño JC, Soto Á, de Heredia UL. Missing genotype imputation in non-model species using self-organizing maps. Mol Ecol Resour, 2025, 25(3): e13992.
pmid: 38970328 |
| [19] |
Naito T, Okada Y. Genotype imputation methods for whole and complex genomic regions utilizing deep learning technology. J Hum Genet, 2024, 69(10): 481-486.
pmid: 38225263 |
| [20] |
Browning BL, Zhou Y, Browning SR. A one-penny imputed genome from next-generation reference panels. Am J Hum Genet, 2018, 103(3): 338-348.
pmid: 30100085 |
| [21] |
Zhang RF, Zhang Y, Liu TN, Jiang B, Li ZY, Qu YP, Chen YS, Li ZC. Utilizing variants identified with multiple genome-wide association study methods optimizes genomic selection for growth traits in pigs. Animals (Basel), 2023, 13(4): 722.
pmid: 36830509 |
| [22] |
Marchini J, Howie B. Genotype imputation for genome- wide association studies. Nat Rev Genet, 2010, 11(7): 499-511.
pmid: 20517342 |
| [23] |
Nguyen TV, Bolormaa S, Reich CM, Chamberlain AJ, Vander Jagt CJ, Daetwyler HD, MacLeod IM. Empirical versus estimated accuracy of imputation: optimising filtering thresholds for sequence imputation. Genet Sel Evol, 2024, 56(1): 72.
pmid: 39548370 |
| [24] |
Delaneau O, Zagury JF, Robinson MR, Marchini JL, Dermitzakis ET. Accurate, scalable and integrative haplotype estimation. Nat Commun, 2019, 10(1): 5436.
pmid: 31780650 |
| [25] |
Cahoon JL, Rui XY, Tang E, Simons C, Langie J, Chen MH, Lo YC, Chiang CWK. Imputation accuracy across global human populations. Am J Hum Genet, 2024, 111(5): 979-989.
pmid: 38604166 |
| [26] | Hao CL, Yu XF, Qu MH, Lai EH, Guo SM, Gao L. Current status and prospects of pan-genome studies in plants. Plant Sci J, 2022, 40(1): 124-132. |
| 郝晨路, 於晓芬, 曲明昊, 赖恩惠, 郭素敏, 高磊. 植物泛基因组研究进展与展望. 植物科学学报, 2022, 40(1): 124-132. | |
| [27] |
Zimin AV, Stevens KA, Crepeau MW, Puiu D, Wegrzyn JL, Yorke JA, Langley CH, Neale DB, Salzberg SL. An improved assembly of the loblolly pine mega-genome using long-read single-molecule sequencing. Gigascience, 2017, 6(1): 1-4.
pmid: 28369353 |
| [28] | Chen Y, Qiu A, Zhang ZP, Du HH, Bai JY, Wang GJ, Luo WX, Ni JQ, Li K, Ding XD. Study on the genotype imputation effect of 10K-50K genotype of pig SNP liquid chip. Acta Vet Zootech Sin, 2022, 53(10): 3368-3376. |
| 陈宇, 邱奥, 张梓鹏, 都鹤鹤, 白俊艳, 王贵江, 罗文学, 倪俊卿, 李凯, 丁向东. 猪SNP液相芯片10K-50K基因型填充效果研究. 畜牧兽医学报, 2022, 53(10): 3368-3376. | |
| [29] |
Watowich MM, Chiou KL, Graves B, Montague MJ, Brent LJN, Higham JP, Horvath JE, Lu A, Martinez MI, Platt ML, Schneider-Crease IA, Lea AJ, Snyder-Mackler N. Best practices for genotype imputation from low-coverage sequencing data in natural populations. Mol Ecol Resour, 2025, 25(5): e13854.
pmid: 37602981 |
| [30] |
Wragg D, Zhang WG, Peterson S, Yerramilli M, Mellanby R, Schoenebeck JJ, Clements DN. A cautionary tale of low-pass sequencing and imputation with respect to haplotype accuracy. Genet Sel Evol, 2024, 56(1): 6.
pmid: 38216889 |
| [31] |
Taliun D, Harris DN, Kessler MD, Carlson J, Szpiech ZA, Torres R, Gagliano Taliun SA, Corvelo A, Gogarten SM, Kang HM, Pitsillides AN, LeFaive J, Lee SB, Tian XW, Browning BL, Das S, Emde AK, Clarke WE, Loesch DP, Shetty AC, Blackwell TW, Smith AV, Wong Q, Liu XM, Conomos MP, Bobo DM, Aguet F, Albert C, Alonso A, Ardlie KG, Arking DE, Aslibekyan S, Auer PL, Barnard J, Barr RG, Barwick L, Becker LC, Beer RL, Benjamin EJ, Bielak LF, Blangero J, Boehnke M, Bowden DW, Brody JA, Burchard EG, Cade BE, Casella JF, Chalazan B, Chasman DI, Chen YDI, Cho MH, Choi SH, Chung MK, Clish CB, Correa A, Curran JE, Custer B, Darbar D, Daya M, de Andrade M, DeMeo DL, Dutcher SK, Ellinor PT, Emery LS, Eng C, Fatkin D, Fingerlin T, Forer L, Fornage M, Franceschini N, Fuchsberger C, Fullerton SM, Germer S, Gladwin MT, Gottlieb DJ, Guo XQ, Hall ME, He J, Heard-Costa NL, Heckbert SR, Irvin MR, Johnsen JM, Johnson AD, Kaplan R, Kardia SLR, Kelly T, Kelly S, Kenny EE, Kiel DP, Klemmer R, Konkle BA, Kooperberg C, Köttgen A, Lange LA, Lasky-Su J, Levy D, Lin XH, Lin KH, Liu CY, Loos RJF, Garman L, Gerszten R, Lubitz SA, Lunetta KL, Mak ACY, Manichaikul A, Manning AK, Mathias RA, McManus DD, McGarvey ST, Meigs JB, Meyers DA, Mikulla JL, Minear MA, Mitchell BD, Mohanty S, Montasser ME, Montgomery C, Morrison AC, Murabito JM, Natale A, Natarajan P, Nelson SC, North KE, O'Connell JR, Palmer ND, Pankratz N, Peloso GM, Peyser PA, Pleiness J, Post WS, Psaty BM, Rao DC, Redline S, Reiner AP, Roden D, Rotter JI, Ruczinski I, Sarnowski C, Schoenherr S, Schwartz DA, Seo JS, Seshadri S, Sheehan VA, Sheu WH, Shoemaker MB, Smith NL, Smith JA, Sotoodehnia N, Stilp AM, Tang WH, Taylor KD, Telen M, Thornton TA, Tracy RP, Van Den Berg DJ, Vasan RS, Viaud-Martinez KA, Vrieze S, Weeks DE, Weir BS, Weiss ST, Weng LC, Willer CJ, Zhang YZ, Zhao XT, Arnett DK, Ashley-Koch AE, Barnes KC, Boerwinkle E, Gabriel S, Gibbs R, Rice KM, Rich SS, Silverman EK, Qasba P, Gan WN, NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium, Papanicolaou GJ, Nickerson DA, Browning SR, Zody MC, Zöllner S, Wilson JG, Cupples LA, Laurie CC, Jaquish CE, Hernandez RD, O'Connor TD, Abecasis GR. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed program. Nature, 2021, 590(7845): 290-299.
pmid: 33568819 |
| [32] |
Shi SN, Rubinacci S, Hu SL, Moutsianas L, Stuckey A, Need AC, Palamara PF, Caulfield M, Marchini J, Myers S. A Genomics England haplotype reference panel and imputation of UK Biobank. Nat Genet, 2024, 56(9): 1800-1803.
pmid: 39134668 |
| [33] |
Zhao HH, MacLeod IM, Keeble-Gagnere G, Barbulescu DM, Tibbits JF, Kaur S, Hayden M. Using genotype imputation to integrate Canola populations for genome- wide association and genomic prediction of blackleg resistance. BMC Genomics, 2025, 26(1): 215.
pmid: 40038585 |
| [34] |
Flanagan J, Liu XX, Ortega-Reyes D, Tomizuka K, Matoba N, Akiyama M, Koido M, Ishigaki K, Ashikawa K, Takata S, Shi MY, Aoi T, Momozawa Y, Ito K, Murakami Y, Matsuda K, Biobank Japan Project, Kamatani Y, Morris AP, Horikoshi M, Terao C. Population-specific reference panel improves imputation quality for genome-wide association studies conducted on the Japanese population. Commun Biol, 2024, 7(1): 1665.
pmid: 39702642 |
| [35] |
Liu SK, Shi CY, Chen CG, Tan Y, Tian Y, Macqueen DJ, Li Q. Haplotype-resolved genomes provide insights into the origins and functional significance of genome diversity in bivalves. Cell Rep, 2025, 44(5): 115697.
pmid: 40349337 |
| [36] |
Sun Q, Li Y. Advances in haplotype phasing and genotype imputation. Nat Rev Genet, 2026, 27(2): 155-169.
pmid: 40993437 |
| [37] |
Sengupta D, Botha G, Meintjes A, Mbiyavanga M, AWI-Gen Study, H3Africa Consortium, Hazelhurst S, Mulder N, Ramsay M, Choudhury A. Performance and accuracy evaluation of reference panels for genotype imputation in sub-Saharan African populations. Cell Genom, 2023, 3(6): 100332.
pmid: 37388906 |
| [38] |
Ros-Freixedes R, Johnsson M, Whalen A, Chen CY, Valente BD, Herring WO, Gorjanc G, Hickey JM. Genomic prediction with whole-genome sequence data in intensely selected pig lines. Genet Sel Evol, 2022, 54(1): 65.
pmid: 36153511 |
| [39] |
Ebert P, Audano PA, Zhu QH, Rodriguez-Martin B, Porubsky D, Jan Bonder M, Sulovari A, Ebler J, Zhou WC, Serra Mari R, Yilmaz F, Zhao XF, Hsieh P, Lee J, Kumar S, Lin JD, Rausch T, Chen Y, Ren JW, Santamarina M, Höps W, Ashraf H, Chuang NT, Yang XF, Munson KM, Lewis AP, Fairley S, Tallon LJ, Clarke WE, Basile AO, Byrska-Bishop M, Corvelo A, Evani US, Lu TY, Chaisson MJP, Chen JJ, Li C, Brand H, Wenger AM, Ghareghani M, Harvey WT, Raeder B, Hasenfeld P, Regier AA, Abel HJ, Hall IM, Flicek P, Stegle O, Gerstein MB, Tubio JMC, Mu ZP, Li YI, Shi XH, Hastie AR, Ye K, Chong ZC, Sanders AD, Zody MC, Talkowski ME, Mills RE, Devine SE, Lee C, Korbel JO, Marschall T, Eichler EE. Haplotype- resolved diverse human genomes and integrated analysis of structural variation. Science, 2021, 372(6537): eabf7117.
pmid: 33632895 |
| [40] |
Lu MM, Krutovsky KV, Nelson CD, Koralewski TE, Byram TD, Loopstra CA. Exome genotyping, linkage disequilibrium and population structure in loblolly pine (Pinus taeda L.). BMC Genomics, 2016, 17(1): 730.
pmid: 27624183 |
| [41] |
Thistlethwaite FR, Ratcliffe B, Klápště J, Porth I, Chen C, Stoehr MU, El-Kassaby YA. Genomic prediction accuracies in space and time for height and wood density of Douglas-fir using exome capture as the genotyping platform. BMC Genomics, 2017, 18(1): 930.
pmid: 29197325 |
| [1] | 魏文琦, 蔡大伟. 基因型填充技术在古DNA研究中的应用与潜在价值[J]. 遗传, 2026, 48(5): 471-482. |
| [2] | 王梦燏, 周成浩, 薛倩, 殷建玫, 蒋一秀, 张会永, 李国辉, 韩威. “酉芯一号”在地方鸡遗传多样性和结构分析中的应用效力研究[J]. 遗传, 2024, 46(8): 640-648. |
| [3] | 李智,何俊,蒋隽,Richard G. Tait Jr.,Stewart Bauck,过伟,吴晓林. 牛SNP芯片分型检出率和分型错误率对基因型填充准确率的影响[J]. 遗传, 2019, 41(7): 644-652. |
| [4] | 何俊,钱长嵩,RichardG.TaitJr.,StewartBauck,吴晓林. SNP芯片数据估计动物个体基因组品种构成的方法及应用[J]. 遗传, 2018, 40(4): 305-314. |
| [5] | 刘小磊,杨松柏,Max F Rothschild,ZHANG Zhi-Wu,樊斌. 利用紧缩线性模型和贝叶斯模型对猪总产仔数和产活仔数性状的全基因组关联研究[J]. 遗传, 2012, 34(10): 1261-1270. |
| 阅读次数 | ||||||
|
全文 |
|
|||||
|
摘要 |
|
|||||
www.chinagene.cn
备案号:京ICP备09063187号-4
总访问:,今日访问:,当前在线: