Hereditas (Beijing) [遗传], 2024, Vol. 46, Issue 2: 149-167. doi: 10.16288/j.yczz.23-260

• Research Report •

  • About the author: Xin Zhu, master's student; research field: forensic evidence science. E-mail: 1987538437@qq.com

Paternal genetic structure analysis of the modern Han populations based on Y-SNP and Y-STR

Xin Zhu1,2, Xin Jin3, Jun Liu2,4, Lan Yang2,4, Lixin Zou2,5, Caixia Li1,2, Jiang Huang1, Li Jiang2

    1. Institute of Forensic Medicine, Guizhou Medical University, Guiyang 550004, China
    2. Key Laboratory of Forensic Genetics, Beijing Engineering Research Center of Crime Scene Evidence Examination, National Engineering Laboratory for Forensic Science, Institute of Forensic Science, Beijing 100038, China
    3. Public Security Department of Hainan Province, Haikou 570203, China
    4. Shanxi Medical University, Taiyuan 030001, China
    5. Jiangsu Normal University, Xuzhou 221116, China
  • Received: 2023-12-11   Revised: 2024-01-24   Published: 2024-02-20   Online: 2024-01-26
  • Contact: Jiang Huang, Li Jiang   E-mail: 1987538437@qq.com; mmm_hj@gmc.edu.cn; jl@mail.bnu.edu.cn
  • Supported by:
    National Key R&D Program of China (2022YFC3341004); National Natural Science Foundation of China (82171870); Fundamental Research Funds for Institute of Forensic Science (2022JB020); Open Project of the National Engineering Laboratory for Forensic Science (2021FGKFKT01)

Abstract:

The Han are the largest ethnic group in China. Previous studies have focused mainly on their genetic history (origin, migration, and integration) and on paternal genetic relationships within Han populations of particular regions, whereas the paternal genetic structure of Han populations has rarely been examined from a nationwide perspective. In this study, we performed Y-chromosome sequencing on 362 unrelated Han male samples collected from Qinghai, Sichuan, and Liaoning provinces and integrated them with data from published studies. The final dataset comprised 1830 samples from 16 Han populations across 15 provinces in China, covering 89 Y-SNPs and 16 Y-STRs. We calculated Y-SNP haplogroup frequencies and Y-STR haplotype diversity (HD), and used principal component analysis (PCA), phylogenetic trees, and haplotype networks to combine the two marker types, which reflect different time scales. With these analyses we examined genetic differentiation among Han populations from different regions and the genetic relationships between Han populations and the surrounding ethnic minorities. Haplogroup O-M175 was the predominant paternal lineage in Han populations, with frequencies ranging from 60.53% (Qinghai Han) to 92.7% (Guangdong Han), and its downstream subclades showed distinct regional distributions. Haplogroup O2-M122 was frequent in all Han populations, with an overall trend of higher frequencies in the north than in the south. Conversely, the frequency of haplogroup O1b-M268 decreased from south to north, and this haplogroup was particularly prominent among Han populations in the Lingnan region. Haplogroup O1a-M119 was more frequent in the central Han populations. The analysis of genetic structure indicated that Chinese Han populations fall into three main clusters (northern, central, and southern), and that Qinghai Han show a degree of genetic differentiation from Han populations in other regions. When ethnic minorities were included, Han populations were more closely related to one another than to the minorities; nevertheless, northern Han were genetically closer to the Hui, whereas southern Han were closer to the Gelao and Li. In summary, based on 89 Y-SNPs and 16 Y-STRs, this study systematically analyzed the haplogroup distribution and genetic substructure of Han populations from different regions of China, as well as their genetic relationships with surrounding ethnic minorities. The results add to the theoretical basis of population genetics and forensic genetics and provide data support for forensic applications of the Y chromosome. Combining Y-SNP haplogroups with Y-STR haplotypes is valuable both for analyzing the genetic substructure of Han populations and for forensic applications.

Key words: population genetics, forensic genetics, Y-SNP, Y-STR
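
The abstract reports Y-SNP haplogroup frequencies and Y-STR haplotype diversity (HD) but does not state how they were computed. The following is a minimal sketch, assuming HD is calculated with Nei's widely used estimator, HD = n/(n-1) × (1 − Σ p_i²), where n is the sample size and p_i the frequency of the i-th haplotype. The function names and the toy data are hypothetical illustrations and are not taken from the study.

```python
from collections import Counter


def haplogroup_frequencies(haplogroups):
    """Return the relative frequency of each haplogroup label in a sample."""
    n = len(haplogroups)
    if n == 0:
        raise ValueError("sample is empty")
    counts = Counter(haplogroups)
    return {label: count / n for label, count in counts.items()}


def haplotype_diversity(haplotypes):
    """Nei's estimator (assumed here): HD = n/(n-1) * (1 - sum(p_i^2))."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two haplotypes")
    counts = Counter(haplotypes)
    sum_sq = sum((count / n) ** 2 for count in counts.values())
    return n / (n - 1) * (1 - sum_sq)


# Toy data (made up): four individuals, one haplogroup label each,
# and one Y-STR haplotype each, written as a tuple of repeat counts.
toy_haplogroups = ["O2-M122", "O2-M122", "O1b-M268", "O1a-M119"]
toy_haplotypes = [(14, 13, 29), (14, 13, 29), (15, 12, 30), (16, 13, 28)]

freqs = haplogroup_frequencies(toy_haplogroups)
hd = haplotype_diversity(toy_haplotypes)
print(freqs)
print(round(hd, 4))
```

In this toy sample two individuals share a haplotype, so HD falls below 1; a sample in which every haplotype is unique gives HD = 1 under this estimator.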