ENCODE计划和功能基因组研究

doi:10.3724/SP.J.1005.2014.0237

遗传 ›› 2014, Vol. 36 ›› Issue (3): 237-247.doi: 10.3724/SP.J.1005.2014.0237

ENCODE计划和功能基因组研究

丁楠^{1, 2}, 渠鸿竹¹, 方向东¹

1. 中国科学院北京基因组研究所, 中国科学院基因组科学及信息重点实验室, 北京 100101;
2. 中国科学院大学, 北京100049

收稿日期:2013-09-17 修回日期:2013-12-23 出版日期:2014-03-20 发布日期:2014-02-27
作者简介:丁楠, 在读博士研究生, 专业方向：基因组学数据挖掘。Tel: 010-84097538; E-mail: dingnan@big.ac.cn
基金资助:
中国科学院干细胞与再生医学研究战略性科技先导专项子课题(编号：XDA01040405)资助

The ENCODE project and functional genomics studies

Nan Ding^1,2, Hongzhu Qu¹, Xiangdong Fang¹

1. CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China;
2. University of Chinese Academy of Sciences, Beijing 100049, China

Received:2013-09-17 Revised:2013-12-23 Published:2014-03-20 Online:2014-02-27

摘要/Abstract

摘要：

人类基因组计划完成以来, 科学家们一直在努力阐释基因组信息所代表的生物学意义。自2003年开始, 美国国家人类基因组研究所(National Human Genome Research Institute, NHGRI)投资近3亿美元启动“DNA元件百科全书(Encyclopedia of DNA Elements, ENCODE)”计划, 集结了来自美国、中国、英国、日本、西班牙和新加坡等国家的32个实验室的440余名科学家, 共同鉴定并分析人类基因组中所有的功能调控元件。高通量测序技术等实验手段的发展和生物信息学技术的不断完善使得ENCODE计划取得了丰硕的成果：确定了甲基化和组蛋白修饰等表观修饰区域及其对染色质结构的作用, 进而确定染色质结构的改变影响基因表达; 确定了转录因子及其结合位点的信息, 并构建了转录因子调控网络; 进一步修订更新了假基因和非编码RNA数据库; 并确定了调控序列的单核苷酸多态性(Single nucleotide polymorphism, SNP)并与疾病相关联。这些发现一方面有助于系统解析基因和基因组信息、调控元件的调控作用以及非编码区转录调控等分子机制; 同时也将为转化医学等生命科学研究领域提供丰富的数据来源。文章综述了高通量测序技术等实验手段的发展和生物信息学技术的不断完善对ENCODE计划的贡献、表观遗传学研究与ENCODE计划的关联性、ENCODE计划的主要科学成果等, 同时展望了ENCODE计划对基础医学、临床医学和转化医学等生命科学研究领域的巨大推动作用。

关键词: ENCODE, 表观遗传学, 新一代测序技术, 转录调控

Abstract:

Upon the completion of the Human Genome Project, scientists have been trying to interpret the underlying genomic code for human biology. Since 2003, National Human Genome Research Institute (NHGRI) has invested nearly $0.3 billion and gathered over 440 scientists from more than 32 institutions in the United States, China, United Kingdom, Japan, Spain and Singapore to initiate the Encyclopedia of DNA Elements (ENCODE) project, aiming to identify and analyze all regulatory elements in the human genome. Taking advantage of the development of next-generation sequencing technologies and continuous improvement of experimental methods, ENCODE had made remarkable achievements: identified methylation and histone modification of DNA sequences and their regulatory effects on gene expression through altering chromatin structures, categorized binding sites of various transcription factors and constructed their regulatory networks, further revised and updated database for pseudogenes and non-coding RNA, and identified SNPs in regulatory sequences associated with diseases. These findings help to comprehensively understand information embedded in gene and genome sequences, the function of regulatory elements as well as the molecular mechanism underlying the transcriptional regulation by noncoding regions, and provide extensive data resource for life sciences, particularly for translational medicine. We re-viewed the contributions of high-throughput sequencing platform development and bioinformatical technology improve-ment to the ENCODE project, the association between epigenetics studies and the ENCODE project, and the major achievement of the ENCODE project. We also provided our prospective on the role of the ENCODE project in promoting the development of basic and clinical medicine.

Key words: ENCODE, epigenetics, next-generation sequencing, transcriptional regulation

丁楠, 渠鸿竹, 方向东. ENCODE计划和功能基因组研究[J]. 遗传, 2014, 36(3): 237-247.

Nan Ding, Hongzhu Qu, Xiangdong Fang. The ENCODE project and functional genomics studies[J]. HEREDITAS, 2014, 36(3): 237-247.

参考文献

[1] Qu HZ, Fang XD. A brief review on the Human Encyclopedia of DNA Elements (ENCODE) project. Genomics Proteomics Bioinformatics, 2013, 11(3): 135–141. <\p>

[2] Weinstock GM. ENCODE: more genomic empowerment. Genome Res, 2007, 17(6): 667–668. <\p>

[3] ENCODE Project Consortium, Birney E, Stamatoyannopoulos JA, Dutta A。 Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature, 2007, 447(7146): 799–816. <\p>

[4] American Association for Cancer Research Human Epigenome Task Force E U, Network of Excellence, Scientific Advisory Board. Moving ahead with an international human epigenome project. Nature, 2008, 454(7205): 711– 715. <\p>

[5] Shendure J, Ji H. Next-generation DNA sequencing. Nat Biotechnol, 2008, 26(10): 1135–1145. <\p>

[6] Metzker ML. Sequencing technologies-the next generation. Nat Rev Genet, 2010, 11(1): 31–46. <\p>

[7] Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang H, Vernot B, Garg K, John S, Sandstrom R, Bates D, Boatman L, Canfield TK, Diegel M, Dunn D, Ebersol AK, Frum T, Giste E, Johnson AK, Johnson EM, Kutyavin T, Lajoie B, Lee BK, Lee K, London D, Lotakis D, Neph S, Neri F, Nguyen ED, Qu H, Reynolds AP, Roach V, Safi A, Sanchez ME, Sanyal A, Shafer A, Simon JM, Song L, Vong S, Weaver M, Yan Y, Zhang Z, Zhang Z, Lenhard B, Tewari M, Dorschner MO, Hansen RS, Navas PA, Stamatoyannopoulos G, Iyer VR, Lieb JD, Sunyaev SR, Akey JM, Sabo PJ, Kaul R, Furey TS, Dekker J, Crawford GE, Stamatoyannopoulos JA. The accessible chromatin landscape of the human genome. Nature, 2012, 489(7414): 75– 82. <\p>

[8] Satterlee JS, Schubeler D, Ng HH. Tackling the epigenome: challenges and opportunities for collaboration. Nat Biotechnol, 2010, 28(10): 1039–1044. <\p>

[9] Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, Aken BL, Barrell D, Zadissa A, Searle S, Barnes I, Bignell A, Boychenko V, Hunt T, Kay M, Mukherjee G, Rajan J, Despacio-Reyes G, Saunders G, Steward C, Harte R, Lin M, Howald C, Tanzer A, Derrien T, Chrast J, Walters N, Balasubramanian S, Pei B, Tress M, Rodriguez JM, Ezkurdia I, van Baren J, Brent M, Haussler D, Kellis M, Valencia A, Reymond A, Gerstein M, Guigo R, Hubbard TJ. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res, 2012, 22(9): 1760–1774. <\p>

[10] Natarajan A, Yardimci GG, Sheffield NC, Crawford GE, Ohler U. Predicting cell-type-specific gene expression from regions of open chromatin. Genome Res, 2012, 22(9): 1711–1722. <\p>

[11] Cheng C, Yan KK, Yip KY, Rozowsky J, Alexander R, Shou C, Gerstein M. A statistical framework for modeling gene expression using chromatin features and application to modENCODE datasets. Genome Biol, 2011, 12(2): R15. <\p>

[12] Dong X, Greven MC, Kundaje A, Djebali S, Brown JB, Cheng C, Gingeras TR, Gerstein M, Guigo R, Birney E, Weng Z. Modeling gene expression using chromatin features in various cellular contexts. Genome Biol, 2012, 13(9): R53. <\p>

[13] Li G, Ruan X, Auerbach RK, Sandhu K S, Zheng M, Wang P, Poh HM, Goh Y, Lim J, Zhang J, Sim HS, Peh SQ, Mulawadi FH, Ong CT, Orlov YL, Hong S, Zhang Z, Landt S, Raha D, Euskirchen G, Wei CL, Ge W, Wang H, Davis C, Fisher-Aylor K I, Mortazavi A, Gerstein M, Gingeras T, Wold B, Sun Y, Fullwood M J, Cheung E, Liu E, Sung WK, Snyder M, Ruan Y. Extensive promoter- centered chromatin interactions provide a topological basis for transcription regulation. Cell, 2012, 148(1–2): 84–98. <\p>

[14] Tilgner H, Knowles DG, Johnson R, Davis CA, Chakrabortty S, Djebali S, Curado J, Snyder M, Gingeras TR, Guigo R. Deep sequencing of subcellular RNA fractions shows splicing to be predominantly co-transcriptional in the human genome

编辑推荐

Metrics

www.chinagene.cn
备案号：京ICP备09063187号-4
总访问:,今日访问:,当前在线:

[1]	安赛男, 杨欢淳, 姜姗, 李靖轩, 张根发. 融入生物信息学分析的综合性探究型表观遗传学实验设计与探索[J]. 遗传, 2025, 47(5): 600-608.
[2]	任佳琳, 童依泽, 蔡瑞. 染色质相关RNA的m⁶A修饰调控染色质可及性与基因转录研究进展[J]. 遗传, 2025, 47(11): 1186-1196.
[3]	王纪龙, 李青, 战廷正. 自转录活性调节区测序技术在增强子发现研究中的应用[J]. 遗传, 2024, 46(8): 589-602.
[4]	刘岱缘, 张朝晖, 康现江. 精子染色质完整性对功能的影响及其检测方法研究进展[J]. 遗传, 2024, 46(7): 511-529.
[5]	尤琳琳, 张余. 细菌转录终止的分子机制研究进展[J]. 遗传, 2024, 46(12): 982-994.
[6]	王艳妮, 李佳. 单细胞DNA甲基化测序数据处理流程与分析方法[J]. 遗传, 2024, 46(10): 807-819.
[7]	王承贤, 容益康, 崔敏. 果蝇限制端粒转座子的分子机制[J]. 遗传, 2023, 45(3): 221-228.
[8]	欧秀芳, 吴莹, 李宁, 姜丽丽, 刘宝, 宫磊. 基于科教融合培养大学生拔尖创新能力的表观遗传学综合实验课程[J]. 遗传, 2023, 45(12): 1158-1168.
[9]	吴丹丹, 朱明昆, 方忠艳, 马伟. 植物B染色体的分子结构组成及遗传机制研究进展[J]. 遗传, 2022, 44(9): 772-782.
[10]	张元, 赵语婷, 庄乐南, 贺津. 转录中介体复合物在心血管发育和疾病中的转录调控作用[J]. 遗传, 2022, 44(5): 383-397.
[11]	刘国芳, 任沛东, 叶文新, 陆光涛. 十字花科黑腐病菌中转录因子HpaR1与Clp调控一个糖苷水解酶基因表达的分析[J]. 遗传, 2021, 43(9): 910-920.
[12]	王天一, 王应祥, 尤辰江. 植物PHD结构域蛋白的结构与功能特性[J]. 遗传, 2021, 43(4): 323-339.
[13]	张向前, 李楠, 解新明. 表观遗传学综合性实验设计与探讨[J]. 遗传, 2021, 43(12): 1179-1187.
[14]	刘国芳, 王欣欣, 苏辉昭, 陆光涛. 细菌GntR家族转录调控因子的研究进展[J]. 遗传, 2021, 43(1): 66-73.
[15]	邱晓芬, 汤冬娥, 虞海燕, 廖秋燕, 胡芷洋, 周俊, 赵鑫, 何慧燕, 梁灼健, 许承明, 杨明, 戴勇. 基于单细胞ATAC测序技术对18-三体综合征染色质开放性区域转录因子的分析[J]. 遗传, 2021, 43(1): 74-83.

ENCODE计划和功能基因组研究

The ENCODE project and functional genomics studies

PDF (PC)

可视化

被引次数

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics