Hereditas (Beijing) ›› 2018, Vol. 40 ›› Issue (11): 1039-1043. doi: 10.16288/j.yczz.18-190

• Resources and Platforms •

The BIG Data Center’s database resources

Yuansheng Zhang 1,2,3; Lin Xia 1,2,3; Jian Sang 1,2,3; Man Li 1,2,3; Lin Liu 1,2,3; Mengwei Li 1,2,3; Guangyi Niu 1,2,3; Jiabao Cao 1,2,3; Xufei Teng 1,2,3; Qing Zhou 1,2,3; Zhang Zhang 1,2,3

  1. BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
  2. CAS Key Laboratory of Genomics and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
  3. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received: 2018-07-05 Revised: 2018-09-12 Published: 2018-11-20 Online: 2018-09-18
  • Contact: Zhang Zhang E-mail: zhangzhang@big.ac.cn
  • About the authors: Yuansheng Zhang, master's student, specializing in bioinformatics. E-mail: zhangyuansheng@big.ac.cn | Lin Xia, doctoral student, specializing in bioinformatics. E-mail: xialin@big.ac.cn | Jian Sang, doctoral student, specializing in bioinformatics. E-mail: sangj@big.ac.cn. Yuansheng Zhang, Lin Xia and Jian Sang are co-first authors.
  • Supported by:
    the Strategic Priority Research Program of the Chinese Academy of Sciences (XDA19050302, XDB13040500, XDA08020102); the National Key Research & Development Program of China (2016YFC0901603); the 13th Five-year Informatization Plan of the Chinese Academy of Sciences (XXH13505-05)

Abstract:

Multi-omics data in the life and health sciences are a fundamental resource for scientific research and biomedical technology development. However, China lacks a platform for biological data management and sharing, which makes it difficult to meet the growing research needs of biomedicine and related fields and severely constrains the integration, sharing and translational use of the country's biological big data. To address these issues, the Beijing Institute of Genomics (BIG) of the Chinese Academy of Sciences founded the BIG Data Center (BIGD) in early 2016. BIGD is dedicated to establishing a biological big data management platform and a multi-omics database system, with a particular focus on national population health and important strategic biological resources. In this paper, we describe the core database resources of BIGD: GSA (Genome Sequence Archive) for raw omics data, GWH (Genome Warehouse) for genomes, GVM (Genome Variation Map) for genome variation, GEN (Gene Expression Nebulas) for gene expression, MethBank (Methylation Bank) for methylation, BioCode for bioinformatics tools, and Science Wikis for life science wiki knowledge bases. Together, these resources provide services for data deposition, integration and sharing, laying a foundation for improving the management of life science data in China and promoting the construction of a national bioinformatics center.

Key words: big data, omics, data sharing, data resource, bioinformatics