Hereditas (Beijing), 2020, Vol. 42, Issue 2: 212-221. doi: 10.16288/j.yczz.20-030

• Resources and Platforms •

The 2019 novel coronavirus resource

Wenming Zhao1,2,3, Shuhui Song1,2, Meili Chen1,2, Dong Zou1,2, Lina Ma1,2, Yingke Ma1,2, Rujiao Li1,2, Lili Hao1,2, Cuiping Li1,2, Dongmei Tian1,2, Bixia Tang1,2, Yanqing Wang1,2, Junwei Zhu1,2, Huanxin Chen1,2, Zhang Zhang1,2,3, Yongbiao Xue1,3, Yiming Bao1,2,3

  1. China National Center for Bioinformation & National Genomics Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
  2. CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
  3. University of Chinese Academy of Sciences, Beijing 100049, China
  • Received: 2020-01-31  Revised: 2020-02-07  Online: 2020-02-20  Published: 2020-02-08
  • Supported by:
    the National Key Research & Development Program of China (2016YFE0206600); the National Key Research & Development Program of China (2017YFC1201202); the 13th Five-year Informatization Plan of CAS (XXH13505-05); the Strategic Priority Research Program of the Chinese Academy of Sciences (CAS) (XDA19050302); the Capacity Building Project of the Genome Science Data Center of the Chinese Academy of Sciences (0202); the Key Technology Talent Program of the CAS; the Youth Innovation Promotion Association of the Chinese Academy of Sciences

Abstract:

An outbreak of novel coronavirus pneumonia that began in Wuhan, China, in December 2019 had caused 31,516 infections and 638 deaths across 25 countries and regions as of 16:00 on February 7, 2020. The World Health Organization named the causative virus the 2019 novel coronavirus (2019-nCoV). To promote data sharing and make relevant information on 2019-nCoV publicly available in a timely manner, we constructed the 2019 Novel Coronavirus Resource (2019nCoVR, https://bigd.big.ac.cn/ncov). 2019nCoVR integrates publicly released nucleotide and protein sequences of 2019-nCoV, together with their metadata, from the Global Initiative on Sharing All Influenza Data, the National Center for Biotechnology Information, China National GeneBank, the National Microbiology Data Center, and the China National Center for Bioinformation (CNCB)/National Genomics Data Center (NGDC). It also incorporates related scientific literature, news, and popular science articles, and it visualizes the results of genome variation analysis across all collected 2019-nCoV strains. Moreover, by linking seamlessly with related databases in CNCB/NGDC, 2019nCoVR offers online submission, management, and sharing services for the raw sequencing reads and assembled sequences of newly sequenced strains, as well as synchronized release to international databases. In this report, we describe data deposition, management, release, and use in 2019nCoVR, which provides an important foundation for research on virus classification and origin tracing, genome variation and evolution, rapid detection, drug development, and the precise prevention and treatment of the novel pneumonia.

Key words: 2019nCoVR, 2019 novel coronavirus, China National Center for Bioinformation (CNCB), National Genomics Data Center (NGDC), genomic data sharing