基因组二代测序数据的自动化分析流程

doi:10.3724/SP.J.1005.2014.0618

[an error occurred while processing this directive]

HEREDITAS(Beijing) ›› 2014, Vol. 36 ›› Issue (6): 618-624.doi: 10.3724/SP.J.1005.2014.0618

• Technique and Method • Previous Articles

Automatic analysis pipeline of next-generation sequencing data

Wenke Li¹, Fengyu Li^{1, 2}, Siyao Zhang¹, Bin Cai¹, Na Zheng¹, Yu Nie¹, Dao Zhou², Qian Zhao¹

1. State Key Laboratory of Cardiovascular Disease, Fuwai Hospital, National Center for Cardiovascular Disease, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100037, China;
2. College of Biomedical Engineering, South-Central University for Nationalities, Wuhan 430074, China

Received:2013-09-07 Revised:2014-01-20 Online:2014-06-20 Published:2014-05-28

Abstract

Abstract:

The development of next-generation sequencing has generated high demand for data processing and analysis. Although there are a lot of software for analyzing next-generation sequencing data, most of them are designed for one specific function (e.g., alignment, variant calling or annotation). Therefore, it is necessary to combine them together for data analysis and to generate interpretable results for biologists. This study designed a pipeline to process Illumina sequencing data based on Perl programming language and SGE system. The pipeline takes original sequence data (fastq format) as input, calls the standard data processing software (e.g., BWA, Samtools, GATK, and Annovar), and finally outputs a list of annotated variants that researchers can further analyze. The pipeline simplifies the manual operation and improves the efficiency by automatization and parallel computation. Users can easily run the pipeline by editing the configuration file or clicking the graphical interface. Our work will facilitate the research projects using the sequencing technology.

Key words: next generation sequencing, automatic data analysis, pipeline, variantion detection

Wenke Li, Fengyu Li, Siyao Zhang, Bin Cai, Na Zheng, Yu Nie, Dao Zhou, Qian Zhao. Automatic analysis pipeline of next-generation sequencing data[J]. HEREDITAS(Beijing), 2014, 36(6): 618-624.

[1]	Yongxin Liu,Yuan Qin,Xiaoxuan Guo,Yang Bai. Methods and applications for microbiome data analysis [J]. Hereditas(Beijing), 2019, 41(9): 845-862.
[2]	Fang Liu, Xiaozhen Song, Hua Xie, Xiaoli Chen. The pathogenicity of somatic mutation to common tumors and developmental malformation of the nervous system [J]. HEREDITAS(Beijing), 2016, 38(3): 196-205.
[3]	Sheng Shen, Yanchun Qu, Jun Zhang. The application of next generation sequencing on epigenetic study [J]. HEREDITAS, 2014, 36(3): 256-275.
[4]	Huijun Yuan, Yu Lu. Application of next generation sequencing in gene identification and genetic diagnosis of hereditary hearing loss [J]. HEREDITAS(Beijing), 2014, 36(11): 1112-1120.
[5]	TANG Hai-Ming, CHEN Hong, ZHANG Jing, REN Jing-Yi, XU Ning. Application of next generation sequencing in microRNA detection [J]. HEREDITAS, 2012, 34(6): 784-792.
[6]	GAO Shan, ZHANG Ning, LI Bo, XU Shuo, YE Yan-Bo, RUAN Ji-Shou. Processing and analysis of ChIP-seq data [J]. HEREDITAS, 2012, 34(6): 773-783.
[7]	LIANG Ye, CHEN Shuang-Yan, LIU Gong-She. Application of next generation sequencing techniques in plant transcriptome [J]. HEREDITAS, 2011, 33(12): 1317-1326.
[8]	BO Xing-Hua, SHU Hai-Yang, MARJANI Sadie L.. Technological advances in single-cell genomic analyses [J]. HEREDITAS, 2011, 33(1): 17-24.
[9]	QIN Qiao-Beng, ZHANG Lan-Lan, LI Na-Yi, CUI Yong-Yi, XU Kai. Optimizing of cDNA preparation for next generation sequencing [J]. HEREDITAS, 2010, 32(9): 974-977.
[10]	. Establishment of target genomic DNA capturing system for next generation sequencing [J]. HEREDITAS, 2010, 32(12): 1296-1303.

Automatic analysis pipeline of next-generation sequencing data

PDF (PC)

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 10

Recommended Articles

Metrics

Comments