Hereditas (Beijing) [遗传] ›› 2014, Vol. 36 ›› Issue (1): 58-68. doi: 10.3724/SP.J.1005.2014.00058

• Research Article •

Transcriptome analysis for leaves of five chemical types in Cinnamomum camphora

Xiangmei Jiang, Yanfang Wu, Fuming Xiao, Zhenyu Xiong, Haining Xu

  1. Camphor Engineering Technology Research Center for State Forestry Administration, Jiangxi Academy of Forestry, Nanchang 330032, China
  • Received: 2013-04-25 Revised: 2013-07-23 Publication date: 2014-01-20 Release date: 2013-12-20
  • Corresponding author: Xiangmei Jiang, PhD, Research Professor. Research interest: forest tree genetics and breeding. E-mail: zjiang2013@126.com
  • Supported by:

    The "Gan-Po Talents 555 Project" Leading Talent Training Program project "Whole-genome research on camphor tree" and the Natural Science Foundation of Jiangxi Province (No. 20122BAB214029)

Abstract:

Camphor tree (Cinnamomum camphora) is a representative species of the Lauraceae, used for timber, medicine, spices, oil, and ecological and environmental construction. Its leaf essential oil is rich in highly valuable terpenoids, including camphor, linalool, 1,8-cineole, iso-nerolidol, and (+)-borneol. Based on the main components of the leaf essential oil and their content, camphor trees can be divided into five chemical types: camphor, linalool, cineole, iso-nerolidol, and borneol. In this study, the leaf transcriptomes of the five chemical types were sequenced on the Illumina HiSeq™ 2000 platform. All Unigenes were classified against the GO (Gene Ontology), COG (Clusters of Orthologous Groups), and KEGG (Kyoto Encyclopedia of Genes and Genomes) databases to obtain functional and pathway annotations, and their coding sequences (CDS) were predicted. De novo assembly yielded 156 278 Unigenes with an average length of 584 bp and an N50 of 1 023 bp (the N50 is the Unigene length at which Unigenes of that length or longer account for half of the assembled nucleotides). BLAST searches against nucleotide and protein databases annotated 55 955 Unigenes (35.80% of all Unigenes), of which 24 717 received GO annotations and 21 806 received COG annotations. KEGG pathway analysis assigned 3 350 Unigenes (10.19%) to the biosynthesis of secondary metabolites, of which 424 were involved in monoterpenoid, diterpenoid, sesquiterpenoid, and terpenoid backbone biosynthesis. In the monoterpenoid biosynthesis pathway, 9 Unigenes likely encode (+)-linalool synthase; expression analysis showed that these Unigenes were predominantly expressed in the linalool type and expressed at low levels in the cineole type. These annotations provide basic data and an important basis for discovering functional genes and related candidate genes in C. camphora.

Key words: Cinnamomum camphora, RNA-Seq, gene annotation, function classification, CDS prediction
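
The abstract quotes two summary statistics: the N50 of the assembly and the share of Unigenes that were annotated. As a minimal sketch of how such figures are typically computed (not the authors' pipeline), the Python below implements the N50 definition given in the abstract and rechecks the 35.80% annotation rate from the two reported counts. The function names `n50` and `annotation_rate` and the toy length list are assumptions made for illustration; the toy lengths are invented and are not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp).

    Sequences are sorted from longest to shortest; the N50 is the length
    of the sequence at which the running total first reaches half of the
    summed length of all sequences.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


def annotation_rate(annotated, total):
    """Percentage of sequences that received at least one annotation."""
    return 100 * annotated / total


# Toy lengths, invented for illustration only (not data from the study).
toy_lengths = [200, 300, 400, 500, 1000, 1600]
print(n50(toy_lengths))  # 1600 + 1000 = 2600 >= 4000 / 2, so N50 is 1000

# Counts taken from the abstract: 55 955 annotated out of 156 278 Unigenes.
print(f"{annotation_rate(55955, 156278):.2f}%")  # 35.80%
```

An N50 (1 023 bp) well above the mean length (584 bp) is expected for a de novo transcriptome assembly, because many short Unigenes lower the mean while contributing little to the total assembled length.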