[关键词]
[摘要]
目的 获得刺五加Eleutherococcus senticosus转录组数据库和差异表达基因。方法 采用皂苷高含量组和低含量组2个样本作为受试材料,采用二代测序方法中的Illumina HiSeq 4000进行转录组测序,并进行系统的生物信息学分析。结果 共获得8.34 Gb数据,拼接得到77 087条Unigenes,与5个基因数据库进行比对,可归类于55个Geneontology(GO)分类中,涉及到116个KEGG标准代谢通路。通过差异性分析发现,差异性表达基因共530条,其中上调基因占42.08%,下调基因占57.92%,相差较大。进行GO和Pathway富集,得到408个GO注释和40个代谢通路。结论 对刺五加转录组进行拼接、组装和功能注释,得到大量转录本信息,为刺五加分子生物学研究提供了宝贵的基因组数据库资源。
[Keywords]
[Abstract]
Objective To obtain a transcriptome database and differentially expressed genes of Eleutherococcus senticosus. Methods Two samples, a high-saponin-content group and a low-saponin-content group, were used as experimental materials. Their transcriptomes were sequenced on the Illumina HiSeq 4000 platform (second-generation, high-throughput sequencing), and the results were analyzed systematically using bioinformatics methods. Results A total of 8.34 Gb of data was obtained and assembled into 77 087 unigenes, which were aligned against five gene databases. The unigenes were classified into 55 Gene Ontology (GO) categories and mapped to 116 KEGG standard metabolic pathways. Differential expression analysis of the two samples identified 530 differentially expressed genes, of which 42.08% were up-regulated and 57.92% were down-regulated, a considerable difference. GO and pathway enrichment analysis yielded 408 GO annotations and 40 metabolic pathways. Conclusion Assembly and functional annotation of the E. senticosus transcriptome yielded abundant transcript information, providing a valuable genomic data resource for molecular biology research on E. senticosus.
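[CLC number]
[Funding]
National Natural Science Foundation of China (31570683); Scientific Research Project funded by the Education Department of Hebei Province (QN2014102); Cultivation Fund of North China University of Science and Technology (SP201508)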