[关键词]
[摘要]
目的 基于转录组数据筛选鉴定分析连翘Forsythia suspensa FsUGTs家族。方法 基于转录组与基因组鉴定连翘FsUGTs基因家族,运用生物信息学技术对其展开综合分析,深入探究其蛋白的理化特性、亚细胞定位。通过系统发育树构建、染色体定位、顺式作用元件分析、保守基序、结构域和基因结构分析,进一步揭示这些基因家族成员间的进化关系。此外通过RT-qPCR检测FsUGTs在不同组织(根、茎、叶、果)的表达状况,并将其与连翘苷的组织分布结合进行相关性分析。结果 在连翘的转录组数据中经过筛选、鉴定,最终得到82个FsUGTs,其氨基酸长度为385~558 aa,相对分子质量在43 140~60 750,理论等电点范围为4.81~8.26,主要定位在质膜上,均含有PSPG box结构域。系统发育分析将82个FsUGTs分为17组,其中L组的UGT最多,有16条,其次是A组和E组,分别为10条和9条,Q组没有FsUGTs分布。染色体定位结果显示16个FsUGTs定位于Chr10,而Chr11、Chr12、Chr13上各有7个FsUGTs,Chr3上的FsUGTs数量最少,只有2个。顺式作用元件分析表明FsUGTs的表达受光的影响最大,茉莉酸甲酯的影响次之。结构域分析显示82个FsUGTs中都有3个保守的结构域,分别为GT1_Gtf-like、PLN02448和Glycosyltransferase_GTB-type superfamily。组织差异性表达与连翘苷在不同组织中含量分布的相关性分析结果显示FsUGT1~FsUGT3、FsUGT9、FsUGT12~FsUGT16、FsUGT18、FsUGT21~FsUGT24和FsUGT31可能在连翘脂素糖基化形成连翘苷的过程中发挥作用。结论 鉴定并分析了连翘FsUGTs家族的理化性质、顺式作用元件、蛋白结构、系统发育关系、组织差异性表达等,对可能参与到连翘脂素糖基化形成连翘苷的FsUGTs进行了筛选,为后续的深入寻找连翘脂素糖基转移酶及理解连翘药用成分合成的调控机制奠定了基础。
[Key word]
[Abstract]
Objective To screen, identify, and analyze the FsUGTs family from Forsythia suspensa based on transcriptomic data. Methods The FsUGTs gene family of F. suspensa was identified from transcriptomic and genomic data. Bioinformatics techniques were applied for comprehensive analysis to explore the physicochemical properties and subcellular localization of the proteins encoded by these genes. The evolutionary relationships among these gene family members were further revealed through phylogenetic tree construction, chromosomal localization, cis-acting element analysis, conserved motif analysis, domain analysis, and gene structure analysis. In addition, RT-qPCR was used to detect the expression patterns of FsUGTs in different tissues (roots, stems, leaves, and fruits), and correlation analysis was performed by integrating these results with the tissue distribution of phillyrin. Results After screening and identification in the F. suspensa transcriptome data, a total of 82 FsUGTs were finally obtained. Their amino acid lengths ranged from 385 to 558 amino acids (aa), relative molecular masses from 43 140—60 750, and theoretical isoelectric points (pI) from 4.81 to 8.26. These proteins are primarily localized to the plasma membrane and all contain the PSPG box domain. Phylogenetic analysis classified the 82 FsUGTsinto 17 groups: Group L contained the highest number of UGTs (16 members), followed by Groups A and E with ten and nine respectively, while no FsUGTs were distributed in Group Q. Chromosomal localization results showed that 16 FsUGTs were located on Chr10, with seven each on Chr11, Chr12, and Chr13, while Chr3 contained the fewest FsUGTs (only two). Analysis of cis-acting elements indicated that the expression of FsUGTs was most strongly influenced by light, followed by methyl jasmonate. Domain analysis revealed that all 82 FsUGTs contained three conserved domains, namely GT1_Gtf-like, PLN02448, and Glycosyltransferase_GTB-type superfamily. Correlation analysis between tissue-specific expression of FsUGTs and phillyrin content in different tissues indicated that FsUGT1-FsUGT3, FsUGT9, FsUGT12-FsUGT16, FsUGT18, FsUGT21-FsUGT24, and FsUGT31 may play roles in the glycosylation of phillygenin to form phillyrin. Conclusion This study identified and analyzed the physicochemical properties, cis-acting elements, protein structures, phylogenetic relationships, and tissue-specific expression of the FsUGTs family from F. suspensa. It also screened FsUGTs that may be involved in the glycosylation of phillygenin to phillyrin, laying a foundation for the further identification of phillygenin glycosyltransferases and the understanding of the regulatory mechanisms underlying the synthesis of medicinal components in F. suspensa.
[中图分类号]
R282.12
[基金项目]
中央本级重大增减支项目“名贵中药资源可持续利用能力建设项目”(2060302);河南省中央引导地方科技发展资金项目(Z20241471030);河南省科技攻关项目(242102110325);河南省中药材产业科技特派员服务团项目