虎尾
轮转
录组测序
生物
信息学
分析
郑晓艳
分子植物育种,2023 年,第 21 卷,第 14 期,第 4618-4626 页Molecular Plant Breeding,2023,Vol.21,No.14,4618-4626研究报告Research Report虎尾轮转录组测序及生物信息学分析郑晓艳1陈育青1黄建军1郭月丽2*1 漳州卫生职业学院药学院,漳州,363000;2 漳州卫生职业学院医学技术学院,漳州,363000*通信作者,摘要虎尾轮(Uraria crinita(L.)Desv.ex DC)为闽南民间常用保肝护肝中草药,是一种极有使用、研究、开发价值的天然药用植物资源,但对其分子机制研究严重滞后,在一定程度上限制了该民间资源的利用和开发。本实验通过 Illumina Hiseq 对虎尾轮的根和茎转录组测序,采用 Trinity 软件完成虎尾轮高质量数据转录本拼装,而后进行生物信息学分析。研究结果表明:总共获得的高质量 unigene 有 255 407 896 个,N50 的长度是 992 bp,其平均长度是 559.07 bp;将虎尾轮的 122 207 条 unigene 与 NR、GO、KEGG 等 7 个重要的数据库系统进行比对,获得了虎尾轮基因功能的注释信息,结果表明,有 75 530 条 unigene 至少注释到 1 个数据库,占 61.8%;虎尾轮的根对茎的差异表达 unigene 有 19 714 个,其中有 9 038 个上调表达,10 676 个下调表达。在对虎尾轮根与茎差异基因分析中,利用 KEGG 数据库代谢途径对 132 个显著性 KO 途径分析,发现共挖掘出 114 个 unigene 与虎尾轮黄酮类生物合成途径密切相关。提取虎尾轮转录组中与黄酮类生物合成密切相关基因的 TPM 值,60.71%的基因在根中的 TPM 值高于茎中的 TPM 植,根中的表达远高于茎的表达,符合民间习用其根的用法。以上结果为今后进一步挖掘虎尾轮药用活性成分的功能基因、生物合成调控、品质改良等提供参考依据。关键词虎尾轮(Uraria crinita(L.)Desv.ex DC);转录组测序;差异表达;黄酮;生物信息学Sequencing and Bioinformatics Analysis of Uraria crinita(L.)Desv.ex DCZheng Xiaoyan1Chen Yuqing1Huang Jianjun1Guo Yueli2*1 College of Pharmacy,Zhangzhou Health Vocational College,Zhangzhou,363000;2 College of Medical Technology,Zhangzhou Health VocationalCollege,Zhangzhou,363000*Corresponding author,DOI:10.13271/j.mpb.021.004618AbstractUraria crinita(L.)Desv.ex DC is a commonly used Chinese herbal medicine for protecting the liverand protecting the liver in southern Fujian.It is a natural medicinal plant resource with great use,research anddevelopment value.However,the research on its molecular mechanism is seriously lagging behind,which limitsthe utilization and development of this folk resource to a certain extent.In this experiment,Illumina Hiseq was usedto sequence the root and stem transcripts of Uraria crinita(L.)Desv.ex DC,complete the assembly of high-qualitydata transcripts of Uraria crinita(L.)Desv.ex DC with Trinity software,and then bioinformatics analysis wascarried out.The research results showed that 255 407 896 high-quality unigenes were obtained,with an N50 lengthof 992 bp and an average length of 559.07 bp.122 207 unigenes of Uraria crinita(L.)Desv.ex DC were comparedwith NR,GO,KEGG,etc,and their function annotation information was obtained.The results showed that 75 530unigenes were annotated to at least one database,accounting for 61.8%.There were 19 714 root to stem differentialexpressions of unigene in Uraria crinita(L.)Desv.ex DC,of which 9 038 were up-regulated and 10 676 were基金项目:本研究由国家中医药管理局全国中药特色技术传承人才培训项目(国中医药办人教函 2019 43 号)资助引用格式:Zheng X.Y.,Chen Y.Q.,Huang J.J.,and Guo Y.L.,2023,Sequencing and bioinformatics analysis of Uraria crinita(L.)Desv.ex DC,Fenzi Zhiwu Yuzhong(Molecular Plant Breeding),21(14):4618-4626.(郑晓艳,陈育青,黄建军,郭月丽,2023,虎尾轮转录组测序及生物信息学分析,分子植物育种,21(14):4618-4626.)down-regulated.In the analysis of different genes in the root and stem of Uraria crinita(L.)Desv.ex DC,throughthe analysis of metabolic pathway in KEGG database and the further analysis of 132 significant KO pathways,itwas found that a total of 114 unigenes were closely related to the flavonoid biosynthesis pathway of Uraria crinita(L.)Desv.ex DC.The TPM value of the genes closely related to flavonoids biosynthesis in the transcription groupof Uraria crinita(L.)Desv.ex DC was extracted.The TPM value of 60.71%of the genes in the root was higher thanthat in the stem,and the expression in the root was much higher than that in the stem,which was consistent withthe folk usage of its root.The above results provide a reference for further exploring the functional genes,biosyn-thesis regulation and quality improvement of the active ingredients of Uraria crinita(L.)Desv.ex DC in the future.KeywordsUraria crinita(L.)Desv.ex DC;Transcriptome sequencing;Differential expression;Flavone;Bioin-formatics虎尾轮民间常用其根,是豆科(Leguminosae sp.)兔尾草属(Uraria picta(Jacq.)Desv.)植物:猫尾射(Uraria crinita(L.)Desv.ex DC)的根,该植物普遍生长在中国云南、福建、两广等地。性味温、甘,入肾、肺、胃经,在民间自古被广为使用,强壮腰骨、补肾、祛湿化痰、理气的效果显著。泉州本草 载有“益肾滋肝,治肾虚遗精”(李南,2021)。虎尾轮药效主要化学成分包括:黄酮、多糖、萜类、香豆素等(陈素慧等,2011)。研究人员证实,它有抑制小鼠的肾阳虚、预防其生殖器官衰退、补充阳气抵御疲劳、提高免疫功能、增强实验鼠应激能力的功效(王燕燕等,2011)。高通量测序技术也被称为下一代测序技术(next-generation sequencing technology,NGS),是一种可以一次性测序数十万到数百万个核酸分子的技术(许云霄,2019),与传统的第一代测序技术桑格测序(Sanger sequencing)不同,该技术通过随机片段化DNA,并添加接头,制备测序文库,通过延伸文库中数万个克隆检测相应信号,最终获得序列信息(李炳堂等,2019),为研究物种的生物生化信息提供技术手段。其近年来的发展极大推动了药用植物转录组学研究,成为挖掘药用植物的功能基因、探索药用植物有效成分的生物合成途径与调控机制、阐明基因功能的重要手段,诸多中药如人参、丹参、石斛等的转录组相继得到解析。目前对虎尾轮的实验研究主要集中于有效成分提取分离、含量测定、动物药理实验等方面,虎尾轮的药用价值虽已被民间普遍使用验证,其待开发利用的经济价值亦很高,但其重要基因信息和分子机制研究仍然相当匮乏。本实验选取虎尾轮的根和茎为研究对象,进行 Illumina Hiseq 转录组测序,而后利用所得数据采用生物信息学方法分析其转录组的特征,并探究虎尾轮黄酮关键的基因表达特性,为虎尾轮药用功能活性组分的基因表达、生物合成调控、改良品种品质等提供理论参考(邹毅辉等,2018)。1结果与分析1.1虎尾轮测序数据质量评估及质控和de novo组装对虎尾轮根和茎样本进行转录组测序,每个样本重复 3 次,共生成 255 407 896 个高质量 Reads,Q20碱基比为 98.38%98.69%,Q30 碱基比为 94.59%95.47%,GC 碱基含量为 46.81%50.79%(表 1)。该数据表明 Illumina HiseqTM对虎尾轮的转录组测序所得到的数据质量高,可进行后续组装处理。1.2虎尾轮转录本拼装因为虎尾轮无参考基因组,选择使用 Trinity 软件将高质量数据进行 de novo 组装,共得到 190 686 条Transcripts,122 207 条 unigene,其中 N50 分别为1 498 bp 和 992 bp,组装完整性较好。长度分布在200300 nt 的 unigene 最多,有 69 888 条,占 unigene总