Objective To screen and identify novel long non-coding RNAs (LncRNAs) in the transcriptome of patients with laryngeal squamous cell carcinoma using strand-specific transcriptome sequencing (RNA-Seq) and bioinformatics tools, and to analyze their differential expression, providing a basis for further elucidating the molecular mechanisms underlying this cancer. Methods RNA was extracted from the tumor tissues of 10 patients with laryngeal squamous cell carcinoma, and strand-specific transcriptome libraries were constructed and sequenced on a high-throughput sequencing platform. After low-quality data were filtered out, the high-quality reads were mapped to the reference genome and assembled. The resulting transcripts were classified and annotated, and an optimized LncRNA identification pipeline was used to identify novel LncRNAs in these transcriptomes and to analyze their characteristics. Results An optimized pipeline was established. Across the 10 laryngeal squamous cell carcinoma transcriptomes, 134 novel LncRNA transcripts that are not included in current LncRNA databases were found, and their length distribution, open reading frame (ORF) length, and differential expression were characterized. Conclusion Novel, differentially expressed LncRNAs were predicted from the laryngeal squamous cell carcinoma transcriptome by high-throughput sequencing. The approach may provide a useful pipeline for the bioinformatic analysis of LncRNAs in laryngeal squamous cell carcinoma, and the identified LncRNAs may play an important role in this cancer.
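The abstract does not give the details of the LncRNA identification pipeline, so the following Python sketch only illustrates the kind of filter such a pipeline typically applies to assembled transcripts. It is not the authors' method. The function names, the 300 nt ORF cutoff, and the `known_ids` set are hypothetical; the 200 nt minimum length reflects the conventional definition of a long non-coding RNA. Because the libraries are strand-specific, the sketch scans only the strand the transcript was assigned to.

```python
START = "ATG"
STOPS = {"TAA", "TAG", "TGA"}


def longest_orf_length(seq: str) -> int:
    """Return the length in nt of the longest ATG-to-stop ORF on the given
    strand (stop codon included), or 0 if no complete ORF is present."""
    seq = seq.upper().replace("U", "T")
    best = 0
    for frame in range(3):
        start = None  # position of the first ATG of the currently open ORF
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == START:
                start = i
            elif start is not None and codon in STOPS:
                best = max(best, i + 3 - start)
                start = None
    return best


def is_candidate_lncrna(transcript_id: str, seq: str, known_ids: set,
                        min_length: int = 200, max_orf: int = 300) -> bool:
    """Hypothetical filter: keep transcripts that are absent from the known
    annotation, at least min_length nt long, and lack a long ORF.
    The cutoffs are illustrative assumptions, not values from the paper."""
    if transcript_id in known_ids:
        return False
    if len(seq) < min_length:
        return False
    return longest_orf_length(seq) <= max_orf


# Toy input: a 309 nt transcript with one short ORF, and an already annotated one.
transcripts = {
    "novel_1": "ATGAAATAG" + "C" * 300,
    "known_1": "ATGAAATAG" + "C" * 300,
}
known = {"known_1"}
candidates = [tid for tid, s in transcripts.items()
              if is_candidate_lncrna(tid, s, known)]
```

In practice, an ORF-length cutoff like this is usually combined with a dedicated coding-potential assessment; the sketch covers only the length and ORF criteria mentioned in the abstract.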