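【Objective】 To identify the flax cellulose synthase superfamily genes (CesA/Csls) at the whole-genome level and to analyze their evolution, gene structure and tissue expression characteristics, laying a foundation for studying the mechanism of flax fiber development. 【Method】 Members of the flax cellulose synthase gene superfamily were identified from the Phytozome genome database by bioinformatics methods, and the physicochemical properties of the encoded proteins were analyzed. A phylogenetic tree was constructed, and gene structure and conserved protein motifs were analyzed, using MEGA 5.0, GSDS, MEME and related software. Expression of the CesA/Csls genes was analyzed from RNA-Seq data. 【Result】 A total of 45 flax CesA/Csls superfamily genes were identified. The genes were dispersed across the scaffolds, with no obvious clustering. The CesA/Csls proteins were mainly localized to the plasma membrane; they contained 409 to 1 167 amino acids, with molecular weights of 47 401.1 to 130 578.3 and isoelectric points of 5.43 to 9.08. All CesA/Csl proteins contained transmembrane domains, numbering 2 to 8. Phylogenetic analysis divided the family into two classes, CesA and Csl, which were subdivided into seven groups: CesA, CslA, CslB, CslC, CslD, CslE and CslG. Gene structure analysis showed that the flax CesA/Csls genes were 2.1 to 6.8 kb long and contained 2 to 14 exons. Conserved motif analysis showed that motif composition differed to some extent between groups: Motif 1, Motif 2, Motif 3, Motif 4 and Motif 12 were present in proteins of the CesA, CslB, CslD, CslE and CslG groups; Motif 18 and Motif 20 were present in proteins of the CslA and CslC groups; and the distribution of Motif 13, Motif 14, Motif 15 and Motif 19 showed some group specificity. Expression profiling showed that CesA/Csls family members had different expression patterns at different developmental stages, and that the expression of some CesA/Csls genes was up-regulated or down-regulated by NaCl, BR and Brz, suggesting that CesA/Csls genes are functionally diverse and play different roles in plant development. 【Conclusion】 Forty-five flax CesA/Csls family genes were identified. They fall into two classes and seven groups and are distributed across the scaffolds. Gene structure and protein motifs are diverse between groups and conserved within groups. Different genes show some spatiotemporal specificity at different developmental stages. Some CesA/Csls genes respond to the hormone BR, to Brz and to NaCl stress.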
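The physicochemical characterization summarized above (amino acid number and molecular weight) can be illustrated with a minimal sketch. It is not the tool used in the study, which the abstract does not name. It assumes average residue masses in daltons and a plain one-letter protein sequence; the function names `molecular_weight` and `within_reported_range` are hypothetical, and the default range simply restates the values reported in the abstract, assuming they are in daltons.

```python
# Average residue masses in daltons (amino acid mass minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01524  # one water is added back for the free chain termini


def molecular_weight(sequence: str) -> float:
    """Estimate the average molecular weight of a protein sequence."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    try:
        residue_total = sum(AVERAGE_RESIDUE_MASS[aa] for aa in seq)
    except KeyError as exc:
        raise ValueError(f"unsupported residue: {exc.args[0]}") from None
    return residue_total + WATER_MASS


def within_reported_range(mw: float, low: float = 47401.1,
                          high: float = 130578.3) -> bool:
    """Check a weight against the range reported in the abstract (assumed Da)."""
    return low <= mw <= high


if __name__ == "__main__":
    # Toy peptide, not a real CesA/Csl sequence.
    print(round(molecular_weight("MGASK"), 2))
```

A sequence length check against the reported 409 to 1 167 residues could be added in the same way with `len(seq)`.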
【Objective】 The CesA/Csl genes were identified in flax at the whole-genome level, and their phylogeny, gene structure and tissue expression patterns were analyzed to provide a theoretical basis for studying the mechanism of flax fiber development. 【Method】 Based on the flax genome database and bioinformatics methods, CesA/Csl genes were identified and the physicochemical characteristics of the encoded proteins were analyzed. The phylogenetic tree was constructed with MEGA 5.0. Gene structure and conserved motifs were analyzed with GSDS and MEME. Finally, the expression of CesA/Csl genes was analyzed using RNA-seq data. 【Result】 A total of 45 CesA/Csl genes were systematically identified in flax. The genes were dispersed across the scaffolds and were not clustered. The CesA/Csl proteins were mainly located on the plasma membrane. The proteins ranged from 409 to 1 167 amino acids in length, and their isoelectric points ranged from 5.43 to 9.08. All 45 CesA/Csl proteins possessed transmembrane domains, numbering from 2 to 8. The genes were classified into two classes (CesA and Csl) and seven groups (CesA, CslA, CslB, CslC, CslD, CslE, CslG) according to their phylogenetic relationships. Gene structure prediction indicated that the CesA/Csl genes ranged from 2.1 to 6.8 kb in size and that most of them consisted of 2 to 14 exons. Gene structure was conserved within a group. Obvious differences in motif composition were observed between groups. Motif 1 to motif 4 and motif 12 were observed in most proteins of the CesA, CslB, CslD, CslE and CslG groups, and motif 18 and motif 20 were observed in most proteins of the CslA and CslC groups. Motif 13, motif 14, motif 15 and motif 19 were group-specific motifs observed in different groups. Furthermore, some CesA/Csl genes could be upregulated or downregulated by BR, Brz and NaCl stress. 
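To make the notion of up-regulated and down-regulated expression concrete, the following minimal sketch classifies a gene by its log2 fold change between a control and a treated sample. It does not reproduce the study's RNA-seq pipeline, which the abstract does not describe. The pseudocount, the threshold of 1.0 (a two-fold change), the function names, and the gene labels and expression values in the example are all assumptions chosen for illustration.

```python
import math


def log2_fold_change(control: float, treated: float,
                     pseudocount: float = 1.0) -> float:
    """Log2 ratio of treated to control expression.

    The pseudocount (an assumption) avoids division by zero for unexpressed genes.
    """
    return math.log2((treated + pseudocount) / (control + pseudocount))


def classify(control: float, treated: float, threshold: float = 1.0) -> str:
    """Label a gene 'up', 'down' or 'unchanged' using an assumed two-fold cutoff."""
    lfc = log2_fold_change(control, treated)
    if lfc >= threshold:
        return "up"
    if lfc <= -threshold:
        return "down"
    return "unchanged"


if __name__ == "__main__":
    # Hypothetical expression values (control, treated); not data from the study.
    example = {
        "gene_1": (10.0, 45.0),
        "gene_2": (40.0, 8.0),
        "gene_3": (20.0, 22.0),
    }
    for gene, (control, treated) in example.items():
        print(gene, classify(control, treated))
```

A real analysis would normally also require replicate-based statistical testing before calling a gene responsive to a treatment; the cutoff here only illustrates the direction of change.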
The digital gene expression profiles showed that CesA/Csl genes were expressed differently at different developmental stages. It indicate