对实验确定的168条σ54启动子序列进行保守性分析,获得两个保守的区域-24区域和-12区域,均为最保守的功能元件。选取保守性最大的17个保守位点的三联体频数作为参数,引入伪计数构建位置权重矩阵,对168条σ54启动子进行预测,分别从编码区和汇聚非编码区共选取168条序列组成阴性集。使用Jackknife交叉验证法对模型进行检验,整体准确度达到82.0%,为σ54启动子的理论和实验研究提供新信息。
By analyzing the 168 experimental-confirmed σ54 promoter sequences, two conservative regions that are -24 and-12 regions are obtained. The trimer frequency at 17 positions in these conservative regions is selected as inputting parameter. By adding pseudo-count into position weight matrix, the σ54 promoter can be predicted. The 168 negative sequences are extracted from coding regions and convergent intergenic regions. In Jackknife cross-validation, the overall accuracy reaches to 82.0%, suggesting that the model can be further used in the theoretical and experimental study ofσ54 promoter.