Because the information that determines protein foldability and thermostability is correlated over long ranges in the sequence (Hp coupling), a modified statistical coupling analysis (SCA) method was used to analyze the coupling information contained in a multiple sequence alignment of SH3 domains. The results show that the conservation statistical energy used by SCA evaluates site conservation in the SH3 sequence set well, and that sites with a high average coupling energy largely correspond to structurally or functionally important positions. Case-by-case analysis of perturbations at several sites revealed both nonlocal (non-adjacent) and local (adjacent) perturbation modes in the SH3 structure. Combining SCA with clustering-based reordering distinguishes structural-core sites from non-core sites in the SH3 fold family, and even resolves detailed differences among several function-related sites. The SH3 domain contains a series of perturbation modes, each involving a different combination of sites; the modes influence one another through shared perturbing sites and shared responding sites, and together determine the coupling response pattern of every site in the structure. This coupling information is potentially valuable for understanding the relationship between protein sequence, structure, and function, and for designing new protein sequences.
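The abstract does not give the formulas of the modified SCA method, so the following is only a minimal sketch of the general idea behind a conservation energy and a perturbation-based coupling energy. It is a simplified, frequency-based approximation and not the authors' implementation: the uniform background, the smoothing weight `lam`, the toy alignment, and all function names are assumptions made for illustration, and energies are in arbitrary units.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
BACKGROUND = 1.0 / len(AMINO_ACIDS)  # assumption: uniform background frequency


def log_ratios(seqs, pos, lam=0.05):
    """Log ratio of smoothed column frequency to background, per amino acid.

    Observed frequencies are mixed with the background using weight `lam`
    (an illustrative choice) so that absent residues never give log(0).
    """
    counts = Counter(s[pos] for s in seqs if s[pos] in AMINO_ACIDS)
    n = sum(counts.values())
    if n == 0:
        raise ValueError("no residues at this position")
    ratios = []
    for aa in AMINO_ACIDS:
        freq = (1.0 - lam) * counts.get(aa, 0) / n + lam * BACKGROUND
        ratios.append(math.log(freq / BACKGROUND))
    return ratios


def conservation_energy(seqs, pos):
    """Norm of the log-ratio vector: larger means further from background."""
    return math.sqrt(sum(r * r for r in log_ratios(seqs, pos)))


def coupling_energy(seqs, i, j, residue):
    """Change at site i when the alignment is restricted (perturbed) to
    sequences that carry `residue` at site j."""
    sub = [s for s in seqs if s[j] == residue]
    if not sub:
        raise ValueError("perturbation selects no sequences")
    full = log_ratios(seqs, i)
    pert = log_ratios(sub, i)
    return math.sqrt(sum((p - f) ** 2 for p, f in zip(pert, full)))


# Hypothetical toy alignment: column 0 is invariant, columns 2 and 3 co-vary.
ALIGNMENT = ["ALYDY", "ALYDF", "ALFEY", "AVFEW", "AVYDY", "AVFEF"]

conservation = [conservation_energy(ALIGNMENT, p) for p in range(5)]
# Perturb site 3 (keep only sequences with 'D') and record every site's response.
response = [coupling_energy(ALIGNMENT, i, 3, "D") for i in range(5)]
```

In this toy case the invariant column does not respond to the perturbation at all, while the co-varying column responds most strongly. A real analysis would need a large alignment and a statistically grounded probability model, and averaging such responses over many perturbations would give a per-site average coupling energy of the kind the abstract refers to.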