研究了蛋白质分子的结构,从RCSB公共数据库中收集蛋白质PDB数据文件,利用统计分析和数据挖掘知识,建立了以蛋白质形心为原点的蛋白质原子空间坐标系,从蛋白质的数字特征入手,讨论了五类蛋白质(肌蛋白、血蛋白、激素、抗体、生物膜)的结构特性及数字特征的分布,其中激素分子相对其他几类蛋白质较小,原子的分布也相对集中;并讨论了20种残基的结构特性,构造出蛋白质数字特征能量函数,其结论有助于蛋白质生物功能开发和蛋白质设计研究.
The article mainly deals with the structure of protein molecules.With the PDB files collected from the RCSB public database and the knowledge of statistical analysis and data mining.We build a spatial coordinate system with the geometrical center of a specific protein molecule being the origin.After discussing the features of five kinds of protein(muscle protein,blood protein,hormones,antibodies,biomembrane),we study the structural characteristics as well as the distribution of the features respectively.Consequently,in terms of our analytical system,hormone molecules are relatively smaller and the distributions of their atoms are more concentrated compared with others.Meanwhile,the detailed discussion on the structural characteristics of 20 kinds of the amino acid residues is conducted.Furthermore,we develop energy function based on the features of these residues,which will contribute to the development of protein biological function as well as the design research.