Predicting Hydrolase Subfamilies Using Pseudo Amino Acid Composition

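ISSN: 1000-5013
Journal: Journal of Huaqiao University (Natural Science) (华侨大学学报(自然科学版))
Date: 0
Pages: 317-321
Language: Chinese
Classification: Q556.03 [Biology: Biochemistry]
Author affiliation: [1] Institute of Industrial Biotechnology, Huaqiao University, Quanzhou, Fujian 362021
Funding: National Natural Science Foundation of China (30770059); Doctoral Program Research Fund of the Ministry of Education (20070685001)
Related project: Synthetic biology research on cell concentration control elements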

Keywords: hydrolase subfamily, feature extraction, pseudo amino acid composition, k-nearest neighbor

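Abstract (translated from Chinese):

Pseudo amino acid composition was used to extract feature values from protein sequences, and the effects of the parameters λ and w on recognition performance were examined. A k-nearest neighbor classifier served as the base classifier for predicting hydrolase subfamily types. The results show that feature extraction by pseudo amino acid composition gives substantially higher recognition accuracy than the plain 20-amino-acid composition: the average prediction accuracy with the 20-AA composition was 72.3%, whereas pseudo amino acid composition reached 82.7%. Regarding the parameters, the number of autocorrelation functions (λ) had a large effect on recognition performance, while the weight factor w had very little effect.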
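The feature extraction summarized in this abstract can be illustrated with a minimal sketch. The abstract does not say which physicochemical properties were used, so the code below assumes a single property, the Kyte-Doolittle hydropathy scale, standardized over the 20 amino acids; the commonly cited formulation of pseudo amino acid composition averages several properties instead. The function name `pse_aac` and the default values of `lam` and `w` are hypothetical, chosen only for illustration.

```python
from statistics import mean, pstdev

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy index (assumed property; not stated in the abstract).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Standardize the property to zero mean and unit variance over the 20 residues.
_values = list(KD.values())
_mu, _sd = mean(_values), pstdev(_values)
H = {aa: (v - _mu) / _sd for aa, v in KD.items()}


def pse_aac(seq, lam=3, w=0.05):
    """Return a (20 + lam)-dimensional pseudo amino acid composition vector.

    lam: number of sequence-order correlation factors (the parameter λ).
    w:   weight factor applied to the correlation factors.
    """
    residues = [aa for aa in seq.upper() if aa in H]  # skip non-standard symbols
    n = len(residues)
    if n <= lam:
        raise ValueError("sequence must be longer than lam")

    # Plain 20-amino-acid composition.
    freqs = [residues.count(aa) / n for aa in AMINO_ACIDS]

    # k-th tier correlation factor: mean squared property difference
    # between residues that are k positions apart.
    thetas = [
        mean([(H[residues[i]] - H[residues[i + k]]) ** 2 for i in range(n - k)])
        for k in range(1, lam + 1)
    ]

    denom = sum(freqs) + w * sum(thetas)
    return [f / denom for f in freqs] + [w * t / denom for t in thetas]
```

With `w = 0` the last `lam` components vanish and the vector reduces to the plain 20-amino-acid composition, which is the baseline the abstract compares against.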

English abstract:

Predicting hydrolase subfamilies is of great importance for designing a fast and reliable classification system. In this paper, the pseudo amino acid composition method was used to extract features from protein sequences, and the k-nearest neighbor algorithm was used as the classifier to predict the hydrolase subfamily. The influences of λ and ω on prediction accuracy were also studied. The results showed that the prediction accuracy of pseudo amino acid composition was much higher (by about 10.4 percentage points) than that of amino acid composition: the prediction accuracy of amino acid composition was 72.3%, while that of pseudo amino acid composition was 82.7%. The parameter λ had more influence on prediction accuracy than ω.
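The classification step can be sketched as a plain k-nearest neighbor vote over feature vectors. The abstract does not state the distance metric or the value of k, so Euclidean distance and the default `k=1` below are assumptions, and the function name `knn_predict` and the toy labels in the usage note are hypothetical.

```python
import math
from collections import Counter


def knn_predict(train_X, train_y, x, k=1):
    """Predict the label of x by majority vote among its k nearest neighbors.

    train_X: list of feature vectors (for example, composition vectors).
    train_y: list of subfamily labels, one per training vector.
    Distance is Euclidean (an assumption; the abstract does not specify it).
    """
    if k < 1 or k > len(train_X):
        raise ValueError("k must be between 1 and the number of training samples")

    # Sort training samples by distance to the query vector.
    neighbors = sorted(
        ((math.dist(v, x), label) for v, label in zip(train_X, train_y)),
        key=lambda pair: pair[0],
    )

    # Majority vote over the k closest labels.
    votes = Counter(label for _, label in neighbors[:k])
    return votes.most_common(1)[0][0]
```

For example, with training vectors `[[0, 0], [0, 1], [5, 5], [6, 5]]` labeled `["a", "a", "b", "b"]`, the query `[0.2, 0.1]` with `k=3` has two "a" neighbors and one "b" neighbor, so the vote goes to "a".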
