位置:成果数据库 > 期刊 > 期刊详情页
A Feature Selection Method for Prediction Essential Protein
  • ISSN号:1008-5599
  • 期刊名称:《电信工程技术与标准化》
  • 时间:0
  • 分类:TP391.41[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术] Q51[生物学—生物化学]
  • 作者机构:the School of Information Science and Engineering,Central South University, the College of Polytechnic,Hunan Normal University, the Computer Center,Kunming University of Science and Technology
  • 相关基金:supported by the National Natural Science Foundation of China(Nos.61232001,61502166,61502214,61379108,and 61370024); Scientific Research Fund of Hunan Provincial Education Department(Nos.15CY007 and 10A076)
中文摘要:

Essential proteins are vital to the survival of a cell. There are various features related to the essentiality of proteins, such as biological and topological features. Many computational methods have been developed to identify essential proteins by using these features. However, it is still a big challenge to design an effective method that is able to select suitable features and integrate them to predict essential proteins. In this work, we first collect 26 features, and use SVM-RFE to select some of them to create a feature space for predicting essential proteins, and then remove the features that share the biological meaning with other features in the feature space according to their Pearson Correlation Coefficients(PCC). The experiments are carried out on S. cerevisiae data. Six features are determined as the best subset of features. To assess the prediction performance of our method, we further compare it with some machine learning methods, such as SVM, Naive Bayes, Bayes Network, and NBTree when inputting the different number of features. The results show that those methods using the 6 features outperform that using other features, which confirms the effectiveness of our feature selection method for essential protein prediction.

英文摘要:

Essential proteins are vital to the survival of a cell. There are various features related to the essentiality of proteins, such as biological and topological features. Many computational methods have been developed to identify essential proteins by using these features. However, it is still a big challenge to design an effective method that is able to select suitable features and integrate them to predict essential proteins. In this work, we first collect 26 features, and use SVM-RFE to select some of them to create a feature space for predicting essential proteins, and then remove the features that share the biological meaning with other features in the feature space according to their Pearson Correlation Coefficients(PCC). The experiments are carried out on S. cerevisiae data. Six features are determined as the best subset of features. To assess the prediction performance of our method, we further compare it with some machine learning methods, such as SVM, Naive Bayes, Bayes Network, and NBTree when inputting the different number of features. The results show that those methods using the 6 features outperform that using other features, which confirms the effectiveness of our feature selection method for essential protein prediction.

同期刊论文项目
同项目期刊论文
期刊信息
  • 《电信工程技术与标准化》
  • 主管单位:中国移动通信集团公司
  • 主办单位:中国移动通信集团设计院有限公司
  • 主编:梅海涛
  • 地址:北京市海淀区丹棱街甲16号302室
  • 邮编:100080
  • 邮箱:tetas@cmdi.chinamobile.com
  • 电话:010-52696688-7308/7310
  • 国际标准刊号:ISSN:1008-5599
  • 国内统一刊号:ISSN:11-4017/TN
  • 邮发代号:82-942
  • 获奖情况:
  • 国内外数据库收录:
  • 被引量:4245