东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

机器学习方法用于建立乙酰胆碱酯酶抑制剂的分类模型

ISSN号：1000-6818
期刊名称：《物理化学学报》
时间：0
分类：O641[理学—物理化学;理学—化学]
作者机构：[1]四川大学化学学院,成都610064, [2]四川大学化学工程学院,成都610065, [3]Department of Pharmacy, National University of Singapore, Singapore 117543
相关基金：国家自然科学基金（20973118）资助项目

关键词：乙酰胆碱酯酶抑制剂, 机器学习方法, 变量筛选, 应用域, Acetylcholinesterase inhibitor, Machine learning method, Feature selection, Applicability domain

中文摘要：

我们构建了表征乙酰胆碱酯酶抑制剂分子组成、电荷、拓扑、几何结构及物理化学性质等特征的1559个描述符,通过Fischer Score排序过滤和Monte Carlo模拟退火法相结合进行变量筛选得到37个描述符,然后分别用支持向量学习机（SVM）、人工神经网络（ANN）和k-近邻（k-NN）等机器学习方法建立了乙酰胆碱酯酶抑制剂的分类预测模型.对于训练集的515个样本,通过五重交叉验证,各机器学习方法对正样本,负样本和总样本的平均预测精度分别为87.3%-92.7%,67.0%-81.0%和79.4%-88.2%;通过y-scrambling方法验证SVM模型是否偶然相关,结果正样本,负样本和总样本的平均预测精度分别为72.7%-82.5%,41.0%-53.0%和62.1%-69.1%,明显低于实际所建模型的预测精度,表明所建模型不存在偶然相关;对172个没有参与建模的外部独立测试样本,各机器学习方法对正样本,负样本和总样本的预测精度分别为93.3%-100.0%,74.6%-89.6%和86.1%-95.9%.所建模型中,SVM模型预测精度最好,且明显高于其它文献报道结果.

英文摘要：

A total of 1559 molecular descriptors including constitutional, charge distribution, topological, geometrical, and physicochemical descriptors were calculated to encode acetylcholinesterase inhibitors. The 37 molecular descriptors were selected using a hybrid filter/wrapper approach by combining a Fischer Score and Monte Carlo simulated annealing. Classification models for the acetylcholinesterase inhibitors were then built based on support vector machine （SVM）, artificial neural networks （ANN）, and k nearest neighbor （k NN） methods. For the 515 samples in the training set, we obtained average prediction accuracies of 87.3%-92.7%, 67.0%-81.0%, and 79.4%-88.2% for the positive, the negative, and the total samples, respectively, by 5 fold cross validation. Average prediction accuracies of 72.7%-82.5%, 41.0%-53.0%, and 62.1%-69.1% were obtained for the positive, the negative, and the total samples, respectively, by the y scrambling method, indicating that there was no chance correlation in our models. An external test was conducted on 172 samples that were not used for model building and we obtained prediction accuracies of 93.3%-100.0%, 74.6%-89.6%, and 86.1%-95.9% for the positive, the negative, and the total samples, respectively. The prediction accuracies obtained by all the machine learning methods especially by the SVM method were far better than previously reported results.

同期刊论文项目

碳氢氧化合物和自由基热动力学理参数的理论计算

期刊论文 19

同项目期刊论文

Prediction of the acute toxicity of chemical compounds to the fathead minnow by machine learning app

烷基自由基β位裂解反应类反应势垒与速率常数的精确计算

Classification Models for Acetylcholinesterase Inhibitors Based on Machine Learning Methods

Theoretical Investigations on Removal Reactions of Ethenol by H Atom

Reaction Class Isodesmic Reaction Method and Calculation of Thermokinetic Parameters for Reactions i

Accurate Prediction of Enthalpies of Formation for a Large Set of Organic Compounds

Prediction of HIV-1 Protease Inhibitors Using Machine Learning Approaches

Update of PROFEAT: a web server for computing structural and physicochemical features of proteins an

烷基自由基β位裂解反应类反应势垒与速率常数的精确计算

Identification of DNA adduct formation of small molecules by molecular descriptors and machine learn

Prediction of human major histocompatibility complex class II binding peptides by continuous kernel

In silico identification of human pregnane X receptor activators from molecular descriptors by machi

乙烯氧化动力学机理简化

期刊信息

《物理化学学报》
中国科技核心期刊

主管单位:中国科学技术协会
主办单位:北京大学化学与分子工程学院承办
主编：刘忠范
地址：北京大学化学楼
邮编：100871
邮箱：whxb@pku.edu.cn
电话：010-62751724

国际标准刊号：ISSN：1000-6818
国内统一刊号：ISSN：11-1892/O6
邮发代号:82-163

获奖情况:
中文核心期刊

国内外数据库收录:
俄罗斯文摘杂志,美国化学文摘（网络版）,荷兰文摘与引文数据库,美国科学引文索引（扩展库）,英国科学文摘数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,英国英国皇家化学学会文摘,中国北大核心期刊（2000版）

被引量:24781