收集了155种有机化学品厌氧生物降解数据,以随机抽取的109种物质作为训练集,另外46种物质作为验证集,通过结构式拆分得到各基团,分别采用多元线性回归和BP人工神经网络2种算法对有机化合物结构与生物降解性定量关系(QSBR)进行研究。结果表明,多元线性回归模型验证集正确率为78.26%,总正确率为84.52%;BP人工神经网络模型验证集正确率为82.61%,总正确率为90.32%。可见,BP人工神经网络算法相对优于多元线性回归算法。
Anaerobic biodegradation data of 155 kinds of organic chemicals were collected. Among them the data of 109 chemicals were picked up randomly for use in the training set, and of the remaining 46 in the validation set. By splitting constitutional formula, radicals or groups of the chemicals were obtained and analyzed with muhiple linear regression and BP artificial neural network, separately, to explm'e for quantitative relationships between structure and biodegradability of organic chemicals (QSBR). Results show that the use of MLR in analyzing the validation set was 78.26% in accuracy and 84. 52% in total accuracy and for the use of BP-ANN, 82.61% in accuracy and 90. 32% in total accuracy. Obviously the latter is superior to the former.