满足条件独立性假设时,朴素贝叶斯分类器理论上比其它分类方法具有更高的分类正确率,但该假设在许多实际情况中并不成立,针对这一问题,提出了一种基于因子分析的朴素贝叶斯分类模型FA-NBC,并将其应用于边坡的稳定性识别。为了保证朴素贝叶斯分类器结构上的简单性,FA.NBC模型以方差贡献为依据构建新的属性集,新属性集包含原属性集的大部分信息且满足条件独立性假设。UCI数据集上的实验结果证明了FA-NBC模型的有效性。
Naive Bayesian classifier (NB) is popular for its simplicity and effectiveness. However, the accuracy of NB is affected when the conditional independence assumption is violated. A new algorithm based on factor analysis, FA-NBC, is proposed to retain the structure strength of NB while reducing error by alleviating the attribute interdependence problem. Then the classifier is applied to slope recognition. New independent attribute set which includes most of the information of the original property set is built based on variance to ensure the structural simplicity of naive Bayesian classifier. Experimental results on UCI data sets prove the validity of the model.