传统数据挖掘算法在处理多关系时,需要物理连接,因此存在效率不高的问题.为了解决这一问题,研究多关系数据挖掘中的分类问题,提出一种有效的多关系贝叶斯分类算法EMBC.EMBC算法的目标是提高分类的准确率,并且降低运行时间.EMBC算法利用元组ID传播的思想,结合朴素贝叶斯分类算法,可以直接在多关系上进行分类.实验表明,EMBC算法提高分类的准确率,并且显著降低运行时间.
While dealing with multi-relation, traditional data mmmg algorithms used physical join, rnus 1t nau me problem of low efficiency. In order to solve this problem, the problem of classification in multi-relational data mining was investigated, and an efficient multi-relational Bayesian classification algorithm called EMBC was propose& EMBC aims at increasing the accuracy of classification, and decreasing running time. By taking advantage of tuple ID propagation approach, and combined with naive Bayesian classification algorithm, EMBC can directly classify in multi-relation. Performance results demonstrate that, EMBC increases the accuracy of classification, and significantly decreases running time.