目前绝大多数频繁项目集(或最大频繁项目集)挖掘算法并没有考虑相关领域知识,其结果会产生许多无关的模式.因此,发现约束频繁(或约束最大频繁)项目集是多种数据挖掘应用中的关键问题,然而,这方面的研究工作却很少.为此该文提出了一种快速的基于频繁模式树(FP-tree:一种扩展前缀树结构)的约束最大频繁项目集挖掘及其更新算法.实验结果表明该算法是快速有效的。
Most algorithms of frequent itemsets (or maximum frequent itemsets) do not consider any domain knowledge. As a result they generate many irrelevant patterns. Therefore, finding constrained maximum frequent itemsets is a key in important data mining application such as discovery of constrained association rules, constrained strong rules, etc. Little work has been done on this problem. This paper presents an effective algorithm for mining constrained maximum frequent itemsets and its update, update constrained maximum frequent itemsets algorithm, based on a novel frequent pattern tree (FP-tree) structure that is an extended prefix-tree structure for storing compressed and crucial information about frequent patterns. Experiments show that the algorithm is effective.