网络媒体庞大的信息量及信息内容的各异,并不能把传媒聚类算法中适用于文本信息聚类的基本划分方法直接应用于互联网媒体信息热点生动发现的研究工作中。鉴于此,文中将基于密度的聚类思想引入CFK—Means算法,创造性地提出了全新的DCFK聚类算法。与此同时,文中基于DCFK算法构造大规模中文信息聚类模型,并且通过系列实验验证本聚类模型在互联网媒体信息主动热点发现领域的有效性和实用性。
bundant information and difference of Internet media can't directly apply the basic divided method,which suit the text information duster in media cluster algorithm, to actively discover the hotspots of Interact media information. So that, propose the the new cluster algorithm called DCFK based on CFK - Means algorithm. It is necessary to construct abundant Chinese information cluster model to discover the Intemet information hotspots. Finally, series of experiments will prove the effectiveness and utility of them.