文章提出一种基于图模型的关键词挖掘方法,应用K最邻近耦合图构造文档的图模型,将文档映射为一个语义结构图,然后结合聚类系数变化量,平均路径变化量,TF-IDF以及区域位置因子来衡量词语节点的重要性,根据重要性得分选择候选关键词集,最后根据短语合并规则形成最终的关键词,实验结果表明,该方法相比于TF—IDF和小世界特征方法性能有所提高。
In this paper, a new algorithm is proposed for keywords mining by graph model.Using k-nearest neighbor couple graph model,a document is mapped to a semantic structure graph,then combined with four variables,clustering coefficient increment,average path length increment,TF-IDF and regional location factor, to measure the importance of word nodes, according to the importance score selecte the candidate words set,finally, form the document words by merger rules.Experimental results show that the performance is more efficient than TF-IDF method and Small-World mode method.