To further improve keyword extraction, a method based on combining words according to word-order statistics is proposed. Through word-order statistics, part-of-speech (POS) tagging, stop-word filtering, and word combination, the method generates phrases or compound words and filters the candidate keywords. Additional features are then introduced to further improve the accuracy of the final extracted keywords. Experimental results show that the method performs well for keyword extraction from Chinese text.
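The steps named above (word-order statistics, POS filtering, stop-word filtering, word combination) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no combination rule, so the sketch assumes that adjacent words are merged into a phrase when the ordered pair occurs at least `min_pair_count` times. The names `STOP_WORDS`, `KEEP_POS`, `candidate_keywords`, `min_pair_count`, and `sep` are hypothetical, the input is assumed to be already segmented and POS-tagged, and the additional features mentioned in the abstract are not modeled.

```python
from collections import Counter

# Hypothetical stop-word list and POS whitelist; the abstract specifies neither.
STOP_WORDS = {"the", "of", "a", "is", "for", "and"}
KEEP_POS = {"n", "v", "a"}  # assumed tag set: noun, verb, adjective


def _usable(word, pos):
    """A token survives filtering if it is not a stop word and has a kept POS tag."""
    return word not in STOP_WORDS and pos in KEEP_POS


def candidate_keywords(tagged_tokens, min_pair_count=2, sep=" "):
    """Count candidate keywords from (word, pos) pairs given in text order.

    Assumed rule: two adjacent usable words are combined when the ordered
    pair appears at least `min_pair_count` times in the text.
    """
    # Word-order statistics: how often word b directly follows word a.
    pair_counts = Counter()
    for (w1, p1), (w2, p2) in zip(tagged_tokens, tagged_tokens[1:]):
        if _usable(w1, p1) and _usable(w2, p2):
            pair_counts[(w1, w2)] += 1

    candidates = Counter()
    i = 0
    while i < len(tagged_tokens):
        word, pos = tagged_tokens[i]
        if not _usable(word, pos):
            i += 1
            continue
        # Word combination: extend the phrase while the next ordered pair is frequent.
        phrase = [word]
        j = i
        while j + 1 < len(tagged_tokens):
            nxt, nxt_pos = tagged_tokens[j + 1]
            frequent = pair_counts[(tagged_tokens[j][0], nxt)] >= min_pair_count
            if _usable(nxt, nxt_pos) and frequent:
                phrase.append(nxt)
                j += 1
            else:
                break
        candidates[sep.join(phrase)] += 1
        i = j + 1
    return candidates


tokens = [
    ("keyword", "n"), ("extraction", "n"), ("is", "v"), ("useful", "a"),
    ("for", "p"), ("keyword", "n"), ("extraction", "n"), ("and", "c"),
    ("search", "n"),
]
result = candidate_keywords(tokens)
print(result.most_common())
```

For Chinese text, the tokens would come from a word segmenter with POS tagging, and `sep=""` would join the combined words without spaces.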