针对中文产品命名实体,提出了一种基于多种特征融合的识别方法.以词为标注粒度,将多种特征融合到条件随机场模型中,采用递增式学习策略选取最优的特征模板,实现了从中文自由文本中识别产品命名实体.实验表明,该方法获得了令人满意的实验效果,准确率、召回率和F值分别达到94.87%、92.50%和93.67%.
A approach for Chinese product named entity recognition based on fusion of multiple features is pres- ented. The approach fuses multiple features to conditional random fields model, using Chinese word as processing unit, which adopts incremental learning strategy to optimize feature templates and recognizes product named entity from Chinese free text. The experimental results show that the approach performs quite well, the precision, recall and F value can reach 96. 87%, 94. 50% and 95. 67% respectively.