抽取一个句子的核心依存图是对句子进行语义理解的有效途径。在CFN自动标注的基础上,只能得到框架依存图,为了把框架依存图转换成框架核心依存图需要提取每个框架元素的语义核心词。该文提出了基于多词块标注的框架元素语义核心词识别和提取方法,通过对比分析,给出了多词块和框架元素的融合策略,并建立了在多词块标注基础上提取框架元素语义核心词的规则集。在6771个框架元素上的实验结果显示,采用该文的方法和规则集提取框架元素核心词的平均准确率和覆盖率分别为95.58%和82.91%。
It is an effective way to understand the semantic information of a sentence by extracting the frame kernel dependency graph from the sentence. It is necessary to extract semantic core words for each frame element to further establish the frame kernel dependency graph since we can only extract the frame dependency graph from a sentence based on the automatic annotation of CFN, This paper proposes a method to identify and extract the core words of frame elements by multi-word chunk. On the basis of comparative analyzing results, we propose the strategy of in- tegrating the multi-word chunk and frame element and the rules to extract the core words of frame elements from the multi-word chunk labeling. The experimental resutts from 6 771 frame elements show that the average precision and average coverage are 95.58% and 82.91%, respectively.