该文提出了结构关键词的概念,给出了结构概念格和内容概念格的形式化描述.结构概念格是对文档语义段的逻辑存储,内容概念格是对文档内容信息的逻辑存储.开发了一个基于文档的结构和内容构造两级概念格的信息抽取的实验系统.实验表明,该方法对减少信息抽取的时间和提高信息抽取的精度有显著的效果.
The concept of structure keywords is put forward, and the formal descriptions of structure and content concept lattice are introduced in this paper. The structure concept lattice is the logical storage of semantic structure of d octunents, and the content concept lattice is used to store content information of documents. Finally, an experiment system of IE based on two - level concept lattice is developed and the result indicates that the effectiveness of the proposed method is notable for reducing the time of IE and increasing the precision of IE.