学术论文的全文数据越来越容易获取使大规模的引文内容分析成为可能。文章通过设计引文内容标注框架,开发引文内容标注系统,分别从引用对象、引文功能、引用情感、引文位置、引文重要性、标注自信度等方面进行标注。构建用于引文内容分析的标准化数据集并进行统计分析,可为引文内容的特征分析等基础性研究及学术预测等应用性研究提供数据支撑。
As the structured data of academic literature becomes more and more accessible, it is likely toanalyze large-scale citation contentautomatically. In this paper, the framework of citation content annotation is constructed and a citation content annotation system is developed. Annotation is carried outon the objects, the functions, the sentiment, the location andthe importance of citations, and the degree of confidence. A standardized data set for citation analysis is then constructed and the statistical analysis is done, which provide data support for the basic research and applied research on citation content.