Automatic summarization uses a computer to distill, from a text or a collection of texts, a concise and coherent short passage that accurately and comprehensively reflects the main content, in order to meet general or specific user needs. This paper first gives an overview of the definition, role, and classification of automatic summarization, and then presents an automatic summarization technique based on keyword retrieval. It next proposes a method for detecting plagiarism in academic papers based on automatic summarization and analyzes the experimental results. Finally, it summarizes the paper and briefly outlines future work.