篇章作为词和句子之后的一种文本分析粒度在自然语言理解和自然语言生成中起到至关重要的作用。该文从计算语言学角度出发,对中英文篇章分析技术的研究现状进行了综述。介绍了中英文篇章分析技术在自然语言处理中的应用,并分别从篇章理论、篇章语料库及评测、篇章分析器的自动构建等方面详细阐述了中英文篇章分析技术。最后归纳出篇章分析技术后续研究的几个方向。
Discourse, a kind of text analysis granularity beyond word and sentence, plays a crucial role in natural lan- guage understanding and generation. This paper surveys the state of the art researches in Chinese and English dis- course analysis under the perspective of computational linguistics, including the applications of Chinese and English discourse analysis, the process of constructing a fui1 Chinese and English discourse parser according to different dis course theories, discourse corpus and evaluation, as well as algorithms and detailed implementation. Also, this pa- per outlines several directions /or further researches on discourse analysis.