随着海量文本的涌现,信息超载和数据过剩等问题促使了文本可视化技术的出现.文本可视化技术综合了文本分析、数据挖掘、数据可视化、计算机图形学、人机交互、认知科学等学科的理论和方法,为人们提供了一种理解复杂文本的内容、结构和内在规律等信息的有效手段.文中首先阐述了文本可视化的概念和重要性,然后按照不同可视化对象类型综述了文本可视化的研究现状,并介绍了典型的文本可视化方法与方案;最后,对文本可视化的未来研究方向进行了展望.
With the emergence of massive texts, information overload and data redundancy raise great challenges for information processing. To address these issues, text visualization has been proposed for understanding the content, structure and patterns hidden behind complicated textual information. Text visualization integrates several techniques including text analysis, data mining, data visualization, computer graphics, human computer interaction, cognitive science and so on. In this paper, we first introduce the concepts of text visualization. Afterwards, we present the research achievements according to different visualization objects, and introduce typical visualization methods and schemes. As a conclusion, we give an outlook to future research directions of text visualization.