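Natural language processing and complex network techniques can be applied to extract and analyze the social networks embedded in Chinese literary works. Taking "Romance of the Three Kingdoms" as an example, this paper extracts its social network: nodes are the characters in the work, edges are the connections between characters, and edge weights are the numbers of character co-occurrences across the chapters. A character database, built from background knowledge and Internet resources, assists the network modeling. The resulting social network is analyzed in terms of node degree distribution, centrality, clustering characteristics, and related measures. The results show that the distribution of characters in Chinese literary works exhibits clear small-world properties, a limited power-law distribution, and community structure, as well as versatility and diversity.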
Using natural language processing and complex network analysis, the social networks embedded in Chinese literature can be extracted and analyzed. Taking "Romance of the Three Kingdoms" as an example, this paper extracts the social network, with nodes as the novel's characters, edges as the connections between characters, and edge weights as the number of times the characters co-occur. The social network is then analyzed in terms of node degree distribution, centrality, clustering characteristics, etc. The results show that the characters in Chinese literature exhibit clear small-world properties and a limited power-law distribution. In "Romance of the Three Kingdoms" in particular, the character distribution also shows clear community characteristics, as well as versatility and diversity.
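
The network construction described above can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes that a character counts as present in a chapter when the name appears as a plain substring, and that an edge weight is the number of chapters in which two characters both appear. The function names and the toy chapters are hypothetical, and the alias handling that a character database would provide is omitted.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence_network(chapters, characters):
    """Return {(a, b): weight} with a < b, counting chapters that mention both."""
    weights = Counter()
    for text in chapters:
        # Assumption: plain substring matching; no alias or courtesy-name resolution.
        present = sorted(name for name in characters if name in text)
        for a, b in combinations(present, 2):
            weights[(a, b)] += 1
    return weights


def degrees(weights):
    """Unweighted degree of each node: the number of distinct neighbours."""
    deg = Counter()
    for a, b in weights:
        deg[a] += 1
        deg[b] += 1
    return deg


def clustering_coefficient(weights, node):
    """Local clustering coefficient: fraction of neighbour pairs that are linked."""
    neighbours = {b if a == node else a for a, b in weights if node in (a, b)}
    k = len(neighbours)
    if k < 2:
        return 0.0
    linked = sum(
        1 for a, b in combinations(sorted(neighbours), 2) if (a, b) in weights
    )
    return 2.0 * linked / (k * (k - 1))


# Toy data invented for illustration; these are not passages from the novel.
characters = ["Liu Bei", "Guan Yu", "Zhang Fei", "Cao Cao"]
chapters = [
    "Liu Bei, Guan Yu and Zhang Fei swear an oath.",
    "Cao Cao meets Liu Bei.",
    "Guan Yu leaves Cao Cao to rejoin Liu Bei.",
]

network = build_cooccurrence_network(chapters, characters)
node_degrees = degrees(network)
```

Keys are stored as sorted pairs so that each undirected edge has exactly one entry. For real Chinese text, name matching would likely need a character database that maps aliases to a canonical name, which this sketch leaves out.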