To meet the needs of public-safety research on online public opinion, this paper applies Chinese word segmentation to emergency management and proposes a method, based on the ICTCLAS segmenter, for automatically discovering hot-spot information in online public opinion. The method reads in news text, performs word segmentation and word-frequency counting, removes stop words from the frequency table, and merges multi-unit keywords to produce a list of hot-spot keywords for the emergency. This keyword list supports timely retrieval of online information and provides technical support for emergency decision-making. The practicality and reliability of the method are verified on emergency case examples.
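The pipeline summarized in the abstract (frequency counting, stop-word removal, merging of multi-unit keywords) can be illustrated with a minimal sketch. It assumes the text has already been segmented into a token list, for example by ICTCLAS, and does not call ICTCLAS itself. The function name `hot_keywords`, its parameters, and in particular the merge rule (join two adjacent tokens when the pair occurs nearly as often as its rarer part) are hypothetical choices made for illustration; the abstract does not specify the paper's actual merge criterion.

```python
from collections import Counter


def hot_keywords(tokens, stop_words, top_n=10, merge_ratio=0.8, min_count=2):
    """Return up to top_n (keyword, frequency) pairs from a segmented token list.

    tokens      -- words produced by a segmenter (assumed input, e.g. ICTCLAS output)
    stop_words  -- set of words to exclude from the result
    merge_ratio -- hypothetical threshold for merging adjacent tokens
    min_count   -- minimum number of co-occurrences before a pair is considered
    """
    # Step 1: word-frequency statistics over the segmented text.
    unigram = Counter(tokens)
    # Adjacent token pairs are the candidates for multi-unit keywords.
    bigram = Counter(zip(tokens, tokens[1:]))

    remaining = Counter(unigram)  # single-word counts left after merging
    merged = Counter()            # counts of merged multi-unit keywords
    for (a, b), n in bigram.items():
        if a in stop_words or b in stop_words or n < min_count:
            continue
        # Assumed rule: merge when the pair occurs almost as often as its rarer part.
        if n >= merge_ratio * min(unigram[a], unigram[b]):
            merged[a + b] += n
            remaining[a] -= n
            remaining[b] -= n

    # Step 2: drop stop words and words fully absorbed into merged keywords.
    freq = Counter({w: c for w, c in remaining.items()
                    if c > 0 and w not in stop_words})
    # Step 3: add the merged keywords and rank by frequency.
    freq.update(merged)
    return freq.most_common(top_n)


if __name__ == "__main__":
    sample = ["地铁", "火灾", "的", "原因", "，",
              "地铁", "火灾", "的", "救援", "，", "火灾"]
    print(hot_keywords(sample, stop_words={"的", "，"}))
```

In this sketch the occurrences absorbed into a merged keyword are subtracted from the counts of its parts, so a word such as "火灾" is ranked only by the occurrences where it stands alone. A production version would need a tuned merge criterion and a real stop-word list.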