东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

微博客蕴含交通信息的提取

ISSN号：1006-8961
期刊名称：中国图象图形学报
时间：2013.1.1
页码：123-129
分类：P208[天文地球—地图制图学与地理信息工程;天文地球—测绘科学与技术]
作者机构：[1]中国科学院地理科学与资源研究所资源与环境信息系统国家重点实验室,北京100101
相关基金：国家高技术研究发展计划（863）基金项目（2012AA12A211）;国家自然科学基金项目（40871184,41101149）.
相关项目：面向个体行为时空过程的城市居民活动表达与分析方法

关键词：微博客, 交通信息, 分词, 模糊聚类, 畅通度, 置信度, micro-blog , traffic information , word segmentation , fuzzy clustering, clear degree , degree of confidence

中文摘要：

微博客消息中可能蕴含大量描述城市道路的交通信息，如交通状况、交通事件、交通管制等，提取这些交通信息能够为传统的固定式传感器和浮动车采集交通信息手段提供有效补充。然而，微博客消息描述的模糊性、差异性及非结构化特征，使得从海量微博客消息中快速准确地提取和甄别交通信息成为难题。提出一种从微博客消息中快速提取和融合交通信息的技术方法，首先对采集到的微博客消息进行分词解析和路网匹配，然后采用基于神经网络的模糊C聚类方法对描述路段交通状态的微博客消息定量化结果进行分析，获取各路段置信度最高的交通状态描述，最后得到各路段的交通畅通度水平。基于新浪微博客和北京路网的实验过程验证了本文技术方法的有效性。

英文摘要：

Micro-blog messages usually contain a great deal of traffic information such as traffic conditions, traffic events and traffic controls, which can be useed as a complement to conventional traffic information collection technologies like fixed sensors and floating cars. However, due to ambiguous narrating, uncertainty, and the unstructured characteristics of micro-blog messages, extracting traffic information from micro-blog messages is rather difficult. In this paper, we propose an approach for extracting traffic information from a large amount of micro-blog messages. First, we build a traffic informa- tion table by semantically extracting traffic related words from micro-blog messages and matching each word onto the corre-sponding road segment of the road networks. Then, according to the traffic information table, we evaluate the highest confidence level of traffic condition for each road segment by using a neural network based Fuzzy-C-Means （ FCM ） clustering method, to obtain the most confident road conditions. Experiments on Beijing road networks with a large number of Sina mi- cro-blog messages verify the effectiveness of the presented approach.

同期刊论文项目