位置:成果数据库 > 期刊 > 期刊详情页
面向网页文本的地理要素变化检测
  • ISSN号:1560-8999
  • 期刊名称:《地球信息科学学报》
  • 时间:0
  • 分类:P208[天文地球—地图制图学与地理信息工程;天文地球—测绘科学与技术]
  • 作者机构:[1]英国利兹大学地理学院,利兹LS29JT, [2]南京师范大学虚拟地理环境教育部重点实验室,南京210046, [3]中国国家基础地理信息中心,北京100830, [4]南京邮电大学计算机学院,南京210003
  • 相关基金:国家测绘科技项目“网络地理信息变化检测技术研究”;国家自然科学基金项目(40971231):“863”计划项目(2007AAl22221).
中文摘要:

地理要素变化检测已成为国家地理信息“十二五”规划和全国地理国情普查的重要组成部分。网页文本中蕴含海量的地理要素信息,尤其是新闻、政府、社交平台等网站的网页文本更新频繁,可为地理要素变化检测提供现势性的数据源。本文针对网页文本中地理要素变化的语言描述特点,构建了表达地理要素变化的语义知识库,设计了搜索引擎和通用主题相结合的网页爬虫,实现了相关网页文本的高效获取;采用规则模型和条件随机场模型,分别进行网页文本中地理要素变化信息抽取,包括地理要素名称、位置(地名)、时间和属性等。实验结果显示,本文设计的网页爬虫具有较高的相关网页文本获取能力,地理要素变化信息抽取的准确率能够达到70%以上,但是,语义知识库的完备程度对于信息抽取性能具有较大影响。研究成果表明,以网页文本为数据源的地理要素变化信息获取方法,能提供一种快速检测地理要素变化的新途径,与实地调绘和遥感影像检测等方法结合应用具有较好的优势互补性,可作为有力的辅助手段解决地理要素的持续更新和实时更新问题。

英文摘要:

Geographic features change detection has became a vital component of the national geographical in- formation 12th Five-Year-Plan and the national geographic general survey. In web pages, billions of geographic feature changes were contained, especially in government official websites, news homepages, social portals and etc. The web pages of these websites update frequently, which could provide the latest data for geographic infor-mation change detection. Considering the complex characteristics of the web geographic information description, this paper did some valuable achievements. First of all, the geographic information knowledge base was estab- lished by summarizing the geographic information words and phrases, which could give the great supports to geographic information semantics change detection. Then, the web geographic information was obtained using two kinds of web crawler technologies. Combining the Google Custom Search crawler and general topic crawler, the web geographic information obtainment could be more complete in both scope and depth. Thirdly, the geo- graphic information was parsed and extracted from the web text, which showed users the related features, place names, times and attributes. Last but not least, the prototype system was finally developed and the results were analyzed. The experiments indicated that the accuracy of related web pages obtainment and features change de- tection were over 74% and 70% respectively. In addition, the results of geographic information change detection highly relied on the integrity of knowledge base, which need to be completed further. Moreover, the uncertainty and fuzziness of web geographic information also limited the change detection results. Therefore, the web page based geographic information change detection could be a supplementary method of geographic information change detection. Combining the traditional surveying detection and remote-sensing imagery detection methods, it could solve the problems of continuous updating and timely updating of geog

同期刊论文项目
同项目期刊论文
期刊信息
  • 《地球信息科学学报》
  • 中国科技核心期刊
  • 主管单位:中国科学院
  • 主办单位:中国科学院地理科学与资源研究所 中国地理学会
  • 主编:徐冠华
  • 地址:北京大屯路甲11号
  • 邮编:100101
  • 邮箱:sxfu@lreis.ac.cn
  • 电话:010-64888891
  • 国际标准刊号:ISSN:1560-8999
  • 国内统一刊号:ISSN:11-5809/P
  • 邮发代号:82-919
  • 获奖情况:
  • 国内外数据库收录:
  • 中国中国科技核心期刊,中国北大核心期刊(2008版),中国北大核心期刊(2011版),中国北大核心期刊(2014版)
  • 被引量:3181