企业间竞争互动的高强度与高速度,突显出竞争情报的时效性,动态竞争情报是企业在复杂多变的环境下取得成功的关键。Web资源可分为surface Web和Deep Web。Surface Web由静态网页构成,Deep Web信息资源由动态页面、商业数据库、实时数据和企业内部数据库组成,具有质量高、实时性强、易于深度分析的特点,是企业动态竞争情报的重要来源,但常规网络信息采集工具不能直接获得这些信息。针对动态竞争情报采集中存在的信息源选择、信息抽取、信息分析中存在的障碍,提出面向Deep Web的动态竞争情报智能采集策略,详细探讨了动态数据源的智能选择、查询结果的智能抽取、智能化的数据集成和智能分析策略。
The high-speed and high-intensity characteristic of competitive interaction between enterprises highlight the requirement of timeliness for competitive intelligence. Dnamic competitive intelligence becomes a key factor for the success of an enterprise in the complex and varying enviroment. The Web can be divided into Surface Web and Deep Web. The former is built by static web pages. The latter is composed by dynamic web pages, commercial database, realtime data and internal databases in enterprises. Information resources on the Deep Web have features of high quality and real-time and easy-to-analyze. They are the important resources of dynamic competitive intelligence and can not be accessed by ordinary tools. Deep Web oriented acqusition strategy for dynamic competitive intelligence is suggested according to the obstacles in the cycle of intelligence acqusition, including problems in resource selection, information extraction, information analysis. The paper discussed stragies in detail about intelligent selection of danvmie data resource and smart extraction of result researched and intelligent data intergration and analysis.