将动态异构的Web信息资源进行抽取以统一的方式供用户查询和使用,是当前迫切需要解决的问题。介绍了分析相关Web页面的方法和经验,实现了自动提交HTML表单获得所需页面和对页面的信息抽取。最后,实验证明了此方法的有效性。
It was an open problem crying for being solved to integrate dynamic and heterogeneous websites for users to query in a uniform way. This paper presented a method of analyzing relevant websites, which implemented the automatic submission of HTML forms to get required websites and the information extraction of websites. The experiment performance demonstrates the efficiency and effectiveness of the method.