提出一种基于CSS选择器的深网结果页数据抽取方法,用于抽取深网结果页中的数据记录.实验结果表明在大多数情况下,该方法都能准确抽取出页面中的数据记录.
We propose a methodology sult pages. Experimental evaluation based on CSS selector to extract data records from deep web reon a large number of Web page collections indicates that our methodology correctly extracts data records in most cases