The site search facilities that websites typically provide have two main shortcomings: they cannot collect web content automatically, and their fuzzy queries return results with low accuracy. To address these problems, a site search engine based on Lucene is designed by applying the working principles of search engines. The system implements incremental collection of site content, automatic Chinese word segmentation, and construction of an inverted index. It improves the precision and recall of users' site searches and allows the site's information resources to be fully utilized.
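The three functions named in the abstract (incremental collection, word segmentation, and inverted indexing) can be illustrated with a small sketch. This is not the paper's implementation and does not use Lucene: the `SiteIndex` class, the `tokenize` helper, and the example URLs are hypothetical names introduced here. Overlapping character bigrams stand in for a real Chinese word segmenter, and a content hash stands in for whatever change detection the actual system uses.

```python
import hashlib
import re
from collections import defaultdict


def tokenize(text):
    """Stand-in segmenter (assumption): ASCII words plus bigrams of CJK runs."""
    tokens = []
    for run in re.findall(r"[a-z0-9]+|[\u4e00-\u9fff]+", text.lower()):
        if re.match(r"[a-z0-9]", run) or len(run) == 1:
            tokens.append(run)
        else:
            # Overlapping bigrams, e.g. "搜索引擎" -> 搜索, 索引, 引擎
            tokens.extend(run[i:i + 2] for i in range(len(run) - 1))
    return tokens


class SiteIndex:
    """Hypothetical in-memory inverted index with incremental updates."""

    def __init__(self):
        self.postings = defaultdict(set)  # term -> set of page URLs
        self.fingerprints = {}            # URL -> hash of last indexed text

    def add_page(self, url, text):
        """Index a page; return False if its content is unchanged."""
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
        if self.fingerprints.get(url) == digest:
            return False  # incremental collection: skip unchanged pages
        if url in self.fingerprints:
            self._remove(url)  # drop stale postings before re-indexing
        for term in tokenize(text):
            self.postings[term].add(url)
        self.fingerprints[url] = digest
        return True

    def _remove(self, url):
        for term in list(self.postings):
            self.postings[term].discard(url)
            if not self.postings[term]:
                del self.postings[term]

    def search(self, query):
        """Return the URLs that contain every term of the query."""
        terms = tokenize(query)
        if not terms:
            return set()
        result = None
        for term in terms:
            urls = self.postings.get(term, set())
            result = set(urls) if result is None else result & urls
        return result
```

Calling `add_page` a second time with the same text returns `False` and leaves the index untouched, which is the sense of "incremental" assumed in this sketch. A production system built on Lucene would instead rely on its analyzers and index writer.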