地质图书馆书籍多,数据资料庞大,然而却存在数据资料增长过快和难以发现读者兴趣点的问题。实现高效的图书馆借阅数据挖掘分析与推荐,是提高效率的重要手段。为此本文提出了基于大数据地质文献分析挖掘平台,包括聚类分析,中文分词,推荐系统,关联分析功能,再通过Hadoop集群多节点进行推荐,从而提高了工作的效率。
Geological library has a large number of books and data are huge.It is difficult to solve that data grows too fast and it is difficult to find the reader's point.To achieve efficient library borrowing data mining analysis and recommendation,is an important means to improve efficiency.For this reason,this paper puts forward a large-scale data mining platform,including clustering analysis,Chinese word segmentation,recommendation system,correlation analysis function,and then through hadoop cluster multi-node recommendation,thus improving the efficiency of the work.