东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

基于概念分组的Web搜索结果聚类算法

ISSN号：1000-565X
期刊名称：华南理工大学学报(自然科学版)
时间：0
页码：130-134
语言：中文
分类：TP391[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：[1]西安电子科技大学计算机学院,陕西西安710071, [2]西安电子科技大学理学院,陕西西安710071
相关基金：国家自然科学基金资助项目（60603098）
相关项目：多核学习研究

关键词：信息检索, 搜索引擎, WEB文档, 聚类, 概念分组, information retrieval , search engine , Web document , clustering , conceptual grouping

中文摘要：

为了便于用户浏览搜索引擎返回的搜索结果，快速有效地定位有价值的Web文档，提出了基于概念分组的Web搜索结果聚类算法．首先，建立特征词同现网络，利用概念分组技术挖掘特征词之间的语义关联，形成主题概念类；然后，计算文档与各概念类之间的距离，据此实现Web搜索结果的聚类；最后，综合考虑特征词在类内和文档集中的重要性进行类别标签的选择．实验结果表明本算法具有较好的聚类性能，明显优于k-均值算法，且产生的类别标签容易理解．

英文摘要：

In order to facilitate the browse of the search results obtained by search engines and to rapidly and effectively find valuable Web documents, this paper proposes a new clustering algorithm of Web search results based on the conceptual grouping. In this algorithm, first, the co-occurrence networks of characteristic terms are built. Next, the semantic relationships among characteristic terms are mined via the conceptual grouping to form different clusters related to the query topic. Then, the distances between the Web documents and the formed clusters are calculated for the clustering of Web search results. Finally, the cluster labels are selected according to the importance of characteristic terms in the search .results and the clusters. It is indicated by experiments that the proposed algorithm performs better than the k-means algorithm, and that the labels selected by the algorithm are apprehensible.

同期刊论文项目

多核学习研究

期刊论文 46 会议论文 14

同项目期刊论文

训练支持向量机的Huber近似算法

Huber approximation method for training the support vector machines

多源性数据 SVM 集成算法研究

求解SEB问题的有限记忆BFGS方法

High-dimensional indexing method based on elliptical-shaped clustering

Efficient nearest neighbor query based on extended B+-tree in high-dimensional space

一种基于椭圆体聚类的高维索引方法

极大极小问题的光滑化信赖域共轭梯度法

求解半定规划问题的一种光滑化方法

基于K均值聚类和多核SVM的微钙化簇检测

Efficient video indexing method using dynamic distance measure for the principal component

采用动态主分量距离测度的视频索引技术

Variant of Gaussian kernel and parameter setting method for nonlinear SVM

A new technique for generalized learning vector quantization algorithm

共轭梯度型支撑向量机

The application of successive quadratic programming algorithm to multiuser detection in CDMA

Image retrieval based on color distribution entropy

基于集成算法的半监督学习

Large scale classification with local diversity AdaBoost SVM algorithm

Clustering algorithm of web search results based on conceptual grouping

A new iterative algorithm training SVM

一种改进的元搜索排序合成算法

采用双目标优化的核参数选择方法

Efficient high-dimensional indexing by sorting principal component

On reliability of the folded hypercubes

Fault-tolerant analysis of a class of networks

Semismooth Newton support vector machine

元搜索引擎结果合成算法

Improved rank merging algorithm for meta search

On conditional diagnosability and reliability of the BC networks

Training hard-margin support vector machines using greedy stagewise algorithm

搜索引擎中的聚类浏览技术

基于分组特征多核支持向量机的微钙化簇检测

多模式扰动模型动态加权SVM集成研究

一种基于L1稀疏正则化和非负矩阵分解的盲源信号分离新算法

期刊信息

《华南理工大学学报：自然科学版》
北大核心期刊（2011版）

主管单位:国家教育部科技司
主办单位:华南理工大学
主编：李元元
地址：广州市天河区五山路华南理工大学17号楼
邮编：510640
邮箱：journal@scut.edu.cn
电话：

国际标准刊号：ISSN：1000-565X
国内统一刊号：ISSN：44-1251/T
邮发代号:46-174

获奖情况:
本学报荣获1996年国家教委系统优秀科技期刊二等奖...,1999年荣获全国优秀高校自然科学学报及教育部优秀...,2001年荣获广东省优秀期刊奖和广东省优秀科技期刊...,2004年获全国高校优秀科技期刊二等奖,2006年获首届教育部优秀科技期刊奖,2008年荣获第二届教育部优秀科技期刊奖

国内外数据库收录:
俄罗斯文摘杂志,美国化学文摘（网络版）,荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,英国科学文摘数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:22954