东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

一种基于多模态模型的随机子空间分类集成算法

期刊名称：南京师范大学学报(工程技术版)
时间：0
页码：57-62
语言：中文
分类：TP301.6[自动化与计算机技术—计算机系统结构;自动化与计算机技术—计算机科学与技术]
作者机构：[1]南京师范大学计算机科学与技术学院,江苏南京210097
相关基金：国家自然科学基金（60873176）、江苏省自然科学基金（BK2008430）资助项目
相关项目：双重稀疏化的大间隔分类器研究

关键词：多模态, 随机子空间, 分类器集成, Multi-modality, random subspace, classifier ensemble

中文摘要：

分类是当前机器学习的重要研究内容之一,已取得了一定的进展.现有的文本分类方法大多基于VSM模型,而VSM未能有效地利用隐含在文本中的结构信息.同时,VSM下的样本空间常常是高维的,单一的降维策略可能会丢失有用信息.为改进现有算法的不足,提出了一种基于多模态模型的随机子空间分类集成算法MMRFSEn,有效地利用文本中的结构信息（单词分布位置的均值和标准差）,且各基分类器是由随机选择的子空间构建而成.实验结果表明,该方法是有效可行的.

英文摘要：

Text Classification is an important machine learning research, in which some progress has been made. Most of the existing classification methods are based on Vector Space Model（VSM） , but VSM does not effectively utilize the structure information hidden in the text samples. At the same time, VSM vectors are often high-dimensional, merelv using dimensionality reduction strategy may lead to the loss of the useful information. To uvercome the shortcomings of the existing algorithms, we propose an algorithm called Multi-modality-based Random Feature subspaee classifier Ensemble （MMRFSEn） . which can eft）etively use the structure information hidden in the text such as the words＇ s average localion and standard deviation, and meanwhile each single classifier is constructed by a randomly selected subspace. The experimental results show that the newly developed method is effective and feasible.

同期刊论文项目