东篱科研大数据发现系统（DRDS）

欢迎您！东篱公司退出

申报数据库
1. 申报指南
立项数据库
成果数据库
1. 期刊论文
2. 会议论文
3. 著作
4. 专利
项目获奖数据库

位置：成果数据库 > 期刊 > 期刊详情页

基于样例选取的属性约简算法

ISSN号：1000-1239
期刊名称：计算机研究与发展
时间：2012.11.19
页码：2305-2310
分类：TP18[自动化与计算机技术—控制科学与工程;自动化与计算机技术—控制理论与控制工程]
作者机构：[1]河北省机器学习与计算智能重点实验室河北大学数学与计算机学院,河北保定071002
相关基金：基金项目：国家自然科学基金项目（61170040）;河北省自然科学基金及高校科技重点项目（F2010000323,ZD2010139）
相关项目：归纳学习中的不确定性研究

作者：王熙照|王婷婷|翟俊海|

关键词：信息系统, 样例选取, 粗糙集, 差别矩阵, 属性约简, information system, instance selection, rough set, discernibility matrix, attributereduction

中文摘要：

计算属性约简是粗糙集框架下归纳学习的关键部分．基于差别矩阵的属性约简算法是常用的属性约简算法之一．给定一个信息系统，利用该算法可以求出信息系统的所有属性约简．但是该算法需要的存储空间大，执行时间长，特别是对于大型数据库，差别矩阵的存储成为其应用的瓶颈．针对这一问题，提出了一种基于样例选取的属性约简算法，算法分为3步：首先从样例集中挑选出重要的样例；然后用选出的样例构造差别矩阵；最后计算信息系统的所有约简．实验结果显示，当处理大型数据库时，新算法能有效地减少存储空间和执行时间．

英文摘要：

Computing reduction of attributes plays an essential role in the framework of supervised learning based on rough sets. Attribute reduction algorithm based on discernibility matrix is one of the commonly used attribute reduction algorithms. Given an information system, all reductions can be found by using this algorithm. However, this algorithm suffers from the main problems： large memory requirement and large response time needed. Especially, for a large database, it is the bottleneck to store the discernibility matrix. To tackle this problem effectively, an attribute reduction algorithm based on instance selection is proposed. The algorithm consists of three stages： firstly, the most informative instances are selected from the training set; secondly, the discernibility matrix is computed by using the selected instances; finally, all reductions can be found. The experimental results show that the proposed method can efficiently reduce the computational complexity both of time and space especially on large databases.

同期刊论文项目

归纳学习中的不确定性研究

期刊论文 77 会议论文 17 著作 2

同项目期刊论文

距离加权极端支持向量机

Bayesian classifiers based on probability density estimation and their applications to simultaneous

增量密度加权近似支持向量机

Local similarity and diversity preserving discriminant projection for face and handwriting digits re

A new and informative active learning approach for support vector machine

Condensed fuzzy nearest neighbor methods based on fuzzy rough set technique

基于概率神经网络和K-L散度的样例选择

LearningELM-Treefrombigdatabasedonuncertaintyreduction

Non-Naive Bayesian Classifiers for Classification Problems with Continuous Attributes

基于模糊积分的不完全小波包子空间集成人脸识别

A new approach to classifier fusion based on upper integral

Fusion of extreme learning machine with fuzzy integral

Learning ELM-Tree from big data based on uncertainty reduction

改进的基于排序熵的有序决策树算法

An improved algorithm for calculating fuzzy attribute reducts

Effective algorithms of the Moore-Penrose inverse matrices for extreme learning machine

Fuzziness based sample categorization for classifier performance improvement

Combination of OSELM classifiers with fuzzy integral for large scale classification

Maximum ambiguity based sample selection in fuzzy decision tree induction

A study on relationship between generalization abilities and fuzziness of base classifiers in ensemb

Segment Based Decision Tree Induction with Continuous Valued Attributes

Performance improvement of classifier fusion for batch samples based on upper integral

Uncertainty in learning from big data

An improved differential evolution and its application to determining feature weights in similarity-

A study on residence error of training an extreme learning machine and its application to evolutiona

A study on relationships between heuristics and optimal cuts in decision tree induction

learning from uncertainty and its application to intelligent systems of web information

概率神经网络样例选择算法

OWA operator based link prediction ensemble for social network

Dynamic ensemble extreme learning machine based on sample entropy

基于相容粗糙集技术的连续值属性决策树归纳

Optimal bandwidth selection for re-substitution entropy estimation

Maximum Ambiguity-Based Sample Selection in Fuzzy Decision Tree Induction

一种考虑地图分布信息的分层路径搜索算法

加权最大夹角间隔核心集向量机的不平衡数据分类

基于有序决策树的改进归纳算法

中心向量夹角间隔正则化核向量机

核心集径向基函数极限学习机

一种多尺度路径规划的加速方法

基于小波子空间集成的人脸识别

粗糙集与决策树比较研究

相容粗糙模糊集模型

Zadeh 扩张原理下模糊数空间的代数结构

关于模糊数H差的存在性的讨论

基于多类重采样的非平衡数据极速学习机集成学习

变精度相容粗糙集模型

一种简单有效的人脸识别方法

一种改进的样例约简支持向量机

求解随机时变背包问题的精确算法与进化算法

期刊信息

《计算机研究与发展》
中国科技核心期刊

主管单位:中国科学院
主办单位:中国科学院计算技术研究所
主编：徐志伟
地址：北京市科学院南路6号中科院计算所
邮编：100190
邮箱：crad@ict.ac.cn
电话：010-62620696 62600350

国际标准刊号：ISSN：1000-1239
国内统一刊号：ISSN：11-1777/TP
邮发代号:2-654

获奖情况:
2001-2007百种中国杰出学术期刊，2008中国精品科...,中国期刊方阵“双效”期刊

国内外数据库收录:
俄罗斯文摘杂志,荷兰文摘与引文数据库,美国工程索引,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:40349