东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

大数据环境下多决策表的区间值全局近似约简

ISSN号：1000-9825
期刊名称：《软件学报》
时间：0
分类：TP181[自动化与计算机技术—控制科学与工程;自动化与计算机技术—控制理论与控制工程]
作者机构：[1]上海电力学院计算机科学与技术学院,上海200090, [2]同济大学电子与信息工程学院,上海200092
相关基金：国家自然科学基金（61272437,60305094）;上海市教育委员会科研创新项目（12YZ140,14YZ131）;上海市自然科学基金（13ZR1417500）

作者：徐菲菲[1], 雷景生[1], 毕忠勤[1], 苗夺谦[2], 杜海舟[1]

关键词：大数据, 区间值, 近似约简, 多决策表, 全局约简, big data, interval-value, approximate reduction, multi-decision tables, global reduction

中文摘要：

在电力大数据中，很多具体的应用如负荷预测、故障诊断都需要依据一段时间内的数据变化来判断所属类别，对某一条数据进行类别判定是毫无意义的。基于此，将区间值粗糙集引入到大数据分类问题中，分别从代数观和信息观提出了基于属性依赖度和基于互信息的区间值启发式约简相关定义和性质证明，并给出相应算法，丰富和发展了区间值粗糙集理论，同时为大数据的分析研究提供了思路。针对大数据的分布式存储架构，又提出了多决策表的区间值全局约简概念和性质证明，进一步给出多决策表的区间值全局约简算法。为了使得算法在实际应用中取得更好的效果，将近似约简概念引入所提的3种算法中，通过对2012上半年某电厂一台600MW的机组运行数据进行稳态判定，验证所提算法的有效性。实验结果表明，所提的3种算法均能在保持较高分类准确率的条件下从对象和属性个数两方面对数据集进行大幅度缩减，从而为大数据的进一步分析处理提供支撑。

英文摘要：

For the big data on electric power, many specific applications, such as load forecasting and fault diagnosis, need to consider data changes during a period of time to determine their decision classes, as deriving a class label of only one data record is meaningless. Based on the above discussion, interval-valued rough set is introduced into big data classification. Employing algebra and information theory, this paper defines the related concepts and proves the properties for interval-valued reductions based on dependency and mutual information, and presents the corresponding heuristic reduction algorithms. The proposed methods can not only enrich and develop the interval-valued rough set theory, but also provide a new way for the analysis of big data. Pertaining to the distributed data storage architecture of big data, this paper further proposes the interval-valued global reduction in multi-decision tables with proofs of its properties. The corresponding algorithm is also given. In order for the algorithms to achieve better results in practical applications, approximate reduction is introduced. To evaluate three proposed algorithms, it uses six months’ operating data of one 600MW unit in some power plant. Experimental results show that the three algorithms proposed in this article can maintain high classification accuracy with the proper parameters, and the numbers of objects and attributes can both be greatly reduced.

同期刊论文项目

电力监测无线传感器网络中感知数据的容忍与查询

期刊论文 5

同项目期刊论文

一种基于数据流的软子空间聚类算法

关键词有序排列的全文检索算法

面向对象技术在软件自动化测试中的应用

期刊信息

《软件学报》
北大核心期刊（2011版）

主管单位:中国科学院
主办单位:中国科学院软件研究所中国计算机学会
主编：赵琛
地址：北京8718信箱中国科学院软件研究所
邮编：100190
邮箱：jos@iscas.ac.cn
电话：010-62562563

国际标准刊号：ISSN：1000-9825
国内统一刊号：ISSN：11-2560/TP
邮发代号:82-367

获奖情况:
2001年入选中国期刊方阵“双百期刊”,2000年荣获中国科学院优秀科技期刊一等奖

国内外数据库收录:
俄罗斯文摘杂志,美国数学评论（网络版）,波兰哥白尼索引,德国数学文摘,荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘,英国科学文摘数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:54609