东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

一种基于离散度平衡的降维算法

ISSN号：1003-9775
期刊名称：《计算机辅助设计与图形学学报》
时间：0
分类：TP311.13[自动化与计算机技术—计算机软件与理论;自动化与计算机技术—计算机科学与技术]
作者机构：[1]宁波大红鹰学院信息工程学院,浙江宁波315175, [2]大连理工大学电子信息与电气工程学部计算机科学与技术学院,辽宁大连116024
相关基金：基金项目：国家自然科学基金项目（61370200）;宁波市自然科学基金项目（2013A6i0115,2014A610073）;浙江省教育厅一般科研项目（Y201432717）

关键词：高效用模式, 频繁模式, 频繁项集, 数据挖掘, TOP—K, high utility pattern, frequent pattern, frequent itemsets, data mining, TOP-K

中文摘要：

目前TOP—K高效用模式挖掘算法需要产生候选项集，特别是当数据集比较大或者数据集中包含较多长事务项集时，算法的时间和空间效率会受到更大的影响．针对此问题，通过将事务项集和项集效用信息有效地保存到树结构HUP—Tree，给出一个不需要候选项集的挖掘算法TOPKHUP；HUP—Tree树能保证从中计算到每个模式的效用值，不需要再扫描数据集来计算模式的效用值，从而使挖掘算法的时空效率得到较大的提高．采用7个典型数据集对算法的性能进行测试，实验结果证明TOPKHUP的时间和空间效率都优于已有算法，并对K值的变化保持平稳．

英文摘要：

M mining, and utility thresh TOP-K high utility pattern from a it aims to mine the patterns whose utilitie old. At present, it has been a topic in data K high utility pattern generate candidat of a dataset~ this hinders their performa large or there are many long transaction structure called HUP-Tree （high utility dataset is an extension of frequent pattern s are higher than a user-specified minimum mining. Existing algorithms of mining TOP- e itemsets in the mining process and they need multiple scans nce of runtime and memory usage, especially when a dataset is itemsets in a dataset. To address this issue, we propose a tree pattern tree） to maintain transaction values, and we also give an algorithm named TOPKHUP （TOP-K high uti TOP-K high ut utility value of algorithm is eff itemsets and their lity pattern） that ility patterns without generating candidates. HUP-Tree ensures efficient retri each p ectively attern without a improved. Seven dditional scan of the dataset, so the performance classical real and synthetic datasets are used in the utility mines eval of of the testing experiments and the results show that the proposed algorithm outperforms state-of-the-art algorithms significantly for both runtime performance and memory usage, and it is more stable along the change of the value K.

同期刊论文项目

基于流形学习和时序语义网挖掘的人体运动序列分析研究

期刊论文 23 会议论文 1

大数据环境下高维数据流挖掘算法及应用研究

期刊论文 5

同项目期刊论文

An Algorithm for Mining High Average Utility Itemsets based on Tree Structure

一种基于MapReduce的动态数据流分类算法

高维数据中鲁棒激活函数的极端学习机及线性降维

人体运动分割算法：序列局部弯曲的流形学习

GLPCA: Dimensionality Reduction for Image Retrieval

Scantter Balance: an Angle-based Supervised Dimentionality Reduction

Multi-label dimensionality reduction and classification with extreme learning machines

Robust activation function and its application: Semi-supervised kernel extreme learning method

Extend Semi-supervised ELM and a Framework

Sliding Window- based Frequent Itemsets Mining over Data Streams using Tail Pointer Table

Maximal Similarity Embedding

一种新的局部空间排列算法

MapReduce框架下的实时大数据图像分类

人体运动分割算法-序列局部弯曲的流形学习

高效用项集挖掘算法

自适应融合目标和背景的图像特征提取方法

一种不确定数据集上频繁模式挖掘的近似算法

基于DNA计算的多维分配数据关联算法①

Multi-label dimensionality reduction and classification with extreme learning machines

MapReduce框架下的实时大数据图像分类

自适应融合目标和背景的图像特征提取方法

基于模式增长的不确定数据的频繁模式挖掘算法

分组排序多特征融合的图像检索方法

期刊信息

《计算机辅助设计与图形学学报》
北大核心期刊（2011版）

主管单位:中国科学技术协会
主办单位:中国计算机学会
主编：鲍虎军
地址：北京2704信箱
邮编：100190
邮箱：jcad@ict.ac.cn
电话：010-62562491

国际标准刊号：ISSN：1003-9775
国内统一刊号：ISSN：11-2925/TP
邮发代号:82-456

获奖情况:
第三届国家期刊奖提名奖

国内外数据库收录:
俄罗斯文摘杂志,荷兰文摘与引文数据库,美国工程索引,英国科学文摘数据库,日本日本科学技术振兴机构数据库,中国中国科技核心期刊,中国北大核心期刊（2004版）,中国北大核心期刊（2008版）,中国北大核心期刊（2011版）,中国北大核心期刊（2014版）,中国北大核心期刊（2000版）

被引量:24752