位置:成果数据库 > 期刊 > 期刊详情页
基于Biased-SVM的非平衡半监督分类算法
  • ISSN号:1003-4978
  • 期刊名称:《河南大学学报:自然科学版》
  • 时间:0
  • 分类:TP391[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
  • 作者机构:[1]西南交通大学智能控制开发中心,成都610031, [2]河南大学药学院,河南开封475004
  • 相关基金:国家自然科学基金项目(61175055,61305074)
中文摘要:

针对非平衡数据的半监督分类问题,提出了一种基于Biased-SVM的非平衡半监督分类算法.该方法首先利用初始的标记样本集训练处理不平衡数据的Biased-SVM模型,然后用训练好的Biased-SVM模型为未标记样本加上标签,再把新标记样本加入到初始标记样本集中,重新训练Biased-SVM模型,最后在测试集上进行测试.选取公共数据库里的一些数据集进行实验,首先在两类不平衡数据集上实验的结果表明,在标记样本所占比例为20%~80%时,所提方法能够在不降低数据集整体G-mean值的基础上,提高小类的F-value值并具有较高的稳定性;然后在多类不平衡数据集上实验的结果表明,在标记样本所占比例为20%~80%时,所提方法能够在不降低数据集整体的EG-mean值的基础上,提高小类识别率并具有较高的稳定性.

英文摘要:

In view of the semi-supervised classification problem for imbalaneed data, a new semi-supervised learning algorithm based on Biased-SVM was proposed. The steps of the proposed algorithm were as follows.. Firstly, the Biased-SVM model that could dispose the unbalanced samples data was trained by the initial labeled sample set. Secondly, the trained Biased-SVM model was used to add labels to the unlabeled samples. Thirdly, the new labeled samples were added to the initial labeled sample set, and the Biased-SVM model was retrained. Finally, the classifier performance was tested. The proposed method was tested in several benchmark data sets. First, according to some binary unbalanced data sets, the experimental results showed that the proposed method not only improved the G- mean value and the F-value of the minor class effectively, but also had higher stability when the labeled sample rate was 20%--80%.Second, some multi-class unbalanced data sets were selected, and the experimental results showed that the presented method not only increased the EG-mean value and the precision of the minor class effectively, but also had higher stability when the labeled sample rate was 20%--80%.

同期刊论文项目
期刊论文 279 会议论文 102 获奖 8
同项目期刊论文
期刊信息
  • 《河南大学学报:自然科学版》
  • 中国科技核心期刊
  • 主管单位:河南省教育厅
  • 主办单位:河南大学
  • 主编:乔家君
  • 地址:河南省开封市明伦街85号
  • 邮编:475001
  • 邮箱:xbzrb@henu.edu.cn
  • 电话:0378-2860394
  • 国际标准刊号:ISSN:1003-4978
  • 国内统一刊号:ISSN:41-1100/N
  • 邮发代号:36-27
  • 获奖情况:
  • 河南省优秀科技期刊一等奖,河南省高校优秀自然科学学报,全国学术期刊规范执行优秀奖
  • 国内外数据库收录:
  • 美国化学文摘(网络版),德国数学文摘,中国中国科技核心期刊,中国北大核心期刊(2004版),中国北大核心期刊(2008版),中国北大核心期刊(2011版)
  • 被引量:5635