位置:成果数据库 > 期刊 > 期刊详情页
Multiple-instance learning with instance selection via constructive covering algorithm
  • ISSN号:1007-0214
  • 期刊名称:Tsinghua Science and Technology
  • 时间:2014
  • 页码:285-292
  • 分类:TP181[自动化与计算机技术—控制科学与工程;自动化与计算机技术—控制理论与控制工程] TP183[自动化与计算机技术—控制科学与工程;自动化与计算机技术—控制理论与控制工程]
  • 作者机构:[1]Department of Computer Science and Technology and Key Lab of Intelligent Computing and Signal Processing,Anhui University,Hefei 230601,China, [2]Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China
  • 相关基金:This research was supported by the National Natural Science Foundation of China (No.61175046),the Provincial Natural Science Research Program of Higher Education Institutions of Anhui Province (No.KJ2013A016),the Outstanding Young Talents in Higher Education Institutions of Anhui Province (No.2011SQRL146),and the Recruitment Project of Anhui University for Academic and Technology Leader.
  • 相关项目:商空间链的表示与海量信息的问题求解方法研究
中文摘要:

Multiple-Instance Learning(MIL) is used to predict the unlabeled bags’ label by learning the labeled positive training bags and negative training bags. Each bag is made up of several unlabeled instances. A bag is labeled positive if at least one of its instances is positive, otherwise negative. Existing multiple-instance learning methods with instance selection ignore the representative degree of the selected instances. For example, if an instance has many similar instances with the same label around it, the instance should be more representative than others. Based on this idea, in this paper, a multiple-instance learning with instance selection via constructive covering algorithm(MilCa) is proposed. In MilCa, we firstly use maximal Hausdorff to select some initial positive instances from positive bags, then use a Constructive Covering Algorithm(CCA) to restructure the structure of the original instances of negative bags. Then an inverse testing process is employed to exclude the false positive instances from positive bags and to select the high representative degree instances ordered by the number of covered instances from training bags. Finally, a similarity measure function is used to convert the training bag into a single sample and CCA is again used to classification for the converted samples. Experimental results on synthetic data and standard benchmark datasets demonstrate that MilCa can decrease the number of the selected instances and it is competitive with the state-of-the-art MIL algorithms.

英文摘要:

Multiple-Instance Learning (MIL) is used to predict the unlabeled bags' label by learning the labeled positive training bags and negative training bags.Each bag is made up of several unlabeled instances.A bag is labeled positive if at least one of its instances is positive,otherwise negative.Existing multiple-instance learning methods with instance selection ignore the representative degree of the selected instances.For example,if an instance has many similar instances with the same label around it,the instance should be more representative than others.Based on this idea,in this paper,a multiple-instance learning with instance selection via constructive covering algorithm (MilCa) is proposed.In MilCa,we firstly use maximal Hausdorff to select some initial positive instances from positive bags,then use a Constructive Covering Algorithm (CCA) to restructure the structure of the original instances of negative bags.Then an inverse testing process is employed to exclude the false positive instances from positive bags and to select the high representative degree instances ordered by the number of covered instances from training bags.Finally,a similarity measure function is used to convert the training bag into a single sample and CCA is again used to classification for the converted samples.Experimental results on synthetic data and standard benchmark datasets demonstrate that MilCa can decrease the number of the selected instances and it is competitive with the state-of-the-art MIL algorithms.

同期刊论文项目
同项目期刊论文
期刊信息
  • 《清华大学学报:自然科学英文版》
  • 主管单位:教育部
  • 主办单位:清华大学
  • 主编:孙家广
  • 地址:北京市海淀区清华园
  • 邮编:100084
  • 邮箱:journal@tsinghua.edu.cn
  • 电话:010-62788108 62792994
  • 国际标准刊号:ISSN:1007-0214
  • 国内统一刊号:ISSN:11-3745/N
  • 邮发代号:82-627
  • 获奖情况:
  • 国内外数据库收录:
  • 美国化学文摘(网络版),美国数学评论(网络版),德国数学文摘,荷兰文摘与引文数据库,美国工程索引,美国剑桥科学文摘
  • 被引量:323