东篱科研大数据发现系统（DRDS）

位置：成果数据库 > 期刊 > 期刊详情页

视觉感知正反馈的显著性检测

ISSN号：1006-8961
期刊名称：《中国图象图形学报》
时间：0
分类：TP391.4[自动化与计算机技术—计算机应用技术;自动化与计算机技术—计算机科学与技术]
作者机构：中国计量大学信息工程学院,杭州310018
相关基金：浙江省自然科学基金项目（LY13F010004）;国家自然科学基金项目（61572449）

关键词：视觉显著性检测, 注视眼动, 机器学习, 正反馈, 视觉感知饱和, 迭代, visual saliency detection, fixational eye movement, machine learning, positive feedback, visual perceptual saturation , iterations

中文摘要：

目的人类视觉系统性能远超当前机器视觉，模拟人类视觉机制改进当前算法是有效研究途径，为此提出一种视觉感知正反馈模型，通过循环迭代、重复叠加视觉刺激生成更符合人类感知的视觉显著性图。方法首先用多种常规方法检测图像显著度，模拟人类视觉多通道特性，再组合这些显著图为综合显著图；利用显著度大的像素构建初始注视区。其次借助集成RVFL（随机向量功能网络）模拟人脑神经网络产生视觉刺激，对注视与非注视区内像素在线“随机采样—学习建模”，图像像素经模型分类获得新注视区。对新注视区与非注视区，可重复迭代进行“随机采样—学习建模—像素分类”；迭代中若注视区连续相同，则表明感知饱和，迭代终止。若将每次像素分类结果看做是一种视觉刺激，则多次视觉刺激输出叠加，可生成新的图像显著性图。最终的像素分类结果就是图像分割目标。结果将本文算法与现有方法在标准图像数据库上进行对比评测，包括通过对6种算法在ECSSD、SED2和MSRA10K 3个图像数据库上的P-R曲线，F-measure值和平均绝对误差（MAE）值上进行定量分析，对6种模型生成的显著性图作定性比较。数据表明，本文算法在SED2和MSRA10K图象数据库中性能最好，在ECSSD图象数据库中稍低于BL（bootstrap learning）和RBD（robust background detection）算法。本文算法的显著图与人类视觉感知更接近。且算法的正反馈迭代过程一般可迅速饱和，并未显著增加算法负担。实验结果表明，本文方法可作为一种有效的后处理手段，显著提升常规显著性检测算法的性能。结论提出了一种模拟人类视觉机制的数据驱动显著性检测算法，无需图像先验知识和事先的标记样本。面对多目标，背景复杂等情况，本文方法具有相对好的鲁棒性和适用性，并且能够较好?

英文摘要：

Objective The performance of current machine vision is inferior to that of human vision. Simulating human visual mechanism can improve existing algorithms. The human visual system can detect objects with high acuity and focus its attention on a region relevant to the current visual task. These advantages are all attributed to the visual attention mechanism. Humans accept attention by making a series of eye movements. Eye movement has two forms： saccades and microsaccades. 1） In the saccades stage, the human eyes aim to find a candidate object, thereby sharply shifting in the entire field of view. 2） While candidates are identified as a target, the eyes will make a series of dense tiny movements called microsaccades around the target to intensify objects and inhibit noises. Continuous microsaccades will lead to visual fading, and the eye movement will switch to the saccades stage to find new objects. The integration of saccades and microsaccades contribute to the rapid and efficient performance of the human vision system. This paper presents a novel saliency detection framework by simulating microsaccades and visual fading. The constructed positive feedback loop focuses on a fixation area and intensifies objects to provide saturation of visual perception that leads to visual fading. In this loop, multiple random sampling of the gaze area is used to simulate the behavior of microsaccades, and random vector functional link networks（RVFL） are utilized to simulate the human neural system to produce binary visual stimulus. The proposed framework is totally data-driven and does not require any prior knowledge and labeled samples. Method First, the conventional saliency detection methods could be used to produce a variety of saliency map. We group these saliency maps to an integrated saliency map to simulate multi-channel visual perception. The integrated saliency map can be subjected to further thresholding to form an initial fixation area. The following multiple random sampling could be executed from the pixe

同期刊论文项目

有记忆信源率失真模型及高并行度HEVC编码算法优化方法

期刊论文 3

同项目期刊论文

基于失真传递的时域自适应量化算法

视频监控系统的android终端设计与实现

期刊信息

《数码影像》

主管单位:
主办单位:中国图象图形学学会中科院遥感所北京应用物理与计算数学研究所
主编：
地址：北京市海淀区花园路6号
邮编：100088
邮箱：
电话：010-86211360 62378784

国际标准刊号：ISSN：1006-8961
国内统一刊号：ISSN：11-3758/TB
邮发代号:

获奖情况:

国内外数据库收录:

被引量:0