节点文献

视觉显著性模型研究及其在影像处理中的应用

Research on Visual Attention Models and Application Imagery Processing

分页下载
分章下载
整本下载
在线阅读
不支持迅雷等下载工具，请取消加速工具后下载。

【作者】李志强；

【作者基本信息】上海交通大学，模式识别与智能系统， 2009，博士

【摘要】随着仿生学的发展,在计算机视觉研究领域,研究者已使用视觉神经解剖学和神经心理学领域的研究成果指导计算机视觉研究,通过模仿人类视觉特性,构造出更灵活、更先进的计算机视觉算法。视觉注意模型正是基于仿生学发展起来的,其能够快速搜索到人类感兴趣的目标,该类目标被称为显著性目标。该类模型被称为显著性模型。目前,在视觉注意模型研究领域,一个具有代表性的视觉注意模型是Itti模型。该模型是模拟人类自底向上的视觉特性产生的,但作者并没有解释模型中的数学算法为何能够模拟人类的自底向上特性,导致较难理解算法本质,影响显著性模型研究的推进。基于此,本文深入分析了该模型。依据分析,提出了一些新的视觉注意模型,同时将Itti模型应用到遥感影像变化检测中。具体为:1、在Itti模型中,使用高斯金字塔产生强度显著图。在本文研究中,平均金字塔、小波低通金字塔分别能被用于产生强度显著图。实验发现,来自上述三类金字塔的强度显著图彼此非常相似。本文从数学和图像处理角度深入分析了平均金字塔产生的强度显著图。通过分析发现,强度显著图中突出的区域为与背景对比强烈的区域,同时指出,来自于高斯金字塔或小波金字塔的强度显著图具有同样属性。即显著区域仍为与背景对比强烈的区域。2、仍使用Gabor金字塔,但改变金字塔图像结合方式,提出了四种新的生成方向特征图方法,并且它们产生的方向特征图与Itti模型产生的方向特征图相似。本文从数学和图像处理角度深入分析了这些方向特征图,发现在方向特征图中,显著性区域仍为与背景对比强烈的区域。在分析方向特征图过程中,归纳出了函数或算法能用于产生方向特征图的条件,基于这些条件,理论上推测出一些现有算法能够被用于产生方向显著图。其中有一个条件,Gabor函数不完全满足。如果能够提出一个函数,完全满足该条件,将产生更好的方向特征图。3、基于归纳出的生成方向特征图条件,构造了三个新的用于产生方向特征图的函数。其中一个函数相似于Gabor函数,其产生的方向特征图相似于Gabor函数产生的方向特征图。其它两个函数比Gabor函数和刚提及的函数简单,但能完全满足Gabor函数不完全满足的条件,因此能产生更好的方向特征图。实验证实了该结论。4、基于归纳出的产生方向特征图条件,提出了两种新的产生显著图方式。一种是使用图像离散余弦变换相位谱信息产生显著图;另一种是使用小波变换产生显著图。通过实验发现,两类方法产生的显著图都能较准确突出人类关注的目标。5、深入分析Itti模型产生颜色显著图部分发现,产生颜色显著图方式相似于产生强度显著图方式。不同之处,仅为输入的数据不同。一为强度分量,另一为颜色分量。基于此,推断出平均金字塔、小波低通金字塔也能被用于产生颜色显著图。强度显著图中突出的区域为与背景对比强烈的区域,方向显著图中突出的区域仍为与背景对比强烈的区域。由此,推断出所有产生方向显著图的方式都可用于产生颜色显著图。此外,本文还深入分析了Itti模型对噪声鲁棒的原因,并指出其存在的不足。6、在遥感影像变化检测中,噪声是影响变化检测准确性的一个重要因素。本文将视觉显著性模型应用到变化检测,减少噪声对变化检测的影响,提高检测的准确性。此外,本文还深入研究了边缘分组模型。该类模型又被称为形状显著性模型,属于视觉注意模型。通过研究发现,目前大部分边缘分组模型仅考虑完全形态心理学中的封闭性、紧凑性、平滑性、对称性和凸性几个指标,没有将完全形态心理学的平行性指标引入边缘分组。基于此,本文将完全形态心理学的平行性指标引入边缘分组,构建了一个新的边缘分组模型,用于检测遥感影像中的机场目标。更多还原

【Abstract】 With the development of bionics, many researchers in the computer vision have developed many novel machine algorithems in terms of outcomes from the research of neuroanatomy and visual neurophysiology. By simulating the characteristcs of human vision, some novel computer vision models are proposed. Visual attention models, which are included in these computer vision models, are proposed by simulating the bottom-up phase of human vision. They can be used to detect important objects which attract human eye in scene.In visual attention models, one classic and representative model is Itti model [8]. This model can process a scene image to generate a saliency map in which some objects, which attract human eye and are named as saliency objects, in the scene image are popped out. As well known, the reason why the saliency map from Itti model can pop out saliency objects was just explained in terms of the viewpoint of biologically-plausible, which results in an obstacle that it is hard to understand the real nature of Itti model. In order to find the real nature of Itti model, we analyse the model in detail from the viewpoint of image processing and mathmatics. Based on the analysis, we find the reason why Itti model can pop out saliency objects and propose some new ways to generate saliency map. These new ways and theory analysis are described as follows.(1) In Itti model, Gaussian pyramid is used to generate intensity conspicuity map. In our research, an interesting phenomenon is discovered. The phenomenon is that all of low-pass pyramids, including Gaussian pyramid, average pyramid, and wavelet pyramid generated by using the low-pass part of wavelet transform, can be used to generate intensity conspicuity map. Furthermore, these intensity conspicuity maps from low-pass pyramids are very similar to each other. As well known, the reason why intensity conspicuity map from Itti model can pop out saliency objects was just explained in terms of the viewpoint of biologically-plausible, which results in an obstacle that it is hard to understand the real nature of the intensity conspicuity map. In this paper, intensity conspicuity map from average pyramid is analyzed in detail from the aspect of image processing. The reason why the regions that have high intensity contrast can be popped out in the intensity conspicuity maps is explained. Meanwhile, the reason, why the conclusion from analyzing the intensity conspicuity map from average pyramid can be seen as the conclusion of the intensity conspicuity maps from all of low-pass pyramids, will be explained briefly.(2) Orientation conspicuity map is an important element in forming saliency map. Here, we discover other four ways which can be used to generate orientation feature maps besides the way used in Itti model. The orientation feature maps from these ways are similar to each other. We analyze these ways of generating orientation feature maps from the viewpoint of image processing. Based on the analysis, we find that the regions having high intensity contrast can be popped out in orientation conspicuity map.(3) We abstract three requirements which are used to ensure that the orientation conspicuity map from Gabor filter can be used to saliency detection. In addition, besides the three requirements, we add a modified requirement. If a new function satisfies the modified requirement besides the three requirements, the new function would be superior to the Gabor fitler when they are used to generate orientation conspicuity maps. Based on the theoretical analysis for orientation conspcuity map from Gabor fitler and four requirements, we propose three new functions which can be used to generate orientation conspicuity maps. The orientation conspicuity maps from two of three new functions will be better than the orientation conspicuity maps from Gabor fitler when they are used to generate orientation conspicuity maps.(4) Based on the theory analysis for orientation conspicuity map from Gabor filter, we propose two new ways to generate orientation map and analyse an existing saliency model. A new saliency model is based on wavelet transform. The other is based on phase spectrum of color information.(5) Color conspicuity map is an important component in the process of forming saliency map. In this paper, we study the way of generating color feature map and find that it is similar to the way of generating intensity feature map. Therefore, all of the low-pass pyramids used in generating intensity feature map can be applied to color feature map. Because in intensity feature map and orientation conspicuity map all the salient regions describe the intensity contrast between object and background, the method of generating orientation conspicuity map can also be used to generate color conspicuity map. Itti model has the merit of robust to noise. We analyze the model and discover that the robustness comes from the operation that all of the feature maps under different scales are resized to a same scale (σ=4). Further, we verify the theoretical analysis of the two aspects of saliency map studied in this paper by experiments.(6) A novel technique based on visual attention and context-sensitive is proposed for noise reduction in unsuperivised change detection. The technique is composed of two steps. The first step is that the intensity conspicuity maps algorithm of Itti model is used to process the difference image produced by comparing images acquired on the same area at different times. And a comparison map is produced. The second step is as follows: Bayes rule is used to distinguish the changed pixel in the comparion map. A changed detection map is made. Then, Markov Rondom Fields model is used to process the changed detection map. And the false changed pixels are removed. Experimental results confirm that the model can still detect the changed areas exactly when the noise intensity value in the images acquired on same area at different time is very large.Furthermore, a novel edge-grouping model is proposed in this paper. Edge-grouping belongs to visual attention. Most of existing edge-grouping models only detect the boundaries with closure, good continuation, proximity, convex and symmetry. In the poposed model, the boundaries of parallelism structure can be detected. This model is applied to airport detection. The accuracy of this model for airport detection is attractive.更多还原

【关键词】视觉注意模型；高斯金字塔； Gabor金字塔；变化检测；边缘分组方法；机场检测；显著性目标；显著性图；
【Key words】 visual attention model； Gaussian pyramid； Gabor pyramid； change detection； edge-grouping method； airport detection； saliency object； conspicutity map；

【网络出版投稿人】上海交通大学

【分类号】TP391.41
【被引频次】12
【下载频次】1429
攻读期成果

知网节下载

节点文献中：

本文链接的文献网络图示:

本文的引文网络

节点文献