节点文献

全局显著信息指导下的轮廓编组计算模型研究

Research on Computational Model of Contour Grouping Guided by Global Saliency Information

【作者】 钟晶晶

【导师】 罗四维;

【作者基本信息】 北京交通大学 , 计算机应用技术, 2010, 博士

【摘要】 知觉组织作为人类视觉感知系统的重要组成部分,近年来受到从事神经科学、认知心理学和计算机科学等多领域研究人员的高度关注。知觉组织的过程为自然界中的光信号和感知目标之间架起一道桥梁,是人类进行图像识别、注意分配以及记忆存储等高级加工过程的基础。轮廓编组作为以图像边缘为编组对象的一种知觉组织,是知觉组织研究中不可或缺的一部分。轮廓编组模型能够发现图像中的显著结构,是定义和获取感知目标的重要工具,为实现基于目标的注意模型、目标检测和目标识别等模型提供保障。因此研究知觉组织的认知和神经机制,建立符合人类感知特性的高效的轮廓编组模型,是当前视觉感知系统信息处理理论研究中一个非常重要的问题。将认知心理学和神经科学中有关知觉组织的研究成果应用到计算模型中,是建立符合人类感知特性的轮廓编组模型的关键。本文以心理学中格式塔的知觉组织规则为依据,将心理学和神经科学中的研究成果作为理论基础,从统计学的角度出发,强调注意对知觉组织过程的影响作用,提出了符合自然图像统计特性的格式塔编组线索量化模型,并构建了全局显著信息指导下的轮廓编组模型。本文的研究内容依据建立轮廓编组模型的三个步骤可分为三大部分。第一部分从优化轮廓编组输入的角度出发,设计实现了适合于轮廓编组的自然图像中的边缘检测算法。第二部分通过分析人类所接受的外部信息的统计特性,建立了边缘间格式塔编组线索的量化模型。第三部分确定轮廓编组算法的优化目标和优化过程,从中体现出注意对知觉组织的作用,实现对自然图像的轮廓编组。本文的主要创新点有:第一,提出了一种基于全局显著信息的多尺度边缘检测算法。与其它边缘检测算法相比,该算法能够获得较好的边缘检测结果,更符合轮廓编组的输入要求。算法采用非线性的方法将多种特征下的边缘检测结果进行融合,保证边缘的准确和完整;以图像中显著区域的边界位置作为边缘检测的空间先验知识,有效去除噪声和细节边缘;将空间位置相邻的边缘点连接成具有一定长度的边,确保检测结果的稳定性和对轮廓编组算法的适用性;最后在多尺度的变化中不断更新空间先验知识和追踪每条边缘的演化过程,用每条边从出现到消亡过程中的能量总和来度量其显著性。第二,提出了轮廓编组中新的边缘相似性度量方法-有向边区域度量。利用自行开发的交互式的图形化标注工具,人工标注得到了规范且准确的有向边缘数据集,并在此数据集上,以正则化方差准则为依据,具体讨论了影响有向边相似性度量的几个关键因素,明确给出了有向边区域度量的最佳参数。有向边区域度量的边缘相似性度量方法克服了边缘连接方式的不确定性,能够有效提高相似性线索在轮廓编组中的重要性。在有向边缘数据集上的统计信息显示,有向边区域度量明显优于已有的其它边缘相似性度量方法。第三,建立了轮廓编组中生成式的编组线索合并模型。结合认知心理学中有关格式塔规则的研究,将邻近性作为轮廓编组中的主导线索,讨论了在不同的邻近性条件下连续性和相似性线索的联合分布情况。用生成式的编组线索合并模型,拟合了连续性和相似性线索在人工标注数据集上具有特殊形式的联合分布,准确地描述了编组线索之间的相关性。此模型摒除了以往判别式模型中对于线索不相关的假设,更准确的拟合了由自然图像统计得到的格式塔编组线索的统计特性,是更为精确的格式塔线索合并模型。第四,提出了全局显著信息指导下的轮廓编组算法,并通过对轮廓编组算法的层次化实现了基于目标的注意。该轮廓编组算法以心理学中有关注意对知觉组织作用的研究成果为依据,用全局显著信息来指导轮廓编组的过程,实现对自然图像中显著感知目标的轮廓编组。模拟注意中的同物效果和禁止返回机制,将轮廓编组算法层次化,实现了注意在感知目标中的层次化转移。在自然图像上的实验结果表明,该轮廓编组算法能够编组得到图像中的显著感知目标,在绝大多数情况下获得了比其它算法更好的编组结果。在此基础上实现的基于目标的注意,其注意焦点转移路径更加规整,也更加符合人类的视觉感知。

【Abstract】 Perceptual organization is an important mechanism of human visual perception system. It attracts many researchers who employ themselves in neurophysiology, cognitive psychology, computer science and other fields in recent years. The process of perceptual organization is a link between the optical signals in natural world and the perceptual objects. It is also the basis of many high level visual tasks, such as image recognition, attention assignment, memory storage and so on. Contour grouping, which takes image edges as grouping elements, is an important part of perceptual organization. The contour grouping model can find salient structures in images, and is an important tool for the definition and obtainment of perceptual objects. It can be seemed as the foundation of constructing object-based attention model, object detection model and object recognition model. Researching the cogni-tive and neural mechanism of perceptual organization to design a contour grouping model with high efficiency which accords with human perception system is indis-pensable in the study of visual perception system inspired information processing theory.Applying the research results about perceptual organization in cognitive psy-chology and neurophysiology into computational models is the key of constructing contour grouping models which accord with human perception characteristics. In this dissertation, we propose the quantized model of Gestalt grouping cues and con-struct a contour grouping model guided by global saliency information. All of these studies are based on the Gestalt grouping rules and the research results in psychol-ogy and neurophysiology, take Statistic as the tool, and mainly emphasize the role of attention in perceptual organization.According to the three stage of constructing a contour grouping model, the study of this dissertation can be divided into three parts. The first part aims at improving the input quality of contour grouping. In this part, we design and realize the natural image edge detection algorithm which is suitable for contour grouping. The second part analyses the statistical properties of the natural images, and give a reasonable definition and quantization of gestalt grouping cues between directed tangents. The third part defines the grouping cost based on the role of attention in perceptual organization, gives the optimization of the grouping cost, and finally realizes the contour grouping in natural images. The main innovative points of the dissertation are as follows:First, we propose a multi-scale boundary detection algorithm based on global saliency information. Compared with other algorithms, our algorithm can get better detection results, which are more proper for contour grouping. Nonlinear combina-tion of the edge detection results in multiple features can ensure the accuracy and integrity of the edges. The spatial prior getting from the boundaries of salient re-gions can remove noises and trivial edges effectively. Linking neighbor edge pixels to form edges with a certain length can make the detection results more steady and proper for contour grouping. We update the spatial prior and trace each edge across multiple scales, and evaluate the saliency of each edge according to the edge energy existing in its whole life.Second, we propose a new similarity measurement for directed tangents in con-tour grouping, which is called directed edge region measurement. We develop a graphic interactive tangent label tool, and by which we construct a standard and accurate human labeled directed tangents set. We discuss the key factors which impact the similarity measurement and give the best parameters according to nor-malized variation standard for similarity measurement on labeled directed tangents set. The directed edge region measurement for tangent similarity measurement over-comes the indefiniteness of tangent groups, which can improve the importance of similarity cue in contour grouping. Comparison with other similarity measurements on the tangents set shows that the directed edge region measurement is much more accurate.Third, we construct a generative grouping cue combination model for contour grouping. This model makes no assumptions of the independence of grouping cues like other discriminative models, which can lead to a more satisfying description of the statistic properties of gestalt grouping cues. The generative model is a more exact gestalt cue combination model than other models. We use the proximity as a key grouping cue, and discuss the joint probability distribution of continuity and similarity in different proximity conditions. Using the generative cue combination model, we fit the joint probability distribution of continuity and similarity on human labeled set which has a special form, and give a more accurate description of the correlation of gestalt cues.Forth, we propose a contour grouping algorithm guided by global saliency infor-mation, and realize the object-based attention model based on the hierarchical con-tour grouping algorithm. We use the global saliency information to guide the process of contour grouping on the basis of the research achievements about the attention role in perceptual organization in psychology, and obtain the contour grouping of salient perceptual objects in natural images. We consider both the same object ef-fect and inhibition of return in attention, and realize the hierarchical attention shift among perceptual objects using hierarchical contour grouping model. The experi-mental results on natural color images show that our contour grouping algorithm can group the salient perceptual objects and is more effective than other grouping algorithms in most images. Also, the attention shifts of object-based attention are hierarchical and are agreed with human visual perception.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络