节点文献

图像边缘的感知编组研究

Research on Perceptual Grouping of Image Edge

【作者】 吕剑

【导师】 陈俊杰;

【作者基本信息】 太原理工大学 , 计算机应用技术, 2010, 硕士

【摘要】 图像的主要特征包括了颜色、纹理和形状。其中“形状”这一特征对于物体识别的重要性可以鉴诸于人类视觉对物体的视觉感知,同时人类的视觉感知能力也为计算机视觉提供了参考模型。在实际应用当中,“形状”主要指的就是物体的边缘。图像边缘检测技术目前趋于成熟,厄待引入新的理论、方法,所以将新兴的感知编组技术与之相结合,无疑是一项很有意义的尝试。感知编组理论源自于心理学的格式塔理论,其理论研究的主要对象是知觉和解决问题的过程。在计算机视觉领域里,感知编组理论就是利用人类认知事物的规律进行来推演的理论。换言之,通过感知编组可以把视觉系统获取的原始数据组织成为有意义的组合或结构。其中,感知编组的编组规则主要借鉴自格式塔理论,而目前为止这些规则只有很少的一部分被应用到计算机视觉领域当中。其原因一方面在于一些规则的抽象性,增大了实现的难度,另一方面在于感知编组技术仍然处于研究当中,尚待完善。图像边缘检测技术旨在利用图像周围像素灰度变化来识别图像边缘。然而这样检测到的边缘本身只是像素点的集合,还无法直接应用感知编组原理。因此本文在对图像边缘的感知编组研究当中,于边缘检测之后加入了边缘连接与直线段检测,再进一步应用感知编组的原理对这些直线段进行编组得出物体的边缘轮廓。本文的主要内容如下:1.对源图像进行边缘检测:鉴于传统边缘检测算法种类繁多,首先对经典的边缘检测算法(Roberts算子法、Sobel算子法、Prewitt算子法、LOG算子法、Canny算子法)进行对比测试,最后选择效果最佳的Canny算子法作为边缘提取的工具,得到图像的边缘集合。2.边缘连接并组合成直线段:霍夫变换法是一种噪声敏感度低且很有效的边缘连接及直线检测方法。本文在分析霍夫变换原理之后,提出利用梯度阈值筛选有效像素点、用间隔阈值分割直线的改进方法。将第一步骤检测到的边缘集合利用改进后的霍夫变换进行直线段检测,得到感知编组的候选直线段组合。经实验证实,降低了计算复杂度并避免了小直线段的丢失。3.感知编组:前面两个步骤得到的图像边缘仅仅是经过了像素级的处理,与我们视觉认知到的物体轮廓有相当的差距,并不是图像中物体的真正轮廓,所以需要模拟人的感知能力进行进一步的完善,这一过程主要是通过制定一些规则进行计算,诸如邻近性规则、相似性规则、平行性规则、封闭性规则等,从而得到最后的编组。本文就一些主要的规则提出几组概率模型作为编组规则,经过实验证实了其可行性与有效性。

【Abstract】 The main features of image include color, texture and shape.“Shape”is very important in object recognition, which can be proved in human visual perception. In the mean time, the human visual perception ability can also provide reference models for the computer vision. By the way, the“shape”exactly means“edge”in practice. Nowadays, the Image Edge Detection technology has been developed, there are urgent requirements for introduction of new theories and methods. Thus, the combination of the Perceptual Organization technology and the Image Edge Detection technology will be a significant attempt undoubtedly.Perceptual Organization Theory is derived from the Gestalt Theory in psychology, and its mainly research objects are processes of perception and problem-solving. In the computer vision field, this theory utilize the human’s perception laws, in other words, we can organize significant combinations or structures by the original data from vision system. Laws of the Perceptual Organization Theory mainly come from the Gestalt Theory. However, few of the laws can be applied in the computer vision field at present, reasons are complicated: first, it is too abstract to carry out for some laws; then second, Perceptual Organization Theory is a new developing technology.The Image Edge Detection method takes advantage of the gray scale alternation to identify edges. However, in spite of the noise sensitivity and the detection precision, these edges we got are only sets of the pixels, and can not be applied to the Perceptual Organization principle directly. Thus, in the research on perceptual grouping of image edge, we add a process of edge connection and straight line segments detection after the Edge Detection, and then organize and get the contour by the ways of the Perceptual Organization Principle. The main content of the paper is as follows.1. Detect edges of the original image: For the amount of the classical edge detection algorithms, we have a contrast test on the classical algorithms at first, including the Roberts, Sobel, Prewitt, LOG, and the Canny algorithm. Finally we choose the most effective method-Canny algorithm to extract edges, and get the edge-sets.2. Connect the edge-sets and change them into straight line segments: The Hough Transform is a low noise sensitivity and effective method for edge-connection and straight line detection. By analyzing the principle of the Hough transform, we propose an improved method, which makes a gradient threshold to select valid pixels and another gap threshold to segment straight lines. Thus, we can take advantage of the improved Hough Transform to receive candidate of straight line-sets. Experiment confirmed that the improved method reduces computational complexity and avoid the loss of small straight line segments.3. Perceptual Grouping: Through the two steps above, we only get the pixel-level edges, but it is far away from the object contour obtained by human vision system, not exactly the real contour. So we need to simulate human vision system to constitute some laws, such as Proximity, Similarity, Parallelism, Closure laws, and etc, then we find the final groups. In this paper, we choose some of the major rules and propose several probability models as the grouping laws. Finally, experiments confirmed its feasibility and effectiveness.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络