节点文献

立体匹配关键技术研究

Research of Key Technology for Stereo Matching

【作者】 卢思军

【导师】 唐振民;

【作者基本信息】 南京理工大学 , 模式识别与智能系统, 2011, 博士

【摘要】 立体匹配是计算机视觉最活跃的研究问题之一,立体匹配的目标是从同一景物不同角度的两幅或多幅图像中求得景物的立体深度,可以为三维重建、机器人视觉、自主车导航等提供有用的信息,但是立体匹配是一个病态问题,准确地恢复景物视差(深度)目前仍然面临很大挑战。因为在计算机视觉技术中,双目视觉更接近于人的双眼视觉原理,并且在实际应用中更容易实现,所以本课题对双目视觉的立体匹配相关理论及一些关键技术进行了研究,尝试对不同需求建立相应的算法,并通过对真实图像的实验证明了算法的可行性和有效性。目前固定窗口的立体匹配方法有重要的缺陷,原因是没有办法选择一个统一大小尺寸的匹配窗口。一方面,窗口尺寸要尽量大,以便为可靠匹配包容足够的灰度变化;另一方面,窗口尺寸要尽量小,以避开投影畸变的影响。为了解决这一问题,得到准确的视差图,本文研究了在不同纹理和邻域结构的像素如何采用自适应窗口这一问题,并把这一问题放在更为复杂的彩色图像中来研究,分别结合RGB颜色相似性和HSV模糊相似性提出了多窗口选择和自适应窗口尺寸缩减两种方法,提高了立体匹配的精确度。图像噪声会极大地影响立体匹配的准确率,为了抑制噪声对匹配结果的影响,本文根据Rank变换的原理,并引入了Census变换和色差梯度的约束条件,提出了基于Rank变换的彩色图像匹配方法。为了解决Rank变换对变换窗口中心像素灰度值过分依赖的缺点,另外本文提出了一种基于领域差值的非参量图像变换方法,实验结果表明抑制了噪声对匹配结果的影响。传统的立体匹配建立在Lambertian的漫反射模型之上,漫反射模型的立体匹配在一个图像的大多部分是有效的,但是在处理图像中包含镜面反射的部分时会产生严重的匹配错误。为了解决这一问题,本文利用漫反射和镜面反射在灰度和最大色度上的不同,对分离像素中镜面反射部分的方法进行了数学推导,然后提出了在镜面反射部分首先分离镜面反射内容然后再匹配的方法,同时提出了一种针对镜面反射的匹配测度,结果在图像中漫反射部分和镜面反射部分都能匹配得到正确的视差。图切割将立体匹配转变为全局能量最小化问题,是目前匹配准确率最高的立体匹配方法之一,但是它在视差最大的范围内构造网络规模,匹配时间太长。为了解决这一问题,本文提出了一种基于视差梯度和模糊规则的快速最小割立体匹配方法,根据匹配像素的不同特性,把匹配搜索范围限定在有限的几个候选匹配像素之中,极大地减少了构造网络的规模,在保证高匹配率的情况下明显减少了匹配时间。由于红外图像具有高噪声、低分辨率的特点,这使得在红外图像中计算场景的深度信息非常困难,目前多采用基于特征的匹配方法,但是提取的特征像素只有物体的边缘,要想生成整幅图的视差非常困难。本文提出了先对图像进行相位一致性变换,然后根据相位一致性变换后图的特点,采用非一致采样方法不均匀地提取出特征像素形成一个网格图,然后利用信任度传播方法计算出网格节点像素的视差,再插值求取整幅图像的视差方法。结果得到了准确的红外场景深度信息。

【Abstract】 Stereo matching has been one of the most researched areas of machine vision. It can bring critical advantages to a very wide spectrum of visual application domains, such as 3-D reconstruction, robot vision, automation land vehicle navigation, and so on. But stereo matching is an ill-posed problem with the influence of distortions, occlusions and low texture, obtaining exactly disparity still faces challenge. In computer vision, binocular vision is similar to the mechanism of human binocular vision, and easy to achieve in practical applications. The relevant theories and approaches of binocular stereo matching have been studied in this thesis, and some progressive achievements have been made.A central problem in stereo matching by computing correlation or sum of squared differences (SSD) lies in selecting an appropriate window size. The window size must be large enough to include enough intensity variation for reliable matching, but small enough to avoid the effects of projective distortion. If the window is too small and does not cover enough intensity variation, it gives a poor disparity estimate, because the signal to noise ratio is low. If, on the other hand, the window is too large and covers a region in which the depth of scene points (i.e., disparity) varies, then the position of maximum correlation or minimum SSD may not represent correct matching due to different projective distortions in the left and right images. For this reason, a window size must be selected adaptively. We have researched multiple windows and adapted windows which find a best matching windows for area-based stereo matching.Image transformation is widely and effectively used in image processing. According to the principle of Rank transformation and Census constraint and color difference gradient constraint, the paper presented a color image matching algorithm based on Rank transformation. The experiment results show that the disparity of Rank transformation image is more precise than that of intensity image. At the same time, the matching result is more robust by noise influence to a certain extent. In addition a new non-parametric transform founded on neighboring region disparity for stereo matching is proposed in the paper, the proposed algorithm is the more precise matching invariance to certain types of image noise than Rank transform and Census transform.Traditional stereo correspondence algorithms rely heavily on the lambertian model of diffuse reflectance. While this diffuse assumption is generally valid for much of an image, processing of regions that contain specular reflections can result in severe matching errors. In this paper, We address the problem of binocular stereo dense matching in the presence of specular reflections by introducing a novel correspondence measurement which is robust to the specular reflections. Accurate depth can be estimated for both diffuse and specular regions. Unlike the previous works which seek to eliminate or avoid specular reflections using image preprocessing or multibaseline stereo, our approach works in its presence.Some recent stereo matching algorithms are based on graph cuts, they transform the matching problem to a minimisation of a global energy function. The minimisation can be done by finding out an optimal cut in a special graph. Different methods were proposed to construct the graph, But all of them, consider for each pixel, all possible disparities between minimum and maximum values. In this article, a new method is proposed:only some potential values in the disparity range are selected for each pixel, These values can be found using disparity gradient and fuzzy logic. This method allows us to make wider the disparity range,and at the same time to limit the volume of the graph, and therefore to reduce the computation time.Infrared images have higher noise and lower resolution than visible images. This makes it more difficult to achieve a better disparity image in infrared images by using the method based on region matching. After analyzing the phase congruency transformed image, a sparse depth field may be obtained that can be interpolated to produce a dense depth field. In our proposed technique the sparse disparity map is produced by matching the stable features, extracted from the phase congruency model. A set of Log-Gabor wavelet coefficients is used to analyze and describe the extracted features for matching. The resulted sparse disparity map is then refined by triangular and epipolar geometrical constraints. In this work, we present a stereo matching algorithm based on belief propagation (BP). The algorithm is designed to work on sparse images originating from image content adaptive mesh representation techniques. There, an image is approximated with a mesh. The nodes of the mesh are the non-uniform samples which are the ones that form the sparse image. The key issue in the proposed method is to formulate BP such that it matches a sparse left stereo image with a dense right image to obtain a sparse depth map. Moreover, we propose a simple method that recovers the dense disparity map of the scene from the sparse one using the approximating mesh of the image.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络