节点文献

微机视觉系统相关理论及技术研究

Study on the Theory and Techniques for Micro-Computer Vision System

【作者】 张艳珍

【导师】 欧宗瑛;

【作者基本信息】 大连理工大学 , 机械制造及其自动化, 2001, 博士

【摘要】 计算机视觉作为一个多学科交叉领域,其研究内容和应用领域都很广泛。近年来,硬件设备价格的不断降低,计算机性能的逐步提高,为计算机视觉的研究和应用准备了很好的条件。因此,对计算机视觉的进一步研究,不但具有重要的学术意义,而且具有重要的实用价值。 计算机视觉是通过一幅或多幅图像提取、识别、解释和重建三维环境信息的过程。主要包括五个方面:视频捕获、摄像机标定、图像预处理和特征提取、立体匹配以及三维重建。本文主要围绕计算机视觉这几方面的内容展开研究。 计算机视觉直接处理的对象是对三维环境采集到的二维图像,因此视频捕捉是计算机视觉后续处理的前提。视觉系统往往有实时性的要求,但一般随视频卡附带的应用程序并不能满足实时性的要求。本文分析探讨Windows环境下视频捕捉的一般系统框架和技术,开发了一个Windows环境下的视频捕捉系统,实现了视频数据的直接内存访问,能够很好地满足视觉系统实时性的要求。 空间物体表面某点的三维几何位置与其在图像中对应点之间的相互关系是由摄像机成像的几何模型决定的。这些几何模型的参数就是摄像机参数,摄像机标定就是确定摄像机的这些参数,是计算机视觉的一个重要内容。本文对现有的摄像机标定技术进行了分类研究,针对一般的工业需求,基于一阶径向畸变的小孔摄像机模型,提出一种新的逐步分解的摄像机线性标定方法,合理分解摄像机参数,全部求解过程均采用线性算法,避免了非线性优化的不稳定性,简单实用。实验结果表明,该算法具有较好的标定精度和较快的处理速度。 利用小波变换良好的局部化特性和多分辨率特征,提出基于小波变换的图像去噪方法和边缘轮廓提取方法。噪声和信号在小波变换域中有不同的表现特性。本文提出的移位冗余平均处理的小波去噪方法,在去除噪声的同时,具有良好的保持边缘特性的能力。在利用小波变换提取边缘轮廓中,采用自适应边缘链的平均幅度阈值,有利于根据图像特征提取图像边缘。 如何准确确定两幅或多幅图像中的对应点,即立体匹配,是计算机立体视觉中的一个难点。本文提出一种双阈值的分阶段匹配方法,以图像边缘点为匹配基元,选择米字条形窗口作为相关窗口,加快了相关运算的速度,采用双阈值进行判决,可在第一层将大量可能匹配特征检测出来,减少下一层匹配的搜索空间。 最后,本文论述了作者参与开发的一个足球机器人视觉识别系统,这是一个实时的计算机视觉系统。为满足足球机器人对抗比赛的要求,视觉系统必须实时快速准确地确定机器人小车和足球的瞬时位置和运动参数。本文提出一种基于区域投影的视觉识别新算法,充分利用系统中的颜色和形状特征,可快速准确地识别全部机器人小车和小球。实验结果表明,识别位置精度误差不超过IllllllQ piXCIS人采用 Pll350的 PC,识别速度可达到每帧仅用时 2毫秒,能够很好地满足该系统的速度和精度要求。

【Abstract】 Computer Vision has been evolving as a multidisciplinary subject. Its contours blend into those of artificial intelligence, robotics, signal processing, pattern recognition, control theory, psychology, neuroscience, and other fields. Its research area and application area spread widely. Recently, price of the video devices is going down, and performance of computer is going up. All of these promote the research and application of computer vision to a new level. Studying on computer vision has important significance not only on science but also on practical application.Computer vision is a synthesis and cognition science of 3D world from one or more digital images. Generally, the key techniques of a computer vision system is composed of five parts, which are video capture, camera calibration, image pre-processing and feature extraction, stereo correspondence and 3D reconstruction. This dissertation deals with these key problems in computer vision, and presented some new ideas, approaches and come to some valuable results.The original objects processed in computer vision are digital images captured by digital video devices. System structure and technical details in video capture are studied. Based on Windows platform, a real time video capture system is developed by accessing to buffer directly. It can satisfy the need of real time vision system well.As one of the basic tasks in computer vision, camera calibration determines the internal and external parameters of a vision system. In this dissertation, based on modified pine-hole camera model with one order radial lens deformation, a new linear multi-step method for camera calibration is developed. With some appropriate transformation and arrangement, the camera parameters can be calculated by solving these linear equations sequentially only. The running result of this new method shows that the algorithm is simple and efficient, and the precision is good enough. A image distortion correction method for radial lens distortion with segment slope is also described. Both simulated image and real image running results show that this correction method is robust and accurate.There has been growing interest of wavelet-based denoising schemes and edge detection techniques recently. Such popularity is mainly due to that wavelet provides an appropriate basis for separating noise signal from image signal and makes noise and edge show different attributes. A denoising algorithm based on wavelet transform with redundant representation is proposed, which can not onlymake improved denoising performance, but also suppress the Gibbs phenomena. A method of edge detection based on wavelets transform is also presented, in which a self-adapted method for selecting the threshold of edge chain evenness is adopted to detect edge well.Stereo correspondence is to determine that an item in one image corresponds to another item in other images of the same scene. Based on edge points, a two-stage stereo matching method with two thresholds is proposed.At last, a real time vision recognition system of MIROSOT is developed to recognize all vehicle robots and football. In this system, the rapid processing speed and high recognizing accuracy are needed. To meet such requirements, a recognition algorithm based on object projection is presented. The vision recognition is processed with two stages. In the first stage, a grid search scanning to find an object point inside an unrecognizing object is processed; then a following precision analyzing and recognizing procedure based on object region projection is processed for determining the robots and football’s position and also the robots’ orientation in the second stage. The recognition algorithm is robust in treating noisy image captured by CCD, even in the situation while multiple vehicle robots are closely adjacent each other. The algorithm is also very efficient, it takes only 2ms running in PII350 PC for recognizing and identifying all objects in a captured frame.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络