机器视觉标定与目标检测跟踪方法及其应用研究

Research on Machine Vision Calibration and Object Detection Tracking Methods & Application

【Author】 Xu Haixia (许海霞)

【Supervisor】 Wang Yaonan (王耀南)

【Author information】 Hunan University, Control Science and Engineering, 2011, PhD

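【Abstract (translated from the Chinese)】 With the development of modern society, requirements on product quality, production efficiency, working conditions and the environment keep rising. From large production lines, automatic window washing on high-rise buildings and cleaning work in harsh environments to the manufacture of defense weaponry and equipment, across civil and military applications, intelligent, automated and robotic operation is the inevitable trend of the times, and machine vision is a core technology driving it. Research on the theory, methods and key technologies of machine vision is therefore of great theoretical, social and economic significance. This dissertation studies the calibration of machine vision systems, methods for detecting and tracking visual target information, and their application to the operation of a cleaning robot for large condensers. The main work is as follows.

The dissertation first analyzes the background and significance of the topic, reviews the state of development of machine vision theory and key technologies, introduces the state of applied research, summarizes the open difficulties and the main line of research, and briefly presents projective geometry, the visual imaging model and imaging viewpoint of a machine vision system, and the theoretical foundations of visual geometry.

A self-calibration algorithm based on an extended camera imaging model is proposed. The extended imaging model is given; using it, perspective projections in different directions are analyzed simultaneously within one image, homography relations for the different directions are established, and from these the constraint equations on the intrinsic parameters are built, so that calibration is achieved from a single image. Compared with earlier homography-based calibration algorithms that use three images, this overcomes the loss of accuracy caused by image points that do not correspond across multiple images.

Beyond calibrating the imaging model, a hand-eye calibration method based on a single target point in the scene is proposed. The manipulator platform is precisely controlled to perform at least five translations and at least two rotations, the image points of a single scene target point are extracted, the camera motion is reflected through disparity and depth, and a system of constraint equations is established for the relative pose between the manipulator platform and camera coordinate frames. The five elements of the camera intrinsic parameter matrix K and the hand-eye relation R, t are obtained linearly, and the depth of the scene target point is obtained at the same time. The translations of the manipulator platform need not be orthogonal, which makes the motion control convenient and easy to implement.

A linear solution based on vector differences is proposed for the P5P problem with an uncalibrated camera: vector differences are formed from five control points and, using the orthogonality of R, constraint equations on the camera pose and the camera matrix are built step by step, giving an analytic solution of the uncalibrated P5P problem from the standpoint of linear theory. A linear motion analysis algorithm based on parallel line segment correspondences is also proposed. Within the structure from motion (SFM) framework, a line segment is represented by two elements, a point and a line, and parallelism is used to recover space line segments from image line segments. Following the screw theorem of kinematics, linear constraint equations on the motion parameters are built from the two elements of the space line segments, the motion parameters are solved linearly with quaternions, and a PSO nonlinear optimization algorithm is set up to refine them. The constraint equations for the pose and motion parameters are linear and have analytic solutions, so they are convenient to solve.

Methods for detecting and tracking visual image information based on the mean shift (MS) iterative algorithm are discussed; target detection and tracking complement each other. To improve the noise resistance of the color feature representation of the target model and the effectiveness of the matching iteration, detection by MS clustering is proposed, with the target model expressed by the cluster mode points. The idea of hierarchical MS matching search is proposed and a hierarchical MS matching iteration tracking algorithm is given: the cluster mode points and cluster blocks of the target reference model and the target candidate model are matched first, then the pixels within blocks, and this layered procedure estimates the position of the target centroid mode point in each frame of the tracked sequence. Experimental results show that, compared with the traditional MS tracking algorithm, the hierarchical MS deterministic gradient iteration achieves better tracking performance.

Tracking a single target is relatively simple, whereas multi-target tracking involves many uncertain factors, such as the number of targets and their interacting motion, and requires state estimation within a probabilistic inference framework. A hierarchical MS visual multi-target tracking algorithm based on RJMCMC is proposed. The multi-target tracking problem is modeled as maximum likelihood estimation under Bayesian inference, four kinds of reversible moves are designed to construct the Markov chain, and an effective prior proposal distribution based on an association matching matrix is given, which raises the sampling confidence of the targets and thus the iteration efficiency of the algorithm. Based on the hierarchical matching idea, a two-level likelihood measure at the pixel level and the cluster-block level is given. Experimental results show that hierarchical tracking is robust in both single-target and multi-target tracking.

For the application to condenser cleaning robot operation, applied research on the vision system of the cleaning robot and its key technologies is carried out, achieving autonomous movement of the robot and online cleaning of large condensers. To this end, a vision system is constructed, consisting of a subsystem that guides robot localization and navigation and a subsystem that guides the spray gun on the manipulator arm to locate the condenser tube openings. Four channels of information in total are transmitted through image acquisition cards to the main control cabinet, and after processing by the key vision algorithms, decision information for robot control is produced.

A visual 3D SLAM localization and navigation algorithm for the mobile robot is proposed, so that the robot moves autonomously to the local position currently to be cleaned and carries out the cleaning task. The 2D and 3D information acquired by a 3D camera is used as two attributes of the observation, and the ICP and BA algorithms are coupled to optimize data matching and solve for the robot's motion at any time. SLAM is solved according to vision theory, achieving 6DOF localization of the robot and creation of the 3D map. Vision theory provides the key theoretical basis for analyzing the 6DOF 3D SLAM process in three-dimensional space. The visual SLAM solution is more concise than traditional kinematic KF and PF filtering: it needs no prediction step, and localization and map creation form a single process. Compared with a 3D laser, the 3D camera lets 2D data guide the matching of 3D data, which reduces the search range over the 3D data and improves the computational efficiency of 3D SLAM.

A vision-based tube opening positioning algorithm is designed to help the robot determine the positions of the condenser tube openings. According to the area of the work site and the effective field of view of the camera, the working surface is divided into regions by offline manual calculation for block-wise coarse positioning, and the robot is controlled to move. Once the robot has moved to a given coarse position, the manipulator arm vision system detects and segments the openings of the condenser tubes, computes the image position of each opening's center, and then computes the spatial position of the opening according to vision theory, achieving precise positioning of every tube opening within the current local range.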
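As an illustration of the ICP step named in the SLAM paragraph above, the following is a minimal sketch of the closed-form rigid alignment that ICP typically repeats once point matches are fixed. It is not the dissertation's implementation: the function name `rigid_align`, the SVD-based (Kabsch) solution and the synthetic data are assumptions made for this example, and the coupling with BA and the 2D-guided matching are left out.

```python
import numpy as np


def rigid_align(src, dst):
    """Least-squares rigid transform (R, t) such that dst ≈ src @ R.T + t.

    src, dst: (N, 3) arrays of already-matched 3D points (hypothetical input;
    the matching itself is outside this sketch).
    """
    src_c = src.mean(axis=0)
    dst_c = dst.mean(axis=0)
    # 3x3 cross-covariance of the centered point sets.
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    # Flip the last axis if needed so that R is a rotation, not a reflection.
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t


# Synthetic check: move a random cloud by a known rotation and translation.
rng = np.random.default_rng(0)
cloud = rng.normal(size=(50, 3))
angle = 0.3  # rotation about the z axis, in radians
R_true = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                   [np.sin(angle), np.cos(angle), 0.0],
                   [0.0, 0.0, 1.0]])
t_true = np.array([0.1, -0.2, 0.5])
moved = cloud @ R_true.T + t_true

R_est, t_est = rigid_align(cloud, moved)
print(np.round(R_est, 3), np.round(t_est, 3))
```

In a full ICP loop this alignment would alternate with a correspondence search; the abstract indicates that the 2D image data is what narrows that search.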

【Abstract】 With the development of science and technology in modern society, demands on product quality, production efficiency, working conditions and the environment are steadily increasing. Intelligent, robotic and automated operation is the inevitable trend for civil and military applications ranging from large production lines, automatic window washing on buildings and cleaning work in harsh environments to the manufacture of defense weaponry and equipment. Research on the theory, methods and key technologies of machine vision therefore has great theoretical and economic significance. This dissertation investigates vision calibration, visual object detection and tracking, and their application to a cleaning robot for large condensers. The main results and contributions of the dissertation are as follows.

The background of the subject is analyzed, the essential theory and techniques of the vision field are reviewed, machine vision applications are presented, and the open difficulties to be addressed are discussed. As a basis for the further research, projective geometry, the imaging model and viewpoint, and the fundamentals of vision geometry are introduced briefly.

A self-calibration approach based on an extended imaging model is proposed. The extended imaging models are described, and three different homographies between the space plane and the image plane are obtained simultaneously from one image under perspective projection in different directions; constraint equations on the intrinsic parameters are then established, so that calibration is completed from a single image. Compared with the traditional method, calibration precision improves because the step of matching image points across multiple views is eliminated.

Besides calibration of the camera imaging model, a self-calibration approach to the hand-eye relation of a manipulator is proposed based on a single point in the scene.
The motions of the manipulator are accurately controlled and read out, and the camera observes one point in the scene during five (or more) pure translational motions and two (or more) pure rotational motions. The camera motions are estimated from the disparity and depth value of the point, and constraint equations are set up between the manipulator and camera coordinate frames. The five elements of the camera intrinsic parameters and the hand-eye relation are determined linearly, and the depth value of the scene point is also solved. Because only a single scene point is used, the method requires neither matching nor orthogonal motions, so the manipulator motions are convenient to control and the algorithm is simple to implement.

An algorithm based on vector differences is proposed for linearly solving the P5P problem with an uncalibrated camera. Vector differences are formed from five control points, and constraint equations on the camera pose and intrinsic parameters are set up according to the orthogonality of the rotation R. The analytic solution of P5P with an uncalibrated camera is determined in terms of linear theory.

Motion estimation from image sequence features is also investigated, and a linear algorithm for motion estimation based on parallel line segment (PLS) correspondences is proposed. Under the framework of structure from motion (SFM), a line segment is represented by two elements, a point and a line. The structure of the space line segments is reconstructed step by step from the image lines with the help of parallelism. Equations in the motion parameters, based on the two elements of the space line segments, are then established according to screw theory and solved using quaternions. The motion parameters are further refined by a PSO optimization algorithm. The method is characterized by linear constraint equations and analytic solutions.

Methods for visual object detection and tracking based on mean shift iteration are discussed.
Detection and tracking are complementary, but the classical mean shift tracking algorithm has poor robustness in representing color features and complex iterations in matching. A detection algorithm based on mean shift clustering is therefore proposed, in which the object model is represented by the clustered modal points. A hierarchical mean shift (HMS) iteration for object tracking is then proposed. Matching between the object reference model and the candidate model is performed at two levels, first over the clustered blocks and then over the pixels within blocks. The centroid of the tracked object is thus obtained layer by layer in consecutive frames. Single-object tracking is relatively simple, and better performance is obtained with the deterministic gradient algorithm of the modified mean shift; multi-object visual tracking, however, requires probabilistic reasoning because the number of objects is unknown and the objects interact with one another. A new approach to multi-object visual tracking is therefore proposed based on Reversible Jump Markov Chain Monte Carlo (RJMCMC) sampling. The tracking problem is formulated as computing the MAP (maximum a posteriori) estimate given the image observations. Four types of reversible jump moves are designed for the Markov chain dynamics, and the prior proposal distribution of the objects is developed with the aid of an association match matrix to improve the confidence of sampling and make the iteration efficient. The joint likelihood measure is presented at two levels, clustered block subsets (CBS) and pixels.
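For orientation, the basic mean shift iteration that both the clustering-based detection and the HMS tracker build on can be sketched as below. This is plain mode seeking with a Gaussian kernel, not the hierarchical algorithm of the dissertation; the function name `mean_shift_mode`, the kernel choice, the bandwidth and the toy 2D data are all assumptions of this example.

```python
import numpy as np


def mean_shift_mode(points, start, bandwidth=1.0, tol=1e-6, max_iter=200):
    """Follow mean shift updates from `start` to a nearby density mode.

    points: (N, D) feature vectors (for example color or position features).
    start:  length-D initial estimate.
    """
    points = np.asarray(points, dtype=float)
    x = np.asarray(start, dtype=float)
    for _ in range(max_iter):
        # Gaussian kernel weight of every sample relative to the current estimate.
        sq_dist = np.sum((points - x) ** 2, axis=1)
        w = np.exp(-0.5 * sq_dist / bandwidth ** 2)
        # The update moves the estimate to the weighted mean of the samples.
        new_x = (w[:, None] * points).sum(axis=0) / w.sum()
        if np.linalg.norm(new_x - x) < tol:
            return new_x
        x = new_x
    return x


# Toy data: two well-separated clusters in 2D.
rng = np.random.default_rng(1)
cluster_a = rng.normal(loc=[0.0, 0.0], scale=0.3, size=(100, 2))
cluster_b = rng.normal(loc=[5.0, 5.0], scale=0.3, size=(100, 2))
samples = np.vstack([cluster_a, cluster_b])

mode_a = mean_shift_mode(samples, start=[0.8, 0.8])
mode_b = mean_shift_mode(samples, start=[4.5, 4.5])
print(mode_a, mode_b)
```

Running the iteration from many starting points and grouping the starts that reach the same mode gives a mean shift clustering; a tracker instead restarts the iteration in each new frame from the previous target position.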
Comparisons with two other MS algorithms demonstrate the validity, robustness and performance of the hierarchical mean shift (HMS) algorithm for single- and multi-object tracking.

For the application to a mobile robot for condenser cleaning, the dissertation studies the robot's vision system and the key vision techniques needed to implement autonomous movement of the robot and online cleaning of the condenser. To this end, a vision system is constructed, comprising a subsystem that guides robot navigation and positioning and a subsystem that guides the spray gun to locate the condenser pipes. In total, four vision signals are transmitted to the control cabinet via image acquisition cards, and control decisions for the robot are made after the visual information has been processed.

A visual SLAM method for the robot is proposed based on a 3D camera sensor, so that the robot moves autonomously to the place currently to be cleaned in the condenser environment. A SwissRanger SR3000 camera, used to sense the natural 3D environment, provides the mobile robot with image and three-dimensional data. These data are regarded as two properties of the environment observation. The observation at time k is matched with the observation at time k-1 under the constraint of coupled BA and ICP, and the motion is estimated. A solution to SLAM is obtained on the basis of vision theory, comprising 6 DOF localization of the mobile robot and three-dimensional mapping of landmarks. The solution is simpler than traditional kinematic Kalman or particle filtering because it has no prediction step, and robot localization and mapping are solved simultaneously. Compared with matching 3D laser data, the proposed algorithm uses the 2D image to guide the matching of 3D data, so the search range is reduced and the computational efficiency of 3D SLAM is improved.

A vision-based approach is presented to help the robot position the condenser tubes. The workplace is partitioned manually into blocks offline according to the size of the area and the field of view of the camera.
Coarse positioning is performed first by counting blocks; then, when the robot has moved to a given coarse position, the precise position of each tube in the current block is determined by the mechanical arm's vision system. With the help of vision theory, the captured image is converted into the spatial positions of the tubes through tube detection, circle fitting and center calculation.
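The last two steps of that pipeline, circle fitting and conversion of the fitted center to a spatial position, can be sketched as follows. This is a minimal illustration under stated assumptions, not the dissertation's method: the algebraic least-squares circle fit, the pinhole back-projection with a known depth, the function names `fit_circle` and `back_project`, and the intrinsic matrix values are all hypothetical. The abstract does not say how depth is obtained, so it is simply passed in here.

```python
import numpy as np


def fit_circle(xy):
    """Algebraic least-squares circle fit; returns (cx, cy, r) in pixels.

    xy: (N, 2) edge pixels assumed to lie on one tube opening.
    """
    x, y = xy[:, 0], xy[:, 1]
    # Circle equation rewritten as x^2 + y^2 = 2*cx*x + 2*cy*y + c,
    # with c = r^2 - cx^2 - cy^2, which is linear in (cx, cy, c).
    A = np.column_stack([2 * x, 2 * y, np.ones_like(x)])
    b = x ** 2 + y ** 2
    (cx, cy, c), *_ = np.linalg.lstsq(A, b, rcond=None)
    r = np.sqrt(c + cx ** 2 + cy ** 2)
    return cx, cy, r


def back_project(u, v, depth, K):
    """Camera-frame 3D point for pixel (u, v) at a known depth along the optical axis."""
    ray = np.linalg.solve(K, np.array([u, v, 1.0]))
    return depth * ray / ray[2]


# Synthetic edge pixels on a circle of radius 50 px centered at (320, 240).
theta = np.linspace(0.0, 2.0 * np.pi, 40, endpoint=False)
edge = np.column_stack([320 + 50 * np.cos(theta), 240 + 50 * np.sin(theta)])
cx, cy, r = fit_circle(edge)

# Hypothetical intrinsics: 800 px focal length, principal point (320, 240).
K = np.array([[800.0, 0.0, 320.0],
              [0.0, 800.0, 240.0],
              [0.0, 0.0, 1.0]])
center_3d = back_project(cx, cy, depth=2.0, K=K)
print(cx, cy, r, center_3d)
```

The algebraic fit is linear and fast, which suits many tube openings per image, but it is biased when only a short arc is visible; a geometric refinement would be a reasonable follow-up in that case.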

  • 【Online publication contributor】 Hunan University
  • 【Online publication year/issue】 2012, Issue 08