节点文献

图像准稠密匹配及协分割

Image Quasi Dense Matching and Co-segmentation

【作者】 郭晓杰

【导师】 操晓春;

【作者基本信息】 天津大学 , 计算机应用技术, 2013, 博士

【摘要】 如今,随着软件和硬件的迅猛发展,图像已经成为人们记录信息的主流载体。相比于数字和文本而言,图像数据包含了更为丰富的信息,这些信息对于人类,更客观、语义层次更高,也更贴近现实。换言之,图像反映了真实世界中的场景、物体、物体与物体之间的关系。而对于机器而言,图像不过是以一定格式组织在一起的数据而已。那么,如何使机器能够理解图像的内容是使机器理解真实世界的关键问题之一。也正是因为这个原因,图像理解成为计算机视觉和模式识别领域中最基础最重要的主题之一。计算机对图像的理解,通常要从低层信息入手,挖掘图像中具有代表性的信息,进而组织信息。再通过寻找图像之间信息的对应性来探索更为高层的信息。这与人类认知的本质是极为一致的。图像的特征主要分为两类:全局图像特征和局部图像特征。全局图像特征关注于图像的整体信息,其优点是简单高效,但是它对图像的变换、噪音和遮挡十分敏感。局部图像特征则倾向于图像的细节,相比于全局图像特征,其优点在于它对图像的变换、噪音和遮挡等因素具有良好的鲁棒性,但是处理的时间会增加。随着硬件的快速更新以及现实任务的需求,局部图像特征已成为图像特征的主要研究对象。本论文从图像的视觉信息出发,对局部图像特征的描述、场景级别的准稠密匹配、物体级别的(准稠密)匹配及物体的协同分割几个方面开展研究。1)赋赋予予特特征征描描述述符符以以镜镜面面翻翻转转不不变变性性的的方方法法。。尽管国内外学者已经设计了很多图像特征描述的方法,这些方法能够有效的处理图像的缩放、旋转和视角等变换。但是,对镜面翻转的情况却鲜有研究。文中提出一种具有镜面翻转不变性的特征描述框架,赋予传统的特征描述符以镜面翻转的鲁棒性并保持了原有的特性及优势,包括平移、旋转和尺度不变性。我们并不是要设计一种全新的特征描述符,而是提出了一个框架能提供大多数特征描述符以翻转不变性。镜面翻转不变性的提出拓宽了特征描述的应用范围。2)加加入入三三角角形形几几何何约约束束的的特特征征匹匹配配方方法法。。获取了图像特征的描述,图像特征描述符的匹配是图像理解的另一关键之处。衡量特征匹配性能主要包括两个方面:1)正确匹配的数量和2)正确匹配的比率。正确匹配数量对三维重建等任务十分关键,而正确匹配率则是结果正确性的保证。根据不同的应用需求,传统的匹配方法通常以牺牲一方而获取另一方。这种此消彼长的模式限制了图像理解本身以及其应用的性能改善。本文提出了一种加入几何约束的特征匹配方法,即三角约束特征匹配,来同时提高这两个性能指标得到准稠密的、高精度的匹配结果,从而突破了长期以来存在的瓶颈。3)物物体体级级别别的的特特征征匹匹配配算算法法和和物物体体协协同同分分割割方方法法。。相比于像素和局部特征,物体是人类认知中更为有效的单元。为此,本文还开展了基于图像的特征描述符的匹配进而挖掘图像中的物体之间的关系。物体级别的匹配充分利用了匹配特征点对之间的尺度、旋转、空间关系及描述符相似度,在没有任何先验信息的情况下,有效的寻找物体之间的对应关系。由于图像的特征匹配是以点对的形式存在,很难覆盖物体的全部信息。为了克服这一困难,本文最后设计了一种协同分割的方法,提出对应物体的更多信息。本文以图像特征描述、图像特征匹配、物体特征匹配和协同分割为纵向主线依次进行描述。为了验证所提出的方法的有效性和鲁棒性,文中结合大量模拟数据和真实数据对相应方法的性能进行定性及定量分析。从实验结果中我们发现,所提出的方法相比于对比方法性能上有显著的提高。

【Abstract】 Recently, as techniques rapidly develop, images become the dominant informationcarriage of people. Compared with digits and texts, the content of images is much richer,which is more objective with high semantic level. In other words, images reflect scenes,objects and relationships of the objects. As for computers, image data is just in a specificway of organization. As a result, the problem of how to understand the image data isone of the key issues for machines to be intelligent. That is why image understandingbecomes one of the most fundamental and important topics in the fields of computervision and pattern recognition.To understand what images represent, representative information usually is extract-ed from pixels, and then the description is employed to organize such information. Next,the high level information is explored via finding correspondence between images. Thisprocedure is very similar with the way that human beings perceive from real world. Im-age features can be grouped into two categories, i.e. Global Image Feature and LocalImage Feature. The former one focuses on the whole image, the advantage of which isits efciency. But, it is very sensitive to image transformation, noise and occlusion. Incontrary, local image features take care of local characteristic of images. It is relative-ly robust to the factors including image transformation, noise and occlusion with longercomputational time. Fortunately, due to the development of hardware and the demand oftasks in real world, local image features have attracted more attention from researcher-s. This dissertation starts from image visual information, and focuses on image featuredescription, scene-level quasi dense matching, object-level (quasi dense) matching andobject co-segmentation.1) Mirror Reflection Invariant Description Method. Although many image fea-ture descriptors have been developed by researchers which can efectively handle scale,rotation and view-point changes, the mirror reflection remains difcult and limited work iscarried out for addressing the difculty. In this work, we propose a framework for descrip-tors to be mirror reflection invariant, which enriches most of the existing descriptors withmirror reflection invariance meanwhile preserves the original advantages. The descriptorswith more invariances broaden the applicable range of image feature descriptors. 2) Geometric Constraint Based Image Feature Matching Method. In addition,the matching of image feature descriptors is another key issue of image understanding.The performance of image feature matching is measured by two metrics, including thenumber of correct matches and the matching accuracy. According to diferent require-ments of applications, traditional matching methods usually improve one aspect by sacri-ficing the other, which limits the improvement of performance for both image understand-ing itself and its applications. This dissertation proposes a matching method enforcing ageometric constraint, i.e. Triangle Constraint, to simultaneously improve both the num-ber of correct matches and the matching accuracy and thus obtain precise and quasi densematching results.3) Object-level Matching and Co-segmentation. Based on the matching result, wefurther explore the object-level relationship within images. The exploration utilizes thescale, rotation, relative position and descriptor similarity information of matched featurepairs, without any prior knowledge, to distinguish diferent objects. Due to the charac-teristic of point-based image features, it is very unlikely to recover the whole objects byonly using the matching pairs. To recover and extract the object information as much aspossible, we finally design a co-segmentation scheme.Extensive experiments on both simulated data and real data demonstrate the efec-tiveness and robustness of our proposed methods quantitatively and qualitatively. Fromthe results, we can find that the methods proposed in this work have better performancecompared with the state-of-the-arts.

  • 【网络出版投稿人】 天津大学
  • 【网络出版年期】2014年 11期
节点文献中: 

本文链接的文献网络图示:

本文的引文网络