节点文献

图像局部不变量特征描述方法研究

Study on Method of Image Local Feature Description

【作者】 梁胤程

【导师】 邹海林;

【作者基本信息】 鲁东大学 , 计算机应用技术, 2012, 硕士

【摘要】 用机器来感知和识别自然界的物体和场景,即使是很简单的物体,对于计算机来说也是很困难的事情。难点是如何来表达自然界的物体,既要区分其他物体,还要克服由于尺度变化,缩放,平移带来的差异性。选择什么样的特征来描述待识别的物体是计算机视觉的关键。近几年,图像局部特征的出现使计算机视觉的研究取得了重大进展。局部特征根据图像局部信息采用多尺度分析,统计学等相关技术形成特征向量,对图像形成了更好的表达,广泛应用于物体识别、配准、全景图像拼接和机器人视觉等领域。本文对当前的各种图像局部特征进行了分析,通过对主流的局部特征Harris角点检测、尺度不变特征转换(SIFT)、加速鲁棒特征(SURF)、最大稳定极致区域(MSER)进行对比分析,选择当前最流行的尺度不变特征转换算法为着手点,针对当前算法存在的不足提出了改进,并将改进的算法应用于词包模型的场景图像分类。具体内容如下:1. Lowe提出的尺度不变转换算法效率比较低,无法满足实时性的需要。论文提出了一种基于圆投影的尺度不变转换算法,通过对投影后的局部区域的快速傅里叶变换后计算一次谐波分量,对尺度不变转换算法提取的特征点进行预筛选。通过对筛选后的特征点计算局部区域描述子进行图像的匹配。实验结果表明:经过预筛选,该算法可以有效的减少待匹配特征点的个数,提高算法的执行效率和配准率。2.词包模型通过对SIFT算法检测的特征点在特征空间聚类来构造视觉单词。本文提出一种基于Fan-SIFT的词包模型,利用Fan-SIFT对不同角度的LOG算子响应值,检测出图像中的扇形斑点和圆形斑点,并利用扇形区域构造的特征描述符来构造视觉单词。相比于SIFT算法只检测图像中的圆形斑点构造的单词,本文算法构造的视觉单词更加具有针对性。在13类场景图像和Caltech101数据集上进行实验表明,基于Fan-SIFT算法生成的词包模型对场景图像的分类具有更高的准确率。另外,本文对主流的图像斑点局部特征进行了对比实验,重点关注了各种特征在尺度缩放、视角变化、光照变化、图像模糊情况下的匹配结果。对主流斑点特征的描述性能有了直观的表示。

【Abstract】 It’s very complicated for computer when it comes to the ability of perception andrecognition, even it’s a very simple object to be recognized. The most difficult point forcomputer recognition is how to express the object. It needs the ability to distinguish oneobject from another no matter its different size, different view and different position. Featureselection is the key process in computer vision which can greatly affects the results. Duringthe past decade, the progresses of local feature prompt computer vision research. With thehelp of multi-scale analysis technology and statistics technology, people draw various kind oflocal image feature from each block of image which has a better express of image. It’s widelyused in the area of object recognition, registration, image stitch and robot vision etc.We have a deep research on various local image features. A comparison study hasdeveloped on Harris, SIFT, SURF and MSER. SIFT algorithm is selected as the start point forits good effect. We put forward some improvement according the shortcomings. What’s more,the improved feature is used on scene image classification and the experiments demonstrateits good effect on image classification. The details and the innovation are as follows:1. The scale-invariant feature transform algorithm proposed by Lowe has a low efficiencyand restricts its application. The algorithm based on rounded projection proposed in our paperapplies Fast Fourier Transform algorithm (FFT) on the projection function to compute thefirst harmonic components which are used to pre-screen the feature points that extracted bySIFT algorithm. After the pre-screening, we get the descriptors according to the local areafeatures of left points. The experiments shows that it has a smaller number of feature pointsthan the original SIFT algorithm, so it improves the efficiency and has a better performance.2. The model of word bags use the SIFT descriptors to formulate the image vision word bycluster method. SIFT algorithm is a detector of blob region of image by LOG kernel function.Instead, we substitute the SIFT detector for Fan-SIFT algorithm. Fan-SIFT not only detectsthe blob region in image, but also the fan region. Accordingly, we use a feature descriptor offan shapes. Fan-SIFT can find different kinds of blob region in image and form the descriptorswith a smaller dimension. Experiments are processed on the data set of13scene images anddata set of Caltech101. The results show a better effect on image classification.We also process the comparison experiments on the blob image features which focus on the match results of different scale, different size and different position, analyze the repeatabilityof different feature detectors. We give an intuitive description on the quality of blob imagefeatures.

  • 【网络出版投稿人】 鲁东大学
  • 【网络出版年期】2012年 09期
节点文献中: 

本文链接的文献网络图示:

本文的引文网络