节点文献

目标跟踪中在线boosting学习算法的研究

Online boosting Learning for Object Tracking

【作者】 裴玉红

【导师】 马波;

【作者基本信息】 北京理工大学 , 计算机科学与技术, 2010, 硕士

【摘要】 视频目标跟踪是计算机视觉领域的重要研究分支,在许多领域有着重要的应用。最近,基于学习的跟踪算法逐渐引起相关学者的关注,并取得了较好的跟踪效果。具有代表性的是把跟踪看成目标和背景的分类问题,它不是建立复杂的模型来描述目标,而是找到决策边界来区分目标和背景,当目标外观变化时,该方法只需要更新决策边界而不用更新目标外观模型。目前,基于学习的视频目标跟踪算法典型的有Avidan等人提出的Ensemble Tracking和Toufiq Parag提出的基于boosting自适应线性弱分类器,其主要思想是通过在线训练一组线性弱分类器,并使用颜色、亮度等简单特征来区分目标和背景。但是,针对复杂场景下对目标的跟踪,以上所说的方法可能失去有效性。而本文是在基于boosting的视觉目标跟踪算法的基础上,主要从特征和分类器两个方面进行研究。融入基于核函数的分类器,以及Gaobr滤波和张量等特征。主要研究工作和贡献如下:1)提出了基于Gabor滤波的在线boosting跟踪算法,Gabor滤波因具有优良的空间局部性和方向选择性,能够提取图像局部区域内多个方向的空间频率和局部结构特征,所以对目标和背景具有较好的判别力,但是直接使用高维Gabor特征在跟踪过程中会影响其速度,针对这一问题,本文考虑了如何对Gabor滤波特征进行有效降维,这里拟采用以下两种方案对Gabor特征进行降维并提取最突出的判别特征:(a)使用局部Gabor滤波器;(b)使用自适应的Gabor滤波器组参数。把降维后的Gabor特征同在线boosting的跟踪框架结合起来,来实现对目标的跟踪。2)提出了基于张量特征的在线boosting跟踪算法,张量模式作为传统向量模式的扩展和补充,近年来已引起机器学习、模式识别等领域的广泛关注。张量特征能够提取物体梯度方向的特征,对纹理特征较强的目标具有较好的区分能力。所以,本文把张量特征同在线boosting的跟踪框架结合起来,对纹理特征较强的目标,具有较好的跟踪效果。3)提出了基于最小平方核回归的在线boosting跟踪算法,在复杂的跟踪场景中,线性分类器往往无法提供较好的分类结果,所以,这里使用基于最小平方核回归的分类器来代替线性分类器,其主要思想是:通过使用Mercer核函数,把低维空间线性不可分的模式通过非线性映射到高维特征空间来实现线性可分。为了解决使用核函数分类将会产生较高维数分类器的问题,这里使用在线稀疏算法,通过选取部分样例来训练得到基于最小平方核回归的分类器,然后把最小平方核回归算法同基于boosting的跟踪框架结合起来。实验表明,该方法能够准确的对复杂场景中的目标进行跟踪。特征选取和分类器设计是模式识别领域两个重要的方向,本文从特征和分类器两个方面进行研究,首先,选取一些具有判别力的特征,本文选取Gabor滤波和张量特征,然后,使用基于核函数的分类器代替线性分类器,和传统的算法相比,本算法对复杂的跟踪场景,能够取得较精确的跟踪成果,并且能够达到稳定、实时的跟踪,实验结果验证了该算法的有效性。

【Abstract】 Video object tracking is the important research branch in computer vision, and has applications in many fields. Recently, visual tracking based on learning has caused many scholars’attention, since it can achieve good tracking performance. The representative method treats tracking as a classification problem between object and background. Instead of building complex model to describe the visual object, this method intends to find a decision boundary between object and background. When the appearance of object changes, it only needs to update decision boundary, rather than the object appearance model. Currently, the representative tracking algorithms based on learning include Ensemble tracking and Adaptive linear weak classifiers boosting for online learning which proposed by Avidan and Tougiq Parag respectively. The basic principle behind the two methods is to train a set of linear weak classifiers for visual tracking in an online manner by use of simple image feature like color, intensity etc, and may fail to track the visual object in the complex scence. By incorporating better image features like tensor or gabor features and replacing the linear weak classifier with a nonlinear weak classifier, this thesis has done substantial research work on boosting based on visual object tracking. Specificially, this theis has made the following contributions:1) Gabor filter has been used to get better image feature for visual tracking by online boosting. Compared with intensity and color, Gabor filter has good spatial locality and orientation selectivity , and can extract multidirectional spatial frequency feature and local structure feature. As a result, it has a higher discriminative power between background and foreground. However, a trival application of high dimensional Gabor feature to tracking will affect tracking speed. So, we turn to two schemes to reduce dimension and select the most discriminative feature. (a).Using local gabor filter bank to extract the Gabor feature vectors; (b).Adjusting filter bank parameters adaptively.2) Tensor feature has been explored for visual tracking by online boosting. As complement to typical vector patterns, tensor feature can capture gradient direction information, and has good distinguishing ability for the object which has strong texture property. By combining tensor feature with online boosting algorithm, we have achieved good tracking result for textured visual object.3) Online boosting method using the recursive least-squares (RLS) algorithm has been proposed for visual tracking. Linear classfier cann’t acquire good discrimination power. So,we employ a nonlinear version of the recursive least square algorithm(RLS) here. It performs linear regression in a high-dimensional feature space induced by a Mercer kernel,and can therefore be used to recursively construct minimum mean-squared-error solutions to nonlinear least-squares problems. In order to regularize solutions and keep the complexity of the algorithm bounded, we use a sequenctial sparsification process that admits into the kernel representation a new input sample only if its feature space cannot be sufficiently well approximated by combining the previously admitted samples. So, using this sparsification proceduce, we can update weak classifiers online.Classifier and feature selection are two important fields in pattern recognition. In this paper, we focus on feature and classifier. First, using Gabor filter and tensor feature to extract some discriminative features. Then, we choose the classifier based on kernel rather than the linear classifier. When the tracking scenario is very complex, this method can still achieve good results. Experimental results verify the effectiveness of the algorithm.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络