节点文献

视频对象分割算法研究

Research on Algorithm of Video Object Segmentation

【作者】 张光玉

【导师】 朱维乐;

【作者基本信息】 电子科技大学 , 信号与信息处理, 2006, 博士

【摘要】 视频对象分割是MPEG-4基于内容的编码系统中的关键技术之一,此外,它在基于内容的检索、对象识别、对象跟踪、视频电话、视频购物、视频监控、电视特技制作和交互式操作的多媒体中也有重要的应用。不仅如此,许多机器视觉问题都要借助视频对象分割技术才能完成。因此,视频对象分割技术具有重要的研究意义和应用价值,它是当前多媒体技术研究中的热点和难点之一。本论文正是针对上述问题,对视频对象分割算法问题作了创新性和探索性研究。主要内容为:1.首先分析和研究了视频对象分割所涉及的主要理论和方法。主要分析了模糊聚类理论和模糊C均值聚类方法,概率统计学中的贝叶斯(Bayes)分类器、最大后验概率模型和马尔可夫(Markov)随机场模型,形态学的膨胀、腐蚀以及开闭运算方面的理论和方法,神经网络方面的神经元模型、神经网络分类器和BP训练算法,全局运动估计中的六参数仿射运动模型、Gauss-Newton迭代法和Levenberg-Marquardet优化方法。2.在视频对象分割中,为了得到准确的视频对象轮廓,常用视频对象的边缘对所得到的对象轮廓进行校正。因此图像边缘提取对于视频对象分割非常重要。初步分析了克隆技术,建议了一种克隆算法,将它用于彩色图像边缘检测,建立了一种边缘检测模型,并联合使用边缘连接算法给出一种彩色图像边缘提取的新方法。在图像边缘提取中,阈值的选取是关键技术之一,分析了人类视觉系统特性,建立了视觉系统的数学模型,给出一种边缘检测中基于人类视觉特性的自动阈值选取方法。论文提出的边缘检测方法同其他算法相比,能得到更连续的边缘和更少的过检测点。3.分析和研究了基于时间信息或空间信息的自动视频对象分割算法的特点和优缺点,针对这两种类型的算法都不能充分地利用视频序列时间和空间信息的问题,提出一种有效的时空联合分割算法。首先输入一组视频序列帧,选择第一帧作为参考帧,求出其它各帧同参考帧的帧差,将得到的一组帧差求最大值获得初始帧差图像,利用模糊C均值聚类算法和遗传优化方法对帧差图像进行聚类分析,得到初始帧差模板。利用分层的块匹配算法进行运动矢量估计,由运动矢量获得初始运动模板,然后对帧差和运动模板求或,得到粗糙的视频对象模板。由于模板中存在许多空洞和噪声点,对模板进行填充和去噪。基于模板和运动窗技术获得对象的运动窗口,在运动窗内利用边缘提取算法获得视频对象的边缘,基于对象边缘对模板进行轮廓校正。用原始视频数据对校正后的模板赋值,得到初始视频对象。最后通过运动估计和背景注册信息完成视频对象的跟踪。仿真结果同其它相关算法的结果进行了比较,证明算法可以获得满意的分割和跟踪效果。4.研究了帧差分割算法存在的问题和背景差分割算法的优点。提出一种基于背景构建的自动视频对象分割算法。首先对多帧视频片段作统计性分析,基于帧差和运动信息提取多组关键帧,基于这些关键帧获得各组的关键背景,利用这些关键背景和背景预测方法得到视频序列的全景图像。输入视频序列帧,求出各帧同全景图像的背景差图像,利用BP神经网络分类器对背景差图像进行分类,通过分类得到各帧的对象模板,利用模板填充、局部噪声去除和局部平滑算法对模板进行后续处理,最后利用各帧的原始图像数据对模板赋值,提取出各帧视频对象。对于具有动态背景的视频序列,利用全局运动估计和补偿技术对摄像机的运动进行补偿,对补偿后的视频序列基于求多帧中值的方法得到每帧的背景。利用背景差算法进行视频对象的分割和跟踪。仿真结果和其它相关算法的分割结果进行了主客观两方面的比较,验证了本章算法的有效性。5.研究和分析了一些典型的半自动视频对象分割方法,这些算法中大部分需要过多的人工参与,大大降低了算法的速度和可操作性,不利于算法的有效执行。针对这种情况,提出一种联合背景构建和运动估计技术的半自动视频对象分割算法。首先在GUI中通过鼠标描绘出视频对象的大致边界,基于对象的边缘信息对轮廓进行校正,然后利用轮廓填充和平滑算法得到初始帧对象模板。基于模板得到初始帧视频对象和初始背景。利用背景差算法得到后续帧对象模板,利用运动估计对模板进行校正,根据相邻帧的对象模板对背景更新,最后通过背景的不断更新和背景差算法完成后续帧对象跟踪。仿真结果和其它相关算法的分割结果进行了主客观两方面的比较,验证了本章算法的有效性。

【Abstract】 Video object segmentation is one of the most important techniques for MPEG-4 content-based coding scheme, which has many important applications in content-base index, object recognition, object tracking, video phone, video conference and interactive multimedia. Many problems of computer vision are solved by using the techniques of video object segmentation. So the study on video object segmentation techniques has important significance and application value, which is one of the most difficult problems in the multimedia techniques.In this thesis, the novel methods are explored and investigated for video object segmentation. The main results are as follows:1.The related theories and methods of video object segmentation have been analyzed. Firstly, the theory of mode clustering and the fuzzy c-means clustering method are studied, the Bayes classifier, the maximum a posteriori probability model and Markov random field are discussed, secondly, the morphological dilation, erosion and open-close operations are introduced, the nerve cell model, neural network classifier and BP training algorithm are analyzed, finally, the six-parameter affine motion model of global motion estimation, the Gauss-Newton iteration method and the Levenberg-Marquardet optimization method are studied.2.The edges of objects are used to correct the contours in video object segmentation, in order to obtain the precise contours of video objects, so edge extracton is very important in video object segmentation. We analyze clone technique, and suggest a clone algorithm, then apply the algorithm to edge extraction of color images, finally, we propose an effective edge extraction algorithm by integrating edge connection method. Automatic threshold choice is difficult and important to color image edge detection. After human visual properties have been analyzed, an algorithm of automatic threshold choice in image edge detection is proposed. Experiments demonstrate the effectiveness of our algorithm in comparison with other related methods3.The video object segmentation based on temporal or spatial information is researched, then an effective video object segmentation algorithm is suggested. Firstly, a group of pictures are inputted, the initial frame difference image is obtained by employing the frame difference information, and the initial frame difference mask is obtained by integrating fuzzy c-means clustering and genetic algorithm. The initial motion mask is obtained by using motion estimation. Secondly, the coarse initial object mask is obtained by employing the initial frame difference and motion mask. The mask is filled and corrected by using the edges of objects in the motion window. An initial video object is obtained by using the mask and the original image data. Finally the object tracking is executed by using motion estimation and background information. Experiments demonstrate the effectiveness of our algorithm in comparison with other related methods4.After the frame difference method and the background difference one have been researched, an algorithm of automatic video object segmentation based on background constructing is suggested. A group of key background images are obtained by employing many extracted key frames based on motion information, and the background of video sequence is obtained by employing these key background images. After the background difference images have been obtained, the object masks are obtained by using BP neural network classifier. The masks are improved by using the post-processing procedures, such as mask filling, noise elimination and contour smoothing. The video objects are extracted by using the processed masks. For the video sequence with motion background, the global motion estimation and compensation is used, the background is constructed by the median of a group of compensated video frames. Finally, the object segmentation and tracking is executed by employing the background difference method. Experiments demonstrate the effectiveness of our algorithm in comparison with other related methods.5.A lot of semiautomatic video object segmentation methods are analyzed, an algorithm of semiautomatic video object segmentation by integrating background constructing and motion estimation is suggested. Firstly, a coarse contour of the object is obtained by employing manual method; the contour is corrected by employing edges of the object. Secondly, the corrected contour is filled and processed by employing smoothing method, and the initial object and the background are obtained by using the processed contour. The object masks of subsequence frames are obtained by employing background difference algorithm, and the masks are corrected by using motion estimation. Finally, the object tracking is executed by employing background updating and background difference method. Experiments demonstrate the effectiveness of our algorithm in comparison with other related methods.

  • 【分类号】TP391.41
  • 【被引频次】10
  • 【下载频次】1356
  • 攻读期成果
节点文献中: 

本文链接的文献网络图示:

本文的引文网络