节点文献

视频对象分割技术的研究

【作者】 邬正平

【导师】 陈纯;

【作者基本信息】 浙江大学 , 计算机软件与理论, 2002, 硕士

【摘要】 随着通讯和信息处理技术的发展,基于视频的应用展现出了强大的灵活性和可扩展性。视觉通讯随之成为成长最快的信息载体。数字化的应用和服务正大量涌现,如数字电视,远程会议,视频电话和基于图像的交互式多媒体等。这些伴随着大数据量的应用和服务要求更先进的数字信号处理技术,以便进行更高效的存储和传输,以及更准确的分析和更灵活的操纵。视频对象分割就是这样一种技术。 视频对象分割,旨在分割出视频序列中的运动对象并沿时间轴跟踪运动对象的演进。许多与图像处理、视频压缩、模式识别相关的应用都依赖于对运动对象的分割。视频对象分割技术同时也是基于内容的视频编码、视频内容的操纵和交互式多媒体等应用的重要工具。对视频对象的分割通常是将视频的内容分割成具有语义的区域,并进一步作为对象来处理。这些语义上分割出的对象能够独立地编码,从而实现交互式多媒体中对视频内容基于对象的操纵。比如,在MPEG-4标准中,视频序列被认为是由一系列相互独立的运动对象组成的,并且视频序列的编码是针对一个一个对象的。在MPEG-7中,基于帧间运动信息的分割结果以及对象的突然形变将被用于高层(对象层)的语义描述。 本文首先叙述了视频对象分割技术产生和发展的背景,然后讨论了视频对象分割技术发展的现状。接着,本文深入研究了视频对象分割技术:首先将信息融合技术应用于视频对象分割,利用视频流的图像信息和运动信息,提出了一种新的视频对象分割方法,为实时的视频流前景提取提供了一种新的思路与解决方案;然后,为增加通用性,本文又提出了一种基于动态规划的自动视频对象分割方法;最后,作为一种补充,本文还实现了一种交互式的视频对象分割方案。本文的研究思想和内容是通过对图像分割和视频跟踪等关键技术的研究,实现视频对象的自动分割和半自动分割,并在此基础上实现其在视频编码、编辑、检索,视频会议和视频理解等方面的应用。并在最后对这一领域的发展方向和前景做了展望。

【Abstract】 With advances in communication and information processing technologies, video-driven applications show a very large degree of flexibility and extensibility. Visual communication is the fastest growing vehicle for information. Many new digital applications and services are emerging such as digital TV, teleconference, videophone, and image-based interactive multimedia. These diversified applications and services with a large amount of data demand more advanced digital signal processing techniques for efficient storage and transmission, accurate analysis, and flexible manipulation. Video object segmentation is such a kind of technique.Video object segmentation aims to partition an image sequence into moving objects and to track the evolution of the moving objects along the time axis. Many applications related to video compression and transmission, and pattern recognition rely on video object segmentation. Video object segmentation techniques are also important tools for content-based video coding and manipulation, and interactive multimedia applications. Video object segmentation usually divides the contents of a video frame into semantic regions that can be dealt as objects. These semantically segmented objects can be coded so that object-based manipulation of video content can be realized in interactive multimedia applications. For example, in the context of the MPEG-4 standard, a video is considered to consist of independently moving objects and is encoded object by object. In the MPEG-7, segmented results based on the frame-to-frame motion information or abrupt shape change can be utilized for a high-level description.This paper first presents the background of research on video object segmentation, and introduces the status in quo of this area. And then this paper lucubrates the video segmentation techniques: first, a new algorithm based on information fusion, which can be used for the segmentation of video object is proposed; then, to enhance the generality, this paper presents a technique for automatic video object segmentation based on dynamic programming; at last, as an alternative, a user-assisted segmentation of video object is proposed. The idea and content of this paper are implementing automatic and semi-automatic video object segmentation via research work on key techniques and based on video object segmentation, building several applications such as video coding, video editing, video retrieval, video conference and video understanding, and give a prospect to the further research on this area.

  • 【网络出版投稿人】 浙江大学
  • 【网络出版年期】2002年 02期
  • 【分类号】TN919.8
  • 【被引频次】6
  • 【下载频次】304
节点文献中: 

本文链接的文献网络图示:

本文的引文网络