节点文献

基于MPEG-7标准的视频描述与检索

【作者】 汤义

【导师】 李国辉;

【作者基本信息】 中国人民解放军国防科学技术大学 , 管理科学与工程, 2002, 硕士

【摘要】 随着计算机以及通信技术的发展,以视频为代表的多媒体数据量和信息量急剧增长。视频数据的日益增加,应用也越来越广泛。现在,在数字图书馆、军事信息系统、Web信息环境、专业视频库等应用,需要对视频数据和视频信息进行组织和管理。同传统的文字信息相比较,视频数据具有信息量大、难以准确描述的特点,因此人们很难从海量的视频信息中找到自己所需的信息。虽然过去开展了大量的视频数据库、视频分析和信息检索的研究,但是缺乏对视频数据进行完整的、规范性的描述,以及建立在这些规范描述之上的视频信息检索方法。本文在分析研究了现有一些基于内容的视频处理和检索方法的基础上,结合MPEG-7标准的新框架,对视频内容规范描述及其检索方法进行了研究,主要的研究工作如下: 1.视频内容分析和规范化描述:根据MPEG-7标准,首先对视频内容进行分析,然后进行规范化的描述。本文在视频内容分析的基础之上,建立了基于MPEG-7标准的视频内容描述模型。该模型从视频数据的特性出发,既综合考虑了视频的各种特征,包括视觉特征、对象空间关系和时间结构,又充分考虑了视频信息检索的要求,采用层次化的描述结构。 2.视频内容描述工具的设计和实现:根据描述定义语言,建立基于MPEG-7标准的视频内容描述工具。描述工具建立的基础是上面提出的描述模型,它对于视频的结构特征,可以实现特征的自动提取并自动生成描述,而语义信息则可以通过手工输入一些基本信息的基础上自动生成描述。根据本文设计的视频内容描述工具,可以建立适合于视频检索的标准化描述,该描述的最终结果采用W3C的XML语言模式。 3.提出一种基于倒排索引的视频索引机制及其索引建立算法:根据上面的描述结果,设计了一种基于倒排索引的视频索引机制及其索引建立算法。在该算法中,设计了更能够代表媒体数据内容的特征以及高效的索引结构。该结构与文档中倒排序方法相结合,提出了一种基于MPEG-7标准的倒排索引机制及其索引建立算法。 4.建立在索引基础上的快速检索算法及其实现:在比较现有XML文档索引和检索算法的基础之上,本文设计并实现了一种基于XML文档的快速检索算法。该算法充分利用了上面提出的倒排序视频索引机制。本文还通过一系列实验对现有的一些XML文档检索算法以及本文设计的检索算法进行对比,最后的实验结果说明了本文算法的有效性和高效性。 综上所述,本文在探讨标准化描述方法的基础上,建立了一个基于MPEG-7标准的视频数据库以及由此建立一套完整的基于内容的检索机制,它具有及其重要的意义。

【Abstract】 With the rapid development of multimedia and communication technology, a great deal of video data and information has become available. At present, we need to organize and manage video data and information in many fields, such as digital library, military information system, Web information system and video database etc. Compared to traditional text information, video data contains too much information and is hard to describe, so it becomes difficult to find the proper information that we need. In the past, people have done much research on video database, video analysis and information retrieval, but there lacks complete and standardized description of video data, and video information retrieval method based on the standardized description. In this paper, based on the analysis of current video manipulation and retrieval methods, and according to the new framework of the MPEG-7 standard, the author raised a new method to describe and retrieve video content. The main work includes:1. Video content analysis and standardized description: According to the MPEG-7 standard, first, the author analyzed video content, then made standardized description of it. Based on the analysis of video content, this paper gives a model of video content description using the MPEG-7 standard. The model starts from the features of video data, it takes into consideration not only structural features of video including visual and spatio-temporal, but also the need to video information retrieval. The model applies a hierarchical structure.2. Design and implementation of the video content description tool: According to Description Definition Language (DDL), this paper gives a content description tool based on the MPEG-7 standard. The base of the description tool is the description model mentioned in the paper. The tool can automatically extract structural features of video and create the description. Besides, given some necessary information, the tool can also automatically create the description of semantic information. According to the video content description tool, we can build standardized description adapted to video retrieval. The final format of the description uses XML recommended by W3C.3. A mechanism of video indexing as well as its setting up algorithm based on inverted index: In this algorithm, the author designs an efficient index structure that can better contain features of media data contents. This mechanism of video index is based on invert index, which is similar to the one used in traditional document retrieval.4. A rapid retrieval algorithm based on the invert index and its implementation: Based on the comparison to the current XML document index and retrieval algorithms, this paper give the rapid retrieval algorithm based on XML document. In the end, the author lists some results of the experiments that compares some current XML document retrieval algorithms to the algorithms given in this paper. These results prove the availability and efficiency of the algorithm.In a word, based on the method of standardized description, this paper builds a video database and a complete retrieval mechanism based on video contents.

  • 【分类号】TN919.81
  • 【被引频次】4
  • 【下载频次】287
节点文献中: 

本文链接的文献网络图示:

本文的引文网络