节点文献

基于视觉哈希的视频拷贝检测算法研究

Researches on Visual Saliency Based Video Hashing for Video Copy Detection

【作者】 王静

【导师】 孙建德;

【作者基本信息】 山东大学 , 通信与信息系统, 2013, 硕士

【摘要】 随着多媒体技术的发展,网络视频的传播变得十分便捷迅速。由于数字视频的拍摄编辑处理非常容易,使得数以千计的数字视频每天都被创造出来。同时,非法盗版者往往对视频进行一些编辑处理(如添加噪声,添加边框,尺寸变化,滤波,画中画,添加字幕,JPEG压缩,对比度变换等攻击),使得盗版视频也在成倍出现飞速传播,极大侵害了版权所有者的切身利益。随着这一发展,对于视频拷贝检测技术的研究逐渐成为了多媒体信息版权处理领域中的研究热点,并且开始在视频跟踪,视频内容检索,视频内容认证,版权保护,视频内容过滤等方面进行应用。因此,如何建立更鲁棒的视频拷贝检测系统模型就成为了国内外研究的重点。本论文首先介绍了视频拷贝检测系统机制的基本理论;然后介绍了一种用于视频拷贝检测的时空联合哈希算法,并在此基础上,本论文针对目前视频拷贝检测算法存在的不足,结合时空联合特征在表征视频内容上的全面性以及顺序特征在鲁棒性上的贡献,以及视觉关注区域即图像中最能引起用户兴趣,最能表现图像内容的区域,这些区域特征的提出将会大大提高图像处理和分析的效率和准确度。引入人类视觉关注模型,提出了基于视觉关注的视频拷贝检测算法,分别研究了视觉关注模型的应用以及其在视频哈希形成以及视频哈希加权上的分析。论文最后还介绍了对基于视觉关注的视频拷贝检测算法的改进以及其在视频拷贝检测查全率查准率上的贡献。本论文的主要创新和贡献包括以下四个方面:(1)提出一种基于视频拷贝检测的时空联合哈希算法。该算法考虑到视频是一系列时间上连续的视频帧的集合,提取时空域特征来代替以往的只提取时域特征或空域特征,由于视频帧颜色的空间分布以及由于亮度变化和块效应导致的帧图像边缘信息变化,使得采用颜色直方图和运动矢量特征的视频内容特征提取方案不完善,这里采用视频帧块的顺序特征来提取视频内容的指纹,发现在检测中性能更好。(2)提出一种基于视觉关注的视频拷贝检测算法。该算法充分考虑到人的视觉系统对提取视频内容特征的影响,将人的关注加入到视频拷贝检测系统模型中,根据人眼对视频内容的关注程度的不同,赋予各视频帧块不同权重,进而在进行哈希匹配时,每一哈希比特位,不再是均一权重。这样对视频内容特征进行提取分析更符合人的感知。(3)介绍了视觉关注模型在视频哈希形成上的应用,区别与以前视频哈希指纹直接由提取视频帧块顺序特征得到,这里的改进是分别计算出时域信息代表图像的二值序列和视觉显著图的二值序列特征,进而将时域信息代表图像的二值序列特征和视觉显著图的二值序列特征进行融合得到最终的视频哈希指纹。这样所提取内容指纹包含了人的视觉关注,实验结果表明在保证查全率的同时,查准率得到提高。(4)进一步介绍了视频关注模型在视频拷贝检测系统上的改进,为了进一步提高视频拷贝检测的查全率和查准率,首先将时域信息代表图像的二值序列和视觉显著图的二值序列进行融合得到一个视频片段的二值序列,然后再次利用关注模型,根据人眼特性,对代表图像进行分块处理,计算出每一块的权重,再将此权重分配给上述视频序列的二值序列得到最终视频哈希指纹进行哈希匹配,其实验结果的稳定性,为视频拷贝检测提供了有利的参考价值。文章中提出的基于视觉关注的带有权重分配的哈希算法在视频拷贝检测上表现出了较好的鲁棒性与区分性。这样通过关注模型将得到的权重序列赋予上述对应的二进制比特流即得到最后的带有权重分配的视频哈希指纹。这样对视频内容特征进行提取分析更符合人的感知。

【Abstract】 With the development of multi-media, the spread of online video becomes very convenient and rapid. The shooting, edit and management of digital video are so easy that thousands of digital video are created everyday. Meanwhile illegal pirates always do some edit towards the video (for example, add noise, add frame, change scale, filtration, picture in picture, add subtitles, JPEG compression, change contrast and many other attack), making pirated video also appear in multiple rapid propagation, which violate the interests of copy owner heavily. With this development, the research of video copy detection technology becomes the hotspot of the fields of multimedia information copyright processing gradually, and come to use in video tracking, video content retrieval, video content authentication, copyright protection and video filtration. So how to build more robust video copy detection system model becomes the key research both at home and abroad.This paper introduces the basic theory of mechanism of video copy detection system firstly; and then introduces a kind of time and space combined hash algorithm used for video copy detection, and on this basis, this paper take the affect of human perception system to video content features into consideration, and introduce human visual attention model. And put forward video copy detection algorithm based on visual attention. They study the application of visual attention model and its analysis in video hash formation and video hash weighted. At last the paper introduces the improvement of the visual attention based video copy detection algorithm and its contribution in recall and precision ratio of video copy detection.The main innovation and contribution of this paper include the following four aspects:(1) Proposed a kind of video copy detection based time and space combined algorithm. This algorithm take that video is a set of a series of time continuous video frame into consideration. It extracts time domain and spatial feature instead of time domain feature or spatial feature only previous. Because of the space distribution of video frame color and image edge information changes owing to brightness changes and block effect, the extract scheme of video content feature is not perfect used color histogram and motion vector characteristic. Here we adopt the order feature of video frame block to extract the fingerprint of video content. And it turns out better performance in the detection.(2) Proposed a kind of video copy detection algorithm based on visual attention. This algorithm fully considers the influence of human visual system to the extracted video content feature, so it adds human attention to video copy detection system model. According to different attention degree of human eyes to video content, it gives different weights to each video. And thus there will be not only one weight per hash bit when do the hash matching. Then the extract and analysis of video content feature will more accord with human perception.(3) Introduced the application of visual attention model in hash formation. Compared to video hash fingerprint was formed directly from extracting the order feature of video frame block previously, the improvement was that compute the binary sequence feature of time domain information representative image and binary sequence feature of visual significant image. And then combining these two binary sequence features so that we can get the final video hash fingerprint. The content fingerprint extracted by this way includes human visual attention. The experiment show that it guarantees the recall ratio and meanwhile improve the precision ratio.(4) Introduced the improvement of video attention model in video copy detection system. In order to improve the recall ratio and precision ratio of video copy detection more, we combine the binary sequence feature of time domain information representative image and binary sequence feature of visual significant image to get a binary sequence of a video clip firstly, and then make use of attention model again, and do block process to representative image according to human eye characteristic. We compute the weight of every block and distribute this weight to the binary sequence of the above video to attain the final hash fingerprint and do hash matching. The stability of the experiment results provides favorable reference value for video copy detection.A novel video hashing algorithm is proposed, which takes account of visual saliency during hash generation. In the proposed algorithm, Experiments on different kinds of videos with different kinds of attacks verify that the proposed algorithm has better performance on robustness and discrimination.

  • 【网络出版投稿人】 山东大学
  • 【网络出版年期】2013年 11期
节点文献中: