节点文献

基于语义稀疏表示的不良图像检测算法

Adult Image Detecting Algorithm Based on Sparse Representation of Image Semantic Information

【作者】 李亮

【导师】 高新波;

【作者基本信息】 西安电子科技大学 , 智能信息处理, 2011, 硕士

【摘要】 高质量的多媒体通信使人们易于获取大量的有用信息。然而,淫秽、色情等不良信息的传播方式也更为隐蔽化和多样化,而且带来严重的社会问题。因此该方面的研究成果对净化社会环境、保护青少年健康成长,维护社会伦理道德等方面有积极的作用。本文系统分析了现有方法的优缺点,针对如下问题进行较深入的研究:鲁棒的肤色建模、不良特征表示和判别。主要研究成果如下:针对彩色不良图像中包含裸露的皮肤区域的特点,提出一种基于主动学习的贝叶斯肤色建模方法。根据YCbCr色彩空间与人类视觉感知机理相类似的特点和该空间中肤色的聚类特性,选择在YCbCr色彩空间上建立贝叶斯肤色判别模型;进而,通过Bootstrap主动反馈方法提升肤色训练样本的典型性和多样性,避免肤色模型发生过拟合。为在肤色区域内引入高层语义信息,提高不良图像判别的准确度,本文在肤色区域内检测高斯差分算子的极值点得到图像的感兴趣点,根据SIFT描述子对平移、旋转、尺度缩放、亮度变化、遮挡和噪声等具有良好的不变性,对仿射变换也保持一定程度的稳定性,本文采用SIFT算法对兴趣点进行描述,提取出能更好体现这类图像的特征。考虑到待测图像的多样性,本文提出基于词袋模型和稀疏表示的不良信息判别方法。SIFT特征匹配较为敏感,于是本文借鉴词袋模型将图像看作由码本中的码词组合而成。在不同的距离侧度下,通过K-均值聚类降低特征的敏感性,为不良图像和正常图像分别建立码本。将测试图像在两类码本分别进行稀疏表示,通过两类图像的重构误差来判断待测图像的性质。论文工作从实际应用中抽象出科学问题,涉及到计算机视觉和统计机器学习的最新理论。研究内容富有前瞻性和挑战性,具有极其重要的理论意义和应用价值。

【Abstract】 People can get plenty of useful information through high-quality multimedia communications. However, the transmission of pornographic and other undesirable information is also more subtle and concealed. This trend leads to serious social problems. Therefore, the research achievements for this purpose are helpful to purify the social environment, protect the mental health of the underages, and safeguard social ethics. The existing methods are systemically reviewed in this paper. Some crucial problems are discussed, such as, robust skin modeling, feature representation and evaluation of pornographic images. The main achievements of this paper are summarized as follows.Since pornographic images always contain skin area, we proposed an active learning based Bayes discriminative model for skin detection. According to the similar mechanism of perception between the human visual system and YCbCr color space, and the existence of clustering trend in YCbCr color space, we choose the YCbCr color space to build our Bayes discriminative skin model. We further incorporate the Bootstrap method to improve the diversity and typicality of training samples to avoid model overfitting.To improve the accuracy of image determination, we introduce the high level semantic information to the skin area analysis. The difference of Gaussian based extreme points detection operator is used to extract the is invariant to sifting, rotation, scale, brightness, occlusion and noise changes, and can maintain the stability for translation affine in certain degree. We adopt SIFT descriptor to represent the interesting points of the images, which can reflect the characteristics of the adult images better.To handle the diversity of the test image, we propose a bag of words model and sparse representation based method for adult image evaluation. SIFT features are sensitive to points matching. We regard the images as the combination of the visual words, which come from the codebook. Under different distance measurements, the SIFT features are clustered by K-means clustering algorithm to reduce the feature sensitivity, through which the codebooks of normal images and adult images are formed, respectively. Then we represent the test images on these two codebooks by two reconstruction errors.This work abstracts scientific issues from the practical applications. It involves the latest theories in computer vision and statistical machine learning areas. This research is forward-looking and challenging, it has an extremely important theoretical and practical value.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络