节点文献

人脸画像—照片的合成与识别方法研究

Face Sketch-photo Synthesis and Recognition

【作者】 肖冰

【导师】 高新波;

【作者基本信息】 西安电子科技大学 , 信息与通信工程, 2010, 博士

【摘要】 人脸画像-照片识别是指以所提供的画像为示例在照片库中进行检索,确定待识别人的身份信息,在刑侦破案、反恐追逃和动漫设计中具有广泛的应用。在感兴趣目标照片缺失的情况下,该研究将为实现人脸识别提供有效的途径。画像是由画家根据其主观理解绘制而成,它利用线条的粗细、疏密来表达人脸的形状和纹理信息,而照片是通过光学成像设备或其它传感器获得的客观图像,它记录了图像的灰度或颜色信息。不同的产生机理和信息表达方式使得两者之间存在较大的形状结构及纹理灰度差异,即使同一个人的画像和照片具有相似的几何形状,但也一定有相差甚远的纹理信息。这使得现有的人脸识别算法很难在画像-照片识别中得以直接推广和应用。因此,以最低代价实现高性能的画像-照片转换成为本文研究的重点;将其中一种媒体的表达模式转化为另一种使其在同一特征模式空间实现相互识别,从而为跨媒体的信息检索和模式识别探索新的途径。本文针对画像-照片识别的基本问题,探索画像-照片不同信息表达模式之间的映射关系,在构建该映射关系模型的基础上提出画像-照片合成方法,从而将画像-照片识别转变为同一模式下的人脸识别,为提高画像-照片识别性能提供必要的前提条件。与此同时,融合人脸图像结构和纹理信息,提出了用于转换后特征空间的人脸识别方法。主要工作概括如下:1.提出了基于机器学习的伪照片合成方法。本文对现有的伪照片/伪画像合成方法进行了综合分析和实验,在此基础上,提出了基于嵌入式隐马尔可夫模型和分块策略的伪照片合成方法。该方法利用嵌入式隐马尔可夫模型对训练画像-照片块的映射关系建模,从而将画像转变到照片域表达模式下。通过理论分析和相关实验,证明了该方法只需要较少的训练样本就可以得到较精确的伪照片,克服了基于统计的方法无法用较少的训练样本获得伪照片的不足。2.确定了人脸照片/画像合成中的分块缝合策略。现有的基于局部策略的伪照片/伪画像合成方法均通过平均化分块重叠区域合并伪照片块/伪画像块,导致生成的伪照片/伪画像存在模糊和分块效应。针对这一问题,本文提出利用图像缝合思想将合成伪照片块/伪画像块进行拼接,该方法根据两个分块重叠区域中对应像素的差异,在重叠区域上搜索最短路径作为缝合线。实验结果表明了基于分块缝合策略的方法可以合成更高质量的人脸伪照片/伪画像,并给出了理论分析。3.探索了合成图像的质量评价方法。现有的图像质量评价方法主要针对噪声和人为因素造成的图像降质,鉴于合成伪照片和伪画像必须与原始图像具有相似的鉴别信息,我们阐述了一种新的图像质量评价测度,从图像本身的质量以及合成图像与原始图像的相似程度两方面来评估合成伪照片/伪画像的质量。同时,利用画像-照片识别的性能分析了该图像质量评价测度的有效性。4.验证了本文所提出的画像-照片合成方法可以有效提高画像-照片识别率。画像-照片被转换到同一特征空间后,利用主成分分析、独立成分分析、核主成分分析、保局投影、张量子空间分析和离线张量分析等无监督子空间学习算法在画像域和照片图像域进行人脸识别。实验结果表明,本文提出的画像-照片合成方法可以有效地提高画像-照片识别的正确率。5.构建了融合结构和纹理信息的双模人脸识别方法。本文主要研究的是不含有任何主观因素的写实素描画,其中每个人的画像和照片具有较为相似的形状结构信息和差异较大的纹理信息。通过两种特征空间的相互转换,在减弱其纹理信息差异的同时,保持了形状结构信息相似性,故而本文提出了融合人脸图像纹理和结构模型的双模人脸识别方法,通过在识别过程中加强结构模型相似性的比重进一步减弱纹理模型差异对正确鉴别人物身份的影响。实验结果表明,该识别方法对于人脸图像的光照、表情和尺度变化都具有较好的鲁棒性,为后续实现多因素影响下的画像-照片识别提供了有效的技术手段。同时,提出了两种度量结构模型相似性的图编辑距离算法,这两种算法独立于代价函数定义,具有执行效率高、对图像聚类分类准确率高、通用性强等优点。6.建立了具有自主知识产权的人脸照片-画像数据库和实验测试平台。现有的照片-画像库中画像的绘画风格单一,为了研究对不同绘画风格的画像具有鲁棒性的画像-照片识别算法,并为后续研究提供用于检验算法性能的通用数据库,我们从标准人脸库中选取照片,邀请多位画家创作了相应的素描画,尝试进行素描画画像数据库的构建。通过现有画像-照片识别算法的检验,初步验证了绘制的素描画是合格的。由于多位画家绘画风格不同,对应于同一幅照片的多幅画像存在差异,从而为基于机器学习的方法研究绘画风格对画像-照片识别算法性能产生的影响奠定了数据基础。

【Abstract】 Face sketch-photo recognition aims to determine the person’s identity by retrieving in the photo database using simulated sketches automatically. It is applied extensively to criminal investigation, anti-terrorism, animation design, etc.When the photos of the person in question are absent, we have to fall back on simulated sketches and then face recognition is performed by matching the sketches and photos in database. Sketches are produced by artists according to their subjective understanding, in which the thickness and density of lines are used to convey shape and texture information. Photos are obtained with optical imaging equipments or other sensors objectively, which record grayscale or color information of images. Different mechanism for generating and expressing sketches and photos leads to geometrical deformations and very different texture between them. Even though the sketch and photo of a person may be similar in geometry, their texture is always very different, which makes most of the existing significant achievements for face recognition inactive for sketch-photo recognition. Consequently, this paper delves into the perfect conversion of sketches and photos with the least cost. One of the above-mentioned two expression modalities is converted into the other and face recognition is performed in the same modality, which explores novel ways to conduct cross-media information retrieval and pattern recognition.In this dissertation, in view of essential problems of sketch-photo recognition, the mapping relationship between sketches and photos is explored, based on which sketch-photo conversion algorithms are proposed so as to transform sketch-photo recognition into the face recognition in the same modality. They are necessary preconditions to improve the sketch-photo recognition performance. Furthermore, the face recognition algorithm applied in the converted feature space is proposed, in which structure and texture informations are combined to express face images. The main contributions of this dissertation are summarized as follows:1. Machine learning based pseudo-photo synthesis algorithm is proposed. The existing sketch / photo synthesis algorithms are summarized and analyzed in this dissertation. Based on this, we propose a photo synthesis algorithm based on local strategy and embedded hidden Markov model. The embedded hidden Markov model is used to learn the mapping relationship of training sketch patch-photo patch pair, so that pseudo-photos are produced in terms of sketches. Theoretical comparison and experimental results show that the proposed algorithm derives high quality pseudo-photos with less training samples, and consequently overcomes the problem that statistics based methods require a large set of training samples to generate pseudo-photos.2. Image quilting is introduced in the synthesis of pseudo-photo / pseudo-sketch. In the existing local strategy based pseudo-photo/pseudo-sketch synthesis methods, pseudo-photo patches / pseudo-sketch patches are combined by averaging the overlapping regions, which leads to the synthesized sketches/photos with blurring effect and noticeable block edges. Aiming at this problem, pseudo-photo patches / pseudo-sketch patches are stitched into a pseudo-photo / pseudo-sketch with image quilting in this dissertation. According to the different values of corresponding elements in the overlapping areas of two patches, the minimum cost path through the overlapping area is searched for and serves as quilting edge of two overlapping patches, which combines these two patches smoothly. Experimental results show that image quilting based methods may derive pseudo-photos / pseudo-sketches with higher quality, and theoretical analysis is performed.3. Image quality assessment of the synthesized images is explored. The existing image quality assessment algorithms mainly aim at quality degradation caused by noise and artifacts. Because the synthesized sketches/photos should preserve discrimination information as the original images, a novel image quality metric is expounded. The quality of the synthesized sketches / photos is evaluated from two aspects that are visual quality of the synthesized images themselves, and the similarity of the synthesized image and original image. The effectiveness of the proposed image quality assessment metric is analysed with the performance of sketch-photo recognition.4. It is proved that the performance of sketch-photo recognition is enhanced effectively with the proposed sketch-photo synthesis algorithms. After sketches and photos are transformed into the same modality, unsupervised subspace learning methods, such as principal components analysis, independent component analysis, kernel principal component analysis, locality preserving projection, tensor subspace analysis and offline tensor analysis, are adopted for face recognition in photo space and sketch space. It is proved experimentally that the proposed sketch-photo synthesis algorithms are effective to achieve promising results of sketch-photo recognition. 5. The biview face recognition algorithm is constructed by integrating texture information with shape information. In this dissertation, the sketches are realistic and affected slightly by subjective factors. The sketch and photo of a person are similar in geometry and very different in their texture. The difference of their texture is weakened and the similarity of their structure is preserved by the transformation of two feature spaces. So biview face recognition algorithm which relies on the cooperation of texture model and structure model of face images is proposed. The influence of texture difference on face recognition is further weakened by strengthening the structure similarity in recognition. Experimental results show that the proposed face recognition algorithm is robust against variation of illumination, expression and scale, which provides sketch-photo recognition under many variations with effective ways. Besides that, two graph edit distance algorithms are proposed for measuring similarity of structure models. These two algorithms are completely independent on cost function definition, and they have the virtue of high efficiency, generability and correct rat of clustering and classifying images.6. A new face photo-sketch database and experiment testing platform are constructed with independent intellectual property rights. Drawing style of sketches in the existing photo-sketch database is simplex. Aiming at sketch-photo recognition algorithms robust to sketches with different drawing style and a universal database for examining the performance of sketch-photo recognition algorithms in the further research, a new photo-sketch database is constructed. Photos are selected from standard face databases, according to which some artists are invited to produce sketches. The eligibility of the produced photo-sketch pairs is validated by the existing sketch-photo recognition algorithms. Because of different drawing styles, sketches corresponding to a photo are distinct,which is the foundation of performing machine learning based researches on the influence of different drawing styles on sketch-photo recognition.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络