节点文献

超分辨率重建与图像增强技术研究

Research on Super-Resolution Reconstruction and Image Enhancement Technique

【作者】 乔建苹

【导师】 刘琚;

【作者基本信息】 山东大学 , 通信与信息系统, 2008, 博士

【摘要】 随着多媒体通信和信息处理技术的发展,人们对图像分辨率的要求越来越高。然而许多成像系统受其物理条件的限制,所得到的图像往往分辨率很低,而且会有退化变形和受噪声污染,图像的视觉效果难以满足人们的需要。如果采用改善物理硬件的方法来获取人们满意的高分辨率图像,则成本较高,而且有时难于实现。因此有必要考虑一种能够克服这些限制的新措施来提高图像空间分辨率。超分辨率重建技术就在这种背景下应运而生。该技术弥补了硬件方面的不足,从软件的角度提高图像的分辨率,增强图像的可用性。因此这种技术一经提出,便引起了许多学者的广泛重视和关注,目前在电视、遥感、医学和公安系统等领域都具有非常重要的实际应用价值。超分辨率重建技术是指融合多幅包含相似信息但存在不同细节的低分辨率图像得到一幅或多幅高分辨率图像,同时消除加性噪声以及由有限检测器尺寸和光学元件产生的模糊,而无须提高系统的硬件成本。近年来,该技术已成为国际上图像处理领域最活跃的研究课题之一。本论文系统回顾了超分辨率技术的相关理论和经典算法,针对实际应用中面临的诸多复杂问题,如模糊函数未知、存在光照影响、噪声等因素,以矢量量化、支撑向量机、流形学习等为主要数学工具,提出一系列超分辨率重建和图像增强方案。同时作为人脸识别的一个预处理环节,研究了基于独立分量分析的人脸超分辨率重建,有效提高了人脸识别率。本论文的主要成果概括如下:1、多帧图像的超分辨率重建。主要包括三个部分:第一部分针对不同类型的噪声,结合L1范数和平稳小波变换,提出基于平稳小波变换的鲁棒解决方案,同时由于综合考虑了图像各个方向的边缘特性,该方案还具有一定的自适应边缘保持能力。第二部分针对低信噪比环境下的超分辨率重建问题,提出基于峭度的重建方案。该方案首先对峭度图像进行了定义,然后分析得出峭度图像的两个重要性质:(a)峭度图像对高斯噪声具有鲁棒性;(b)峭度的绝对值随着图像模糊程度的加大而变小,因而最清晰的图像的峭度的绝对值应该最大。基于这两个特性,该方案在满足高分辨率图像与低分辨率图像之间的反卷积的剩余误差有界的前提下,最大化峭度绝对值来求解未知的高分辨率图像,然后采用Lagrange乘子法则求解此约束优化问题。与传统方法相比,该方案在低噪环境下具有明显的复原效果,且计算复杂度较低。第三部分主要研究了基于分割的超分辨率重建方案。利用高阶统计将图像分割成不同的区域,根据分割的结果对平坦区域和纹理区域实施不同的规整化泛函,以改善重建结果,有效保持图像细节信息。2、基于学习的单帧超分辨率重建。鉴于传统方法易受光照因素影响的问题,提出两种同时实现超分辨率重建和图像增强的方案。在对流形学习、自商图像(SQI)进行了深入研究的基础上,提出一种不随光照变化的图像表示方法—对数-小波变换(Log-WT)。然后分别利用Log-WT和SQI提取光照不变量作为图像特征,并假设高分辨率图像块构成的空间和低分辨率图像块构成的空间具有相似的局部几何结构,在流形学习的框架下,借助于局部线性嵌入的思想获得高分辨率图像的初始估计,最后对其加入先验约束,从而同时实现了超分辨率重建和图像增强。该方案在提高图像空间分辨率的同时克服了光照因素的影响,特别是对阴影效应的消除具有明显效果。3、提出基于矢量量化的盲超分辨率重建方案。该方案假设模糊函数的类型已知,且可由某一参数来表征,利用矢量量化技术,依照最小距离准则从一组候选参数中估计真实的模糊函数的参数。利用Sobel算子形成特征矢量增强了算法对辨识不同类型图像的模糊函数的鲁棒性,并利用DCT对特征矢量降维,减小了计算量。同时将其扩展应用于超分辨率图像重建中,辨识出多幅低分辨率图像的模糊函数,然后融合具有不同模糊函数和信噪比的低分辨率图像,实现了盲超分辨率图像重建。4、提出基于支撑向量机的盲超分辨率重建方案。该方案从模式识别的角度出发解决模糊函数的辨识问题。首先通过边缘检测和局部方差从训练图像中提取能够表征模糊参数信息的特征向量,然后利用支撑向量机建立特征向量与对应的候选参数的映射关系,最后通过建立的模型辨识低分辨率图像的模糊函数,进行超分辨率重建。5、提出基于独立分量分析的人脸超分辨率重建方案。该方案利用独立分量分析(ICA)理论从高分辨率训练图像中提取出独立分量,并对ICA系数进行先验估计。给定一幅低分辨率人脸图像,结合最大后验概率(MAP)估计理论求出ICA系数,然后ICA反变换得到高分辨率图像的估计。仿真结果表明该方案有效实现了人脸超分辨率重建,保持了人脸整体结构特征,且对光照、表情、姿态等因素具有一定的鲁棒性。将重建结果应用于人脸识别,有效提高了识别率。综上所述,本论文针对目前超分辨率重建技术中存在的问题,以模糊辨识、增加图像动态范围、消除光照和噪声影响为核心进行了深入研究,同时将支撑向量机、流形学习、ICA等理论应用于问题的建模和求解当中,增加了技术的实用性和灵活性。论文最后总结了该研究领域亟待解决的一些问题和下一步的研究重点,同时对该领域的发展趋势进行了展望。

【Abstract】 With the development of multimedia communication and information processing, there is a high demand for high-resolution (HR) images. However, it is hard to get the desired image because of the physical limitations of the image acquisition system. The recorded images are usually degraded, noisy and downsampled. Improving the performance of the hardware is clearly one way to increase the resolution of the acquired images. But this method may not be feasible due to the increased cost. Therefore, alternative methods should be provided in order to increase the spatial resolution. The technique of super-resolution (SR) is developed under this circumstance. The aim of the SR is to improve the resolution using software techniques which makes up the hardware deficiency. Many researchers have devoted themselves to this area and the research of SR will be of great value in a lot of applications such as TV, remote sensing, medical imaging, surveillance system and so on.Super-resolution reconstruction refers to obtain a high resolution image or image sequence by fusing multiple similar but different low-resolution (LR) images and remove the additive noise and optical blur simultaneously. Recently, it has been one of the most active research areas in the world.This dissertation reviews the SR related theory and classical algorithms systematically. Due to the various problems in real applications such as blur estimation, noise, different light conditions and so on, this dissertation presents several super-resolution reconstruction and image enhancement schemes based on vector quantization (VQ), support vector machines (SVM) and manifold learning. In addition, as a preprocessing technique of face recognition, an independent component analysis (ICA) based facial image SR reconstruction approach is proposed and the recognition rate is improved efficiently by this method. Generally, this dissertation consists of the following parts:1. Multi-frame super-resolution reconstruction. Three schemes are presented in this part. Firstly, by combining the L\ norm minimization and stationary wavelet transform (SWT), a robust SWT based SR scheme is proposed to deal with different noise models. This method is also effective to preserve the edges of the image due to the high-frequency information in different directions extracted from the image. Secondly, a kurtosis-based scheme is proposed to address SR image reconstruction in low SNR environments. After the definition of the kurtosis image, its two important properties are analyzed: (a) kurtosis image is free from Gaussian noise; (b) the absolute value of kurtosis image becomes smaller as the image gets smoother. Therefore, the estimated HR image should have the largest absolute local kurtosis. Based on these two characteristics, the HR image is estimated by solving an absolute local kurtosis maximization problem with the constraints that residue of the observed data and the solution are bounded. Lagrange multiplier is applied to solve the combinatorial optimization problem. The proposed method is better than the conventional algorithms in terms of visual inspection under severe noise background and has low computational complexity. Thirdly, a segmentation-based scheme is proposed. In this scheme, the image is divided into various regions by making use of high order statistics. Different regularization terms are applied to homogenous regions and non-homogenous regions according to the segmentation label. Detail information of the reconstructed SR image is well preserved.2. Learning based single-frame super-resolution reconstruction. Two schemes for simultaneous super resolution and image enhancement are presented to solve the illumination problems in SR technique. Based on the manifold learning and self quotient image (SQI), a logarithmic-wavelet transform (Log-WT) is defined for the elimination of lighting effect in the image. After that, illumination-free features are extracted by exploiting Log-WT or SQI. Under the framework of manifold learning and local linear embedding, an initial estimation of high resolution image is obtained based on the assumption that small patches in low resolution space and patches in high resolution space share the similar local manifold structure. Finally the desired HR image is reconstructed by applying the reconstruction constraints in pixel domain. The proposed method simultaneously achieves single-image super-resolution and image enhancement especially shadow removing.3. A blind SR scheme based on vector quantization is proposed. Assume that the blur type is known and blur function is parameterized by one parameter. Based on the VQ technique, the best estimation is found within a set of candidates according to the minimum distortion. Feature extraction by Sobel operator improves the robustness of the method to different types of images. DCT is utilized to reduce the dimension of the vector which leads the low computational complexity. Meanwhile, extension of this method to blind super-resolution image reconstruction is achieved. After blur identification, a super-resolution image is reconstructed from several low-resolution images obtained by different foci.4. A support vector machines based scheme of blind SR image reconstruction is proposed. In this scheme, blur identification problem is solved from the viewpoint of pattern recognition. Edge detection and local variance are used to extract feature vectors which contain the information of blur parameter from training images. Then SVM is used to classify these feature vectors. Finally, the acquired mapping between the vectors and corresponding blur parameters provide the identification of the blur and further estimate the HR image.5. An independent component analysis based face SR scheme is proposed. In this scheme, the independent components (ICs) are obtained by offline training high resolution face images. The prior of ICA coefficients are estimated by performing PCA on training images. Given a LR image, the high resolution image is reconstructed by the linear combination of the ICs where the weight coefficients are obtained by the method of maximum a posteriori (MAP). Experimental results demonstrate that the proposed method is robust to various pose, expressions and lighting conditions. The hallucination results preserve both the global structure and the high spatial-frequency information better such as sharp edges and high contrast. The HR results are then applied to face recognition which improves the recognition rate.In summary, blur identification and image dynamic improvement as well as light conditions and noise removal are investigated in this dissertation. Meanwhile, the new mathematical tools including SVM, manifold learning, and ICA are used in the modeling and solution of the proposed schemes which makes the schemes more useful and flexible. Finally, the problems to be solved related to this research area and future research topics are summarized, furthermore, the prospect of the developing tendency is analyzed as well.

  • 【网络出版投稿人】 山东大学
  • 【网络出版年期】2009年 01期
  • 【分类号】TP391.41
  • 【被引频次】25
  • 【下载频次】3062
  • 攻读期成果
节点文献中: 

本文链接的文献网络图示:

本文的引文网络