节点文献

基于稀疏表示模型的图像复原技术研究

Research on Image Restoration Technology Based on Sparse Representation Model

【作者】 张健

【导师】 赵德斌;

【作者基本信息】 哈尔滨工业大学 , 计算机应用技术, 2014, 博士

【摘要】 随着信息技术的蓬勃发展,社交网络和移动互联网的全面兴起,以及大数据时代的汹涌来袭,数字化的多媒体技术已经无处不在,在人类社会的各个领域都得到了广泛的应用。图像/视频等视觉信号已经成为数字多媒体内容的主要载体。数字图像的质量可以说在人们进行交流通信和视觉感知的过程中起着至关重要的作用。在数字图像的采集、存储、传输和处理过程中,由于拍摄设备及人为操作不当等诸多因素不可避免使得最终获得的图像失真,或称为图像退化,从而影响图像质量。因此,利用退化图像对原始真实图像进行重建的图像复原技术,一直以来都是数字图像处理领域里的研究热点,对其具有十分重要的理论和现实意义。目前,图像复原已经演变成图像处理、计算机视觉和计算成像三者之间的一个交叉领域。由于图像在退化过程中丢失了信息,图像复原逆问题通常具有病态特性。图像的先验模型在图像复原问题中起着重要作用。利用自然图像的先验模型可以对真实解空间进行正则化约束,从而将具有不适定性的图像复原病态问题转换为适定问题,获得符合人眼视觉特性的稳定解。本文以图像稀疏表示先验模型为出发点,重点针对图像修补、图像去模糊、图像噪声去除、图像超分辨率以及图像压缩感知重建五个图像复原问题展开深入的研究。具体研究内容分为以下四个部分:第一,提出了一种基于空间-变换域上联合稀疏统计模型的图像复原算法。传统的基于图像先验模型正则化图像复原算法存在着两个缺陷。一方面,通常是只利用一个图像的先验特性,不能得到令人满意的复原效果;另一方面,在刻画图像的非局部自相似性时大多数都是利用一种加权的方式,没能够充分利用这一特性。针对以上问题,本文从图像统计的角度出发,分别在图像二维空间域上建立了刻画图像局部平滑特性的像素级稀疏统计模型和在图像三维变换域上建立了刻画图像非局部自相似特性的图像块级稀疏统计模型,并将以上两种稀疏统计模型合并,构成了一个混合空间-变换域上能够同时刻画图像局部平滑特性和非局部自相似性的联合稀疏统计模型。将联合稀疏统计模型嵌入正则化框架中,提出了一个新颖的求解一般图像复原逆问题的目标函数。通过图像修补、图像去模糊和图像混合高斯和脉冲噪声去除三种应用验证了提出算法的有效性、鲁棒性和收敛性。第二,提出了一种基于图像结构组稀疏表示模型的图像复原算法。传统的基于自然图像块的稀疏表示通常面临两个问题。一是在字典学习的过程中需要求解一个具有非常高计算复杂度的大规模优化问题;二是在稀疏编码和字典的学习过程中,每一个图像块都是独立考虑,忽略了块与块之间的相关性,从而导致了稀疏编码系数不够准确。为了解决以上两个问题,本文打破了传统的基于图像块为图像稀疏表示基本单位的束缚,提出了一个新颖的以结构组为像稀疏表示基本单位的图像结构组稀疏表示模型。图像结构组是由具有相似结构的图像块组合,从而在对图像结构组进行稀疏表示的过程中能够显式地在一个统一的框架下同时刻画自然图像固有的局部稀疏性和非局部自相似性。在正则化框架下,提出一个新颖的具有一般性的求解图像复原逆问题的基于图像结构组稀疏表示模型的L0范数目标函数,并且对于每一个结构组,设计了一个高效且低复杂度的自适应稀疏表示字典。在图像修补、图像去模糊和图像压缩感知重建三个应用中,实验结果表明提出的基于结构组稀疏表示模型的图像复原技术取得较目前许多主流算法更好的性能。第三,提出了一种基于L0范数自适应学习稀疏基的图像压缩感知重建算法。目前存在的压缩感知重建算法中大都采用固定的基函数,也就是在确定的域中对信号进行分解,比如:DCT域、小波域和梯度域,但这些域都忽略了自然信号的非平稳特性,缺乏自适应能力,从而不能够将图像分解得足够稀疏,也就使得压缩感知重建的效果很差,限制了压缩感知在图像方面的应用。针对以上问题,本文将刻画整幅自然图像稀疏性的基于块的冗余稀疏表示模型引入到图像压缩感知重建问题中,提出了基于L0范数自适应学习稀疏基的图像压缩感知重建框架。在此框架中,基于L0范数自适应学习稀疏基的作用是能够获得更好的自适应性和更高的稀疏度,进而极大缩小压缩感知解空间,得到稳定解;而基于块的冗余稀疏表示模型作用则是能够减少重建图像的块效应,获得更高质量的视觉重建效果。实验结果证明提出的方法取得了较目前主流算法相当甚至更好的性能。第四,提出了一种基于双重字典学习和稀疏表示的图像超分辨率算法。传统基于字典学习和稀疏表示的图像超分辨率算法存在高、低分辨率图像具有较大频谱差异的问题,从而造成在图像超分辨率过程中损失很多高频信息。针对以上问题,本文提出将需要重建的图像高频信息分解为主要高频信息和残差高频信息两部分。对于以上两种类型的高频信息,利用训练图像基于稀疏表示模型分别逐次设计训练出双重字典,即第一重主要字典和第二重残差字典。在图像超分辨率过程中,采用渐进式的方式运用双重字典和稀疏表示进行高分辨率图像重建。实验结果表明提出的算法能够缩小高、低分辨率图像的频谱差异,获得更丰富的图像细节和更高质量的高分辨率图像。

【Abstract】 With the rapid development of information technology, mobile internet, solocial network, and the boom of Big Data, digitized media technology has been found everywhere and been widely applied in various fields of human society. Image and Video are becoming the main carriers of visual signals for digital multimedia content. The quality of digtal images plays a significant role in the process of visual perception and communication. The degradations of digital im-ages are inevitablly caused by many factors during the period of image acquisi-tion, storage, transmission and processing. Therefore, image restoration technol-ogy, which infers the recovery of the original image from the observed degraded version, has been a hot and basic topic in the field of image processing. The re-search of image restoration has a theoretical and practical significance. Nowa-days, image restoration has evolved into an energetic field at the intersection of image processing, computer vision, and computational imaging. Due to the in-formation loss in the process of image degradation, image restoration as an in-verse linear problem is usually ill-posed. The prior models of natural images have important impact for solving the problems of image restoration. The utiliza-tion of natural image prior models is able to constrain the solution space, and enable the inverse problem well-posed, achieving the restored images that are coincident with the characteristics of human visual perception. In this thesis, based on the sparse representation prior modeling of natural images, we mainly focus on the research topics on five image restoration problems: image inpainting, image deblurring, image noise removal, image super-resolution, and image com-pressive sensing recovery. The contents of the thesis can be divided into four sections that are detailed as follows.First, a novel image restoration algorithm based on a joint sparse statistical modeling in space-transform domain is proposed. Traditional image prior models regularized image restoration algorithms usually have two shortcomings. On one hand, only one image property used in regularization-based framework is not enough to obtain satisfying restoration results. On the other hand, the image property of nonlocal self-similarity should be characterized by a more powerful manner, rather than by the traditional weighted graph. To rectify the above prob-lems, from the perspective of image statistics, we first establish two sparse statis-tical models by characterizing image local smoothness in two-dimensional space domain at pixel level, and image nonlocal self-similarity in three-dimensional transform domain at patch level, respectively, and then merge the two models into a novel joint sparse statistical modeling in hybrid space-transform domain, which offers a powerful mechanism of combining local smoothness and nonlocal self-similarity simultaneously to ensure a more reliable and robust estimation. Under the regularization-based framework, a new form of minimization func-tional for solving image inverse problem using the joint sparse statistical model-ing is formulated, which is accompanied by an effective choice for the corre-sponding regularization parameters. Extensive experiments on image inpainting, image deblurring and mixed Gaussian plus salt-and-pepper noise removal appli-cations verify the effectiveness of the proposed algorithm.Second, a novel image restoration algorithm based on structural group sparse representation (SGSR) is proposed. Traditional patch-based sparse repre-sentation modeling of natural images usually suffer from two problems. First, it has to solve a large-scale optimization problem with high computational com-plexity in dictionary learning. Second, each patch is considered independently in dictionary learning and sparse coding, which ignores the relationship among patches, resulting in inaccurate sparse coding coefficients. To rectify the above problems, instead of using patch as the basic unit of sparse representation, this thesis exploits the concept of structural group as the basic unit of sparse repre-sentation, which is composed of nonlocal patches with similar structures, and es-tablishs a novel sparse representation modeling of natural images, called struc-tural group sparse representation (SGSR). The proposed SGSR is able to sparsely represent natural images in the domain of structural group, which enforces the intrinsic local sparsity and nonlocal self-similarity of images simultaneously in a unified framework. Under the regularization-based framework, a new form of minimization functional for solving image inverse problem via SGSR in the form of L0norm is formulated, associated with an effective self-adaptive dictionary learning method for each structural group with low complexity. Extensive ex- periments on image inpainting, image deblurring and image compressive sensing recovery manifest that the proposed SGSR modeling outperforms many current state-of-the-art schemes in both PSNR and visual perception.Third, a new algorithm for image compressive sensing recovery using adap-tively learned sparsifying basis via L0minimization is proposed. Most of the conventional CS recovery approaches exploited a set of fixed bases (e.g. DCT, wavelet and gradient domain) for the entirety of a signal, which are irrespective of the non-stationarity of natural signals and cannot achieve high enough degree of sparsity, thus resulting in poor CS recovery performance and restricting the CS application in natural images. This thesis introduces the patch-based redundant sparse representation model that is used to characterize the intrinsic sparsity of the whole image into the problem of image compressive sensing recovery, and develops a framework for image compressive sensing recovery using the adap-tively learned sparsifying basis in the form of L0norm. The role of the adaptive learned sparsifying basis in the form of L0is to achieve high enough degree of adaptability and sparsity, thus greatly confining the CS solution. The role of the patch-based redundant sparse representation model is to reduce blocking artifacts and obtain the CS results with high visual quality. Experimental results on a wide range of natural images for CS recovery have shown that the proposed algorithm achieves significant performance improvements over many current state-of-the-art schemes.Fourth, a novel image super-resolution algorithm via dual-dictionary learn-ing and sparse representation is proposed. Aiming at reducing the large gap of the frequency in the low and high resolution images exhibited in the traditional dictionary learning and sparse representation based image super-resolution methods and recovering more image high-frequency information, in this thesis, the high-frequency (HF) to be estimated in super-solution is considered as a combination of two components: main high-frequency (MHF) and residual high-frequency (RHF). As for above two types of high-frequency, two corre-sponding dictionaries are successively learned by making use of training images via sparse representation, i.e., the main dictionary and the residual dictionary. In the process of image super-resolution, the input low resolution image is first magnified and added the reconstructed main high-frequency information by vir- tue of the main dictionary, thus generating a temporary image. Then, by pre-forming the same reconstruction scheme with the temporary image and the re-sidual dictionary, the reconstructed residual high-frequency information is recon-structed, leading to the final high-resolution image. Extensive experimental re-sults on test images validate that by employing the proposed two-layer progres-sive scheme, more image details can be recovered and much better results can be achieved.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络