节点文献

基于稀疏表示的语音信号欠定盲分离技术研究

Research on Underdetermined Blind Source Separation of Speech Signals Based on Sparse Representation

【作者】 郑翠

【导师】 张朝柱;

【作者基本信息】 哈尔滨工程大学 , 通信与信息系统, 2011, 硕士

【摘要】 信号处理在各个领域一直有着重要的地位,而伴随着移动通信和地质勘探技术的发展,迫切需要一种新的信号处理方法,盲信号处理就应运而生。盲源分离是二十世纪九十年代发展起来的一种新兴信号处理方法,它在研究语音增强、图像识别、生物工程信号、通信信号以及地震探测等领域中有非常重要的理论价值。传统盲源分离往往假设传感器个数大于信源数目,但是伴随着盲源分离问题的深入研究,欠定盲源分离问题备受关注,因为它是更符合实际,也是更具挑战的。在此条件下系统是不可逆的,传统的盲源分离算法失效,所以本文在语音信号稀疏表示的基础上,对语音信号欠定盲源分离的关键技术进行了研究。本文首先介绍盲源分离的发展现状,并对传统盲源分离算法进行了阐述。然后对欠定盲源分离的理论基础及关键技术进行了探索性研究。其中“两步法”是欠定盲源分离问题的热点,即首先通过聚类算法估计出混合矩阵,然后通过优化算法得到源信号估计,它与信号稀疏表示以及过完备基的选择有着密切关系。本文主要研究的内容如下:在信号稀疏表示的基础上,本文将目前模式识别聚类理论中最为成熟的模糊C均值聚类算法,运用到混合矩阵估计中。它克服了传统用势函数估计混合矩阵的方法中存在的参数选择复杂,势函数定义缺乏理论指导等缺点。但是模糊聚类自身存在对初始值敏感,易陷入局部最优等缺点,因此将其与差分进化结合,提出DE-FCM的混合矩阵估计算法,实现无监督聚类,参数选择简单,收敛速度快,估计准确等优点,并且达到全局优化。得到混合矩阵的估计之后,为了估计出不同稀疏程度的源信号,本文将平滑l0范数思想引入源信号估计中,不再使用最小l1范数的方法。因为最小化l1范数只有在信号足够稀疏时,才可以很好的恢复出源信号。而平滑l0范数直接利用一个近似函数来逼近l0范数以保证估计的性能,并利用一个控制因子来决定估计出的源信号稀疏性强度。实验表明该方法可以很好的实现源信号估计,而且得到源信号估计更符合混合模型。

【Abstract】 Signal processing plays an important role in various fields and with the development of mobile communication and the geological prospecting technology, a new kind of signal processing is in an urgent need, then blind signal processing arises at the historic moment. Blind source separation is developed as a new kind of signal processing method 1990s, which has important theoretical value in researching the speech enhancement, the image recognition, the bioengineering signal, communication signals, and seismic exploration.Traditional blind source separation is always based on the assumption that the sensor number is more than source number, but with the further research of blind source separation more attention has been paid to underdetermined blind source separation which is more conform to the fact and would be more challenging. In this condition the system is irreversible, so the traditional blind source separation algorithm would fail. In this paper, based on sparse representation of speech signal, the key technologies of underdetermined blind source separation are explained.In this paper, the development status of blind source separation is briefly introduced firstly. Then explorative study is made on theoretical basis and key technology of the underdetermined blind source separation. the basic algorithms are also discussed. "Two-step" algorithm cluster-then-optimization to estimate the mixing matrix and source signals separately is a hotspot of underdetermined blind source separation, and it has close relation with signal sparse representation and overcomplete basis. This paper mainly studies the content as follows:Based on the sparse representation of signals, the fuzzy C-means clustering algorithm whose theoretical basis is most mature among pattern recognition clustering theory is applied to estimate the mixing matrix here. It can overcome the disadvantage of traditional potential function algorithm used to estimate the mixing matrix, such as complexity in the parameter selection and lack of theoretical guidance to define the potential function. But fuzzy C-means clustering is sensitive to the initial value and easy to be trapped in local optimum, so it will be combined with the differential evolution, named DE-FCM algorithm to realize unsupervised clustering, simple parameter selection, fast convergence rate, more accurate estimation and achieve global optimization finally. After getting the mixing matrix estimation, in order to estimate source signals with different degree of sparsity, this paper proposed a method based on the smoothed l0 norm to recover the source signals. Here minimum norm l1 method is no longer used, because minimize l1 norm can get a good result only when the signals are sparse enough. The method based on the smoothed l0 only uses an approximate function to approximate l0 norm directly and the quality of the approximation depends on a parameter called control factor. Experiments show that this method can get good result and the result can fit the mixing model better.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络