节点文献

VQ与HMM联合模型语音信号的实验研究

【作者】 李国峰

【导师】 吴胜举;

【作者基本信息】 陕西师范大学 , 声学, 2010, 硕士

【摘要】 人类一直都在梦想着能够通过语言直接指挥机器做出相应的各种动作,以便于完成特殊环境下的工作。但是,在很长的时间里,这个梦想没有能够实现。直到信息化时代的今天,计算机科学及其相关学科的高速发展为人类的这个梦想提供了高效的实现手段,使机器理解人的语言成为可能,这种使机器理解语言的技术就是语音信号分析识别技术。近二十年来,人类社会的高速发展对语音信号分析识别技术提出了越来越高的要求,同时科学技术的进步也为语音信号分析识别技术提供了各个方面的理论与技术支持,使语音信号分析识别技术取得显着进步,开始从实验室走向市场。在不久的将来,语音信号分析识别技术将进入工业、家电、通信、汽车电子、医疗、家庭服务、消费电子产品等人类生产生活的各个领域。本文在语音信号分析识别的基本原理和基本技术的基础上,通过对马尔可模型(HMM)和矢量量化模型(VQ)的研究和分析,针对HMM模型虽然建模能力很强,但是识别能力受到环境影响很大,而矢量量化模型建模能力虽然不强,但由于矢量的相似性使得它的识别能力很好的特点;在分析了两者的优缺点基础上,提出了新的模型和算法。同时,依据实验条件,选择Mel参数作为识别特征参数。在新的模型下,建立了语音分析识别系统,对所选取的语音信号进行特征参数提取和语音信号分析识别。在相同条件下,对同一语音信号的分析识别结果与HMM模型的分析识别结果进行了对比,研究结果表明:联合模型的识别结果普遍高于单一HMM模型,联合模型的性能要优于HMM模型;并进一步应用建立的联合模型在指定样本、指定语音信号、不指定样本不指定语义信号这三种情况下,做了联合模型的稳定性实验,得出了联合模型的性能比较可靠、运算比较良好。

【Abstract】 Humans have been dreaming through direct language can make all sorts of corresponding machine, in order to finish the work under special environment. But, in the very long time, can not achieve the dream. Until today, the information age of computer science and related disciplines for the rapid development of the human dream provides efficient means of realization, make the machine understanding human language, the language of the machine understanding speech signal analysis & identification technique, technique.Nearly two years, the rapid development of human society in speech signal analysis is increasingly high recognition technology, and the requirements for the advancement of science and technology of speech signal analysis to identify technology provides all aspects of the theory and technology support, make the speech signal analysis made significant progress in identifying technology, start from the lab to the market. In the near future, the speech signal analysis to identify technology will enter industry, electrical appliances, communications, automotive electronics, medical, family service, consumer electronic products such as production all spheres of life.Based on the analysis of the speech signal recognition principle and basic technical basis, through the markov model (HMM) and vector quantization model (VQ), according to the study and analysis of the modeling ability strong HMM, but the ability to identify the influence by environment, While vector quantization model ability is not strong, but because, although the vector similarity makes it good ability to identify, This study analyzed the advantages and disadvantages of both in put forward, based on the new model and algorithm. At the same time, according to the experimental conditions, choose Mel parameters as recognition characteristic parameters. In the new model is established, the speech recognition system, the selection of the speech signal of speech signal feature extraction and recognition of speech signal analysis. Under the same conditions, the analysis of the same speech signal recognition results and the analysis of the model identification results HMM compares the results of recognition, joint model is higher than single HMM model, the joint model has better performance than the HMM model, And to further establish joint model is applied in the designated specimen, designated speech signal, not specified sample not specified semantic signal the three cases, joint model experiment, the stability of the performance of joint model is reliable, the operation is in good condition

节点文献中: