节点文献

支持向量机在语音识别中的应用研究

Application Research of Support Vector Machine in Speech Recognition

【作者】 郭月玲

【导师】 张雪英;

【作者基本信息】 太原理工大学 , 信号与信息处理, 2009, 硕士

【摘要】 语音识别是语音信号处理的一个重要方面,是人机交互技术的基础,有着广阔的应用前景。因此,对语音识别进行研究具有重要的理论价值和实际意义。本文首先系统地介绍了语音识别的基本原理,分析了目前主要的语音识别方法的局限性和不足,概述了本文研究的基础——统计学习理论和支持向量机方法,分析了支持向量机在语音识别技术中的应用前景。为了验证支持向量机在语音识别系统中的识别效果,本文分别构建了基于线性核支持向量机、径向基核支持向量机、三阶多项式核支持向量机以及Sigmoid核支持向量机的非特定人孤立词语音识别系统,并进行了大量的仿真实验。实验结果表明,前三种支持向量机应用于语音识别系统中均取得了优于隐马尔可夫模型的识别结果,并且支持向量机的运行速度也优于隐马尔可夫模型;而Sigmoid核支持向量机应用于语音识别系统中却得到了不尽如人意的识别结果。因此,核函数的不同直接影响着支持向量机的分类性能,从而影响了语音识别系统的识别效果。其次,为了研究在核函数相同的情况下,核参数和惩罚因子的不同取值对支持向量机推广性能的影响,本文构建了基于径向基核函数支持向量机的非特定人孤立词语音识别系统。在实验中,分别取了核参数和惩罚因子的三组不同的值进行了语音识别实验。实验结果表明,核参数和惩罚因子的不同取值也会影响支持向量机的推广性能,从而影响语音识别系统的识别效果。核函数的类型、核参数以及惩罚因子的选取直接影响着支持向量机语音识别系统的识别效果。然而,到目前为止,支持向量机的核函数、核参数及惩罚因子的选择还没有科学的方法,它们的选择只能根据经验、大量的反复实验进行对比等方法来进行选择,带有很大的局限性。针对这个问题,本文做了初步的研究,实现了在核函数类型确定的前提下,用粒子群优化算法对核参数和惩罚因子的优化,并用基于优选参数值的支持向量机进行语音识别实验,识别率得到了一定的改善和提高。

【Abstract】 Speech recognition is an important aspect of speech signal processing. It is the foundation of human-computer interaction technology and has wide application prospect. It has great theoretical value and practical significance for us to do research on speech recognition.This paper first introduced the basic principle of speech recognition systematically, analyzed the limitation and shortage of current main speech recognition methods, summarized the research foundation of this paper - statistical learning theory and support vector machine method and analyzed the application prospect of support vector machine in speech recognition technology. In order to verify the recognition effect of support vector machine in speech recognition system, this paper constructed four non-specific person and isolated words speech recognition systems which are based on support vector machines of different kernel function respectively and did a lot of simulation experiments. These four kernel function are linear kernel function, radial basis kernel function, three-order polynomial kernel function and sigmoid kernel function. The experimental results show that the recognition results of speech recognition systems which are based on linear kernel support vector machine, radial basis kernel support vector machine and three-order polynomial support vector machine are very good and better than the recognition results that is based on hidden markov models. The running speed of support vector machine is faster than hidden markov models. But the recognition results of speech recognition system based on sigmoid kernel support vector machine are very bad. So the type of kernel functions directly affects the classification performance of support vector machine and accordingly affects the recognition effect of the speech recognition system.Secondly, in order to study the influences of kernel parameter and error penalty parameter on the generalization performance of support vector machine in condition of a fixed kernel function, this paper constructed a non-specific person and isolated words speech recognition system based on support vector machine of radial basis kernel function. In the experiments, three groups of kernel parameter and error penalty parameter values were taken to do speech recognition. The experimental results show that the different values of kernel parameter and error penalty parameter affect the generalization performance of support vector machine and accordingly affect the recognition effect of the speech recognition system.The selection of kernel function type, kernel parameter value and error penalty parameter value directly affects the recognition effect of speech recognition system based on support vector machine. However, there is no scientific method to select these three factors and people select them only according to experience and repeated experiments. There exists great limitation. Aiming at this problem, this paper did preliminary research and proposed a method to do parameter optimization that uses particle swarm optimization algorithm in condition of the kernel function type is fixed. At last this paper constructed a speech recognition system based on the support vector machine whose kernel parameter and error penalty parameter have been optimized and the recognition rates get certain improvement.

节点文献中: 

本文链接的文献网络图示:

本文的引文网络