Research on Pruning Algorithms for Speech Recognition (语音识别剪枝算法研究)

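【Author】 Lu Yao (陆遥)

【Advisor】 Liu Gang (刘刚)

【Author Information】 Beijing University of Posts and Telecommunications, Information and Signal Processing, 2012, Master's thesis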

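【Abstract (translated from the Chinese)】 Speech is the most effective means of interaction between people. Replacing traditional keyboard and touch-screen input with speech input makes operation more convenient for users, and applying speech technology to human-computer interaction has long been a frontier research topic. With the rapid development of mobile terminals and the mobile Internet, speech recognition on mobile terminals has become a new research focus. Compared with relatively mature speech recognition technology, recognition on mobile terminals faces more technical challenges. First, the computing power of mobile terminals is limited, so recognizers built on traditional algorithms cannot meet the real-time requirements of speech input. Second, high-accuracy speech recognition needs a language model with strong descriptive power, typically a statistical n-gram model, and the large storage space this model requires makes it difficult to deploy fully on mobile terminals. Current research on mobile-terminal recognition therefore concentrates on two problems: compressing the recognizer's computation space and accelerating its decoding. To address these problems, this thesis proposes a pruning algorithm based on the user's location information, which effectively improves recognizer performance. The main work of the thesis is as follows:

1. Pruning algorithm for finite-state grammar recognition systems. Isolated-word recognition systems and fixed-sentence-pattern recognition systems are the foundation of speech recognition systems. This thesis first studies pruning on these simple recognition networks, proposes an algorithm that prunes the recognition network according to the user's location information, and demonstrates its effectiveness through extensive experiments.

2. Pruning algorithm for continuous speech recognition systems. Building on the study of finite-state recognition network pruning, this thesis proposes an algorithm that prunes during decoding by means of the language model look-ahead tree structure. The algorithm prunes the decoder's search space according to the user's location information. Comparative experiments show that it effectively improves the recognizer's accuracy and speed.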

【Abstract】 Speech is the most efficient means of human communication. Because speech input is more convenient for users than traditional keyboard or touch-screen input, applying speech recognition technology to human-computer interaction has long been a cutting-edge research topic. In addition, with the rapid development of mobile terminals and the mobile Internet, speech recognition on mobile terminals has become a new research hotspot. Compared with mature speech recognition technology, mobile-terminal speech recognition faces more technical challenges, mainly because the hardware of mobile terminals is limited. This causes two problems. First, real-time performance requirements cannot be met when traditional recognition technology is used as the core algorithm of a speech input system. Second, a high-accuracy speech recognition system requires language model support, generally a statistical n-gram model, whose large storage requirement makes it difficult to use in mobile terminal applications. Current research on mobile-terminal speech recognition therefore focuses on speeding up computation and compressing storage. To address these problems, we propose and implement an effective location-based pruning algorithm. The main work is as follows:

1. Pruning algorithm for finite-state grammar recognition systems. We first studied isolated-word recognition systems and fixed-sentence recognition systems, which are the basic speech recognition systems and have simpler recognition networks than large-vocabulary continuous speech recognition systems. We propose an algorithm that prunes the network using the user's location information, and we demonstrate its effectiveness through a large number of experiments.

2. Pruning algorithm for continuous speech recognition systems. Building on the research into pruning finite-state recognition networks, we propose an algorithm that uses the language model look-ahead tree structure and the user's location information to solve the same problem in continuous speech recognition systems. Comparative experiments across different systems show that the algorithm effectively improves recognition accuracy and speed.
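
As an illustration only, the idea of pruning a recognition network by user location can be sketched on a toy pronunciation prefix tree. The abstract does not describe the thesis's actual data structures or pruning criteria, so everything below (the `TreeNode` class, the `word_locations` table, the place names, and the phone sequences) is a hypothetical assumption and not the author's implementation.

```python
from dataclasses import dataclass, field


@dataclass
class TreeNode:
    """Node of a hypothetical prefix tree over word pronunciations."""
    children: dict = field(default_factory=dict)  # phone -> TreeNode
    words: list = field(default_factory=list)     # words ending at this node


def build_tree(lexicon):
    """Build a prefix tree from a {word: [phones]} lexicon."""
    root = TreeNode()
    for word, phones in lexicon.items():
        node = root
        for phone in phones:
            node = node.children.setdefault(phone, TreeNode())
        node.words.append(word)
    return root


def prune_by_location(node, allowed_words):
    """Drop words outside `allowed_words` and remove branches left empty.

    Returns True if the subtree rooted at `node` still contains a word.
    """
    node.words = [w for w in node.words if w in allowed_words]
    node.children = {
        phone: child
        for phone, child in node.children.items()
        if prune_by_location(child, allowed_words)
    }
    return bool(node.words or node.children)


def count_nodes(node):
    """Count all nodes in the tree, including the root."""
    return 1 + sum(count_nodes(c) for c in node.children.values())


# Toy data (assumed): place names, rough phone sequences, and the
# locations in which each word is considered relevant.
lexicon = {
    "xidan": ["x", "i", "d", "a", "n"],
    "xizhimen": ["x", "i", "zh", "i", "m", "e", "n"],
    "waitan": ["w", "ai", "t", "a", "n"],
}
word_locations = {
    "xidan": {"beijing"},
    "xizhimen": {"beijing"},
    "waitan": {"shanghai"},
}

user_location = "beijing"
allowed = {w for w, locs in word_locations.items() if user_location in locs}

root = build_tree(lexicon)
nodes_before = count_nodes(root)
prune_by_location(root, allowed)
nodes_after = count_nodes(root)
```

In this sketch the branch for the word that is irrelevant at the user's location disappears, so a decoder walking the tree would have fewer paths to expand. A real system would presumably also need to renormalize language model scores over the remaining words, which is not shown here.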
