节点文献

婴幼儿语音信息处理与识别研究模型

Research Model of Infant Voice Information Processing and Recognition

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 左正东万光彩杜佳轩

【Author】 ZUO Zheng-dong;WAN Guang-cai;DU Jia-xuan;Faculty of Finance,Anhui University of Finance and Economics;

【通讯作者】 万光彩;

【机构】 安徽财经大学金融学院

【摘要】 本文针对婴儿语音的识别及处理问题,通过Mel尺度倒谱参数(MFCC)等信号分析的参数,对婴儿语音情绪信息的数据采集和预处理过程及相应的特征参数提取方法进行了研究.综合运用了Matlab、Excel、Widi及TT Composer等软件求解,经过综合比较,本文采用了参数和方法,针对辨别婴儿性别以及婴儿身心状态所表达的情绪信息作模式识别研究,并给出了其技术实现方法和实验测试结果,取得了良好的识别效果.对于一男一女唱同一首歌的音频,我们基于性别差异角度对语音信号时域进行特征分析,通过绘制语谱图、能量图、相关函数图等,观察男女声的差异,可以发现语音信号的前面部分性别差异特征,在此基础上再利用MFCC分别得到男性和女性语音的48 110*24 MFCC特征矩阵.通过贝叶斯判别法,将语音进行性别判别归类,再利用该模型对婴儿的声音进行鉴别.

【Abstract】 Aiming at the problem of infant speech recognition and processing,this paper studies the data acquisition and preprocessing process of infant phonetic emotion information and the corresponding method of feature parameter extraction through the parameters of signal analysis such as Mel scale inverted spectrum parameter( MFCC). Based on the comprehensive comparison of Matlab,Excel,Widi and TT Composer,this paper adopts parameters and methods to study the pattern recognition of emotional information expressed in the identification of infant sex and infant’s physical and mental state,and gives its technical realization method and experimental test results. Good recognition results have been obtained. For the audio of a man and a woman singing the same song,we analyze the characteristics of speech signal time domain based on the angle of gender differences,and observe the differences of male and female sound by drawing the spectrum diagram,energy graph,correlation function diagram,etc. And it can be found that the characteristics of gender difference in the front part of speech signal. On this basis,the 48 110* 24 mecc characteristic matrix of male and female phonetics was obtained by using MFCC respectively. By Bayesian discriminant method,the speech was classified by sex discrimination,and the model was used to identify the infant’s voice.

  • 【文献出处】 山西师范大学学报(自然科学版) ,Journal of Shanxi Normal University(Natural Science Edition) , 编辑部邮箱 ,2019年02期
  • 【分类号】TN912.3
  • 【被引频次】1
  • 【下载频次】189
节点文献中: 

本文链接的文献网络图示:

本文的引文网络