节点文献
面向空中交通管制的时频域语音增强技术研究
Research on Time-Frequency Domain Speech Enhancement Techniques for Air Traffic Control
【摘要】 本研究旨在通过语音增强技术解决空中交通管制通话中的语音干扰问题。通过结合频域降噪和时域增强方法,提出了改进的U-Net模型实现了对管制语音的有效降噪处理。采用SNR(Signal-to-Noise Ration)、MOS(Mean Opinion Score)来直接评估降噪效果。实验结果显示,与基线U-Net模型相比,改进模型的SNR值提升了4.566 3,达到了7.386 1。鉴于在实际ATC工作环境中难以准确计算SNR,采用了间接评估方法,通过语音识别系统的识别结果来间接衡量模型在真实ATC环境下音频的降噪效果。实验结果表明,经过语音增强处理后的测试音频在语音识别系统中平均字错率降低了1.79%,句错率降低了3%,改进后的模型能有效改善话音质量提升语音识别系统的识别准确率。
【Abstract】 This study aims to solve the problem of voice interference in air traffic control communications using voice enhancement technology.By combining frequency domain noise reduction with time domain enhancement methods, this paper proposes an improved U-Net model for effective noise reduction in control voice communications.The noise reduction effectiveness is directly evaluated using SNR(Signal-to-Noise Ratio) and MOS(Mean Opinion Score).Experimental results show that the SNR value of the improved model increased by 4.566 3 over the baseline U-Net model, reaching 7.386 1.Given the difficulty of accurately calculating SNR in real ATC environments, this paper employs an indirect evaluation method, using the results of a speech recognition system to measure the model′s noise reduction effectiveness in actual ATC scenarios.The experimental results indicate that the test audio, after undergoing voice enhancement processing, show a reduction in the average word error rate by 1.79% and in the sentence error rate by 3% within the speech recognition system.The improved model effectively enhances voice quality and increases the accuracy of the speech recognition system.
- 【文献出处】 航空计算技术 ,Aeronautical Computing Technique , 编辑部邮箱 ,2024年03期
- 【分类号】V355.1;TN912.35
- 【下载频次】24