节点文献

复合乐音的多基频提取

Multi-pitch Detection from Ployphonic Music

【作者】 刘成芳

【导师】 刘若伦;

【作者基本信息】 山东大学 , 信号与信息处理, 2011, 硕士

【摘要】 随着计算机网络和多媒体技术的发展,基于复合乐音的多基频提取已经成为乐音信号处理中不可或缺的技术。本文首先介绍了多基频提取的基础知识。它们包括听觉模型、听觉滤波器、小波变换、STFT与MDCT变换、自相关及平均幅度差分的求解。其次,描述了本文研究的多基频提取技术:基于听觉模型、二进小波变换近似系数相乘、Specmurt算法的多基频提取。最后,对以上典型技术进行了改进并比较了改进前后的提取效果。基于听觉模型的多基频提取是较为传统的一种方法。它可分为:Anssi Klapuri提出的多通道模型、M.Y.Wu提出的多通道分高低频模型、TeroTolonen提出的基于两通道的多基频提取。在这三种方法中,基于两通道的多基频提取得到了更好的基频分布。基于听觉模型的多基频提取采用了归一化自相关、傅里叶变换求自相关方法、增强自相关的方法来修正自相关对求解周期分布存在的缺陷,其中归一化自相关减少了分帧加窗所带来的影响,傅里叶变换求自相关提高了峰值显著度,而增强自相关则消除了倍数周期。二进小波变换近似系数相乘算法增强了峰值显著度,它使得基频的位置更容易被找到。本文首先采用仿真信号对尺度的选择进行了比较,实验表明选用三个近似系数相乘时效果是最佳的。文中发挥了基于听觉模型的多基频提取的优势,对它进行了改进。将两通道模型和二进小波变换近似系数相乘结合在一起,这种结合提高了提取基频的查准率和查全率相对于听觉模型方法,Specmurt算法提高了提取基频的分辨率,更加合理的利用了音乐的谐波结构。相对于二进小波变换来说,它的干扰频率较少,“野点”的剔除比较容易。Specmurt算法可以通过MDCT、STFT、复小波变换的方法实现。其中MDCT算法主要是针对MP3乐音存储格式。本文对上述三种典型技术利用仿真信号、单基频叠加的复合乐音及MIDI复合乐音进行了仿真实验,并分别从理论分析、实验验证两个方面对比分析了各算法的多基频提取性能。

【Abstract】 With the development of computer networks and multimedia technology, multi-pitch detection from polyphonic music has become the key issue in the music signal processing.Firstly, this paper briefly introduces the fundamental concepts of multi-pitch detection, such as auditory system, auditory filters, wavelet transform, autocorrelation, STFT and MDCT. Secondly, it depicts the typical techniques of multi-pitch detection namely multi-pitch detection using auditory model, detection multi-pitch by Specmurt and detection multi-pitch by dyadic wavelet transform approximate coefficient multiply. Finally, typical techniques are improved and performance before and after modification is compared.Multi-pitch detection using auditory model is a traditional way. it utilizes auditory masking effects. It can be divided into three methods:multi-channel detection putting forward by Klapuri, high and low frequency multi-channel detection by M.Y.Wu, two-channel detection by Tolonen. Two-channel detection obtains better multi-pitch distribution than others. Multi-pitch detection using auditory model solves period distribution in three ways using normalized autocorrelation, autocorrelation by Fourier transform and enhanced autocorrelation which revises the traditional autocorrelation. Normalized autocorrelation reduces the influence of window function. Autocorrelation via Fourier transform enhances the saliency of peaks. While enhanced autocorrelation eliminates multiple periods.Detection multi-pitch by multiplication of approximate coefficients of dyadic wavelet transform enhances the saliency of peaks. Multi-pitch can be found more easily. In this paper, multiplications of different scale are compared by synthesized signal. The result confirms that approximate coefficients of the first three scales work best. Experimental results show that it improves the precision and recall.Compared with multi-pitch detection using auditory model, detection multi-pitch by Specmurt which uses music harmonic structure properly improves the resolution. Compared with wavelet approximate coefficient multiply, it has less interference frequency and is easier to eliminate outliers. Specmurt can be realized by MDCT, STFT, complex wavelet transform.This paper simulates three categories of multi-pitch detection discussed above and assesses their objective performance evaluation. The two objective evaluation indicators are precision and recall.

【关键词】 多基频提取听觉模型Specmurt
【Key words】 Multi-pitch detectionauditory modelSpecmurt
  • 【网络出版投稿人】 山东大学
  • 【网络出版年期】2012年 04期
  • 【分类号】TN912.3
  • 【下载频次】126
节点文献中: 

本文链接的文献网络图示:

本文的引文网络