提出了一种基于能量谱包络非负矩阵分解的钢琴多音符估计算法.首先对钢琴88个单音片段进行RTFI时频分析,求得对应平均能量谱,经过时序平均、归一化求得平均能量谱包络,拼接成钢琴的单音能量谱包络基矩阵.之后对测试的多音片段,采用同样处理方法求得多音平均能量谱包络,通过非负矩阵分解求得各音符的权重系数,最后通过阈值限定求得多音符估计结果.性能评估实验基于MAPS数据集的UCHO集和RAND集展开,与MIREX中最好的钢琴音乐自动记谱系统相比,本文提出的钢琴多音符估计算法性能有很大幅度的提升.
A multiple pitch estimation (MPE) algorithm for piano music was proposed here based on non -negative matrix factorization (NMF) of energy spectrum envelope. Firstly, the average energy spectrums (AES) of 88 piano notes fragments are calculated using Resonator Time - Frequency Image (RTFI). Then the average energy envelopes ( AEE ) are obtained by AES normalization across time. After that, the AEEs are combined to form the average energy envelope basis (AEEB) , which is then used for the NMF of poly- phonic average energy envelope (PAEE). The weight coefficients of piano notes are calculated from the NMF. Finally, the estimation results are obtained by threshold limitation. Performance evaluation experiments were carried out on UCHO and RAND subsets of MAPS database. Compared with the MPE algorithm used in the best AMT system in MIREX, our proposed one outperforms with better performance.