
Is masking a relevant aspect lacking in MFCC? A speaker verification perspective
2012; Elsevier BV; Volume: 33; Issue: 16 Linguagem: Inglês
10.1016/j.patrec.2012.07.023
ISSN1872-7344
AutoresJugurta Montalvão, Marcos Renato Rodrigues Araujo,
Tópico(s)Music and Audio Processing
ResumoWe hypothesize that spectral masking may account for most of the gains in robustness against noise using ensemble interval histogram (EIH) and zero crossing with peak amplitude (ZCPA) compared to Mel-frequency cepstral coefficients (MFCCs). To test this hypothesis, we focus on this issue by comparing two MFCC implementations for which the only difference is spectral masking. The comparison involved biometric speaker verification tasks using two publicly available databases. The results confirm the superiority of MFCC with masking, thus corroborating our hypotheses that masking is a key aspect for improved robustness in feature extraction.
Referência(s)