Artigo Produção Nacional Revisado por pares

Is masking a relevant aspect lacking in MFCC? A speaker verification perspective

2012; Elsevier BV; Volume: 33; Issue: 16 Linguagem: Inglês

10.1016/j.patrec.2012.07.023

ISSN

1872-7344

Autores

Jugurta Montalvão, Marcos Renato Rodrigues Araujo,

Tópico(s)

Music and Audio Processing

Resumo

We hypothesize that spectral masking may account for most of the gains in robustness against noise using ensemble interval histogram (EIH) and zero crossing with peak amplitude (ZCPA) compared to Mel-frequency cepstral coefficients (MFCCs). To test this hypothesis, we focus on this issue by comparing two MFCC implementations for which the only difference is spectral masking. The comparison involved biometric speaker verification tasks using two publicly available databases. The results confirm the superiority of MFCC with masking, thus corroborating our hypotheses that masking is a key aspect for improved robustness in feature extraction.

Referência(s)
Altmetric
PlumX