Artigo Acesso aberto Revisado por pares

A Study of Acoustic Features for Emotional Speaker Recognition in I-Vector Representation

2015; Technical University of Košice; Volume: 15; Issue: 2 Linguagem: Inglês

10.15546/aeei-2015-0011

ISSN

1338-3957

Autores

Lenka Macková, Anton Čižmár, Jozef Juhár,

Tópico(s)

Music and Audio Processing

Resumo

Recently recognition of emotions became very important in the field of speech and/or speaker recognition.This paper is dedicated to experimental investigation of best acoustic features obtained for purpose of gender-dependent speaker recognition from emotional speech.Four feature sets -LPC (Linear Prediction Coefficients), LPCC (Linear Prediction Cepstral Coefficients), MFCC (Melfrequency Cepstral Coefficients) and PLP (Perceptual linear prediction) coefficients -were compared in an experimental setup of speaker recognition system, based on i-vector representation.For evaluation of the system emotional speech recordings from newly created Slovak emotional database and Mahalanobis distance metric as scoring method were used.The results of the experiment showed the MFCC representation as the best fitted for speaker verification from Slovak emotional speech with recognition rate higher than 80%.

Referência(s)