Capítulo de livro Revisado por pares

Objective Comparison of Four GMM-Based Methods for PMA-to-Speech Conversion

2016; Springer Science+Business Media; Linguagem: Inglês

10.1007/978-3-319-49169-1_3

ISSN

1611-3349

Autores

Daniel Erro, Inma Hernáez, Luís Serrano, Ibon Saratxaga, Eva Navas,

Tópico(s)

Music and Audio Processing

Resumo

In silent speech interfaces a mapping is established between biosignals captured by sensors and acoustic characteristics of speech. Recent works have shown the feasibility of a silent interface based on permanent magnet-articulography (PMA). This paper studies the performance of four different mapping methods based on Gaussian mixture models (GMMs), typical from the voice conversion field, when applied to PMA-to-spectrum conversion. The results show the superiority of methods based on maximum likelihood parameter generation (MLPG), especially when the parameters of the mapping function are trained by minimizing the generation error. Informal listening tests reveal that the resulting speech is moderately intelligible for the database under study.

Referência(s)