Objective Comparison of Four GMM-Based Methods for PMA-to-Speech Conversion
2016; Springer Science+Business Media; Linguagem: Inglês
10.1007/978-3-319-49169-1_3
ISSN1611-3349
AutoresDaniel Erro, Inma Hernáez, Luís Serrano, Ibon Saratxaga, Eva Navas,
Tópico(s)Music and Audio Processing
ResumoIn silent speech interfaces a mapping is established between biosignals captured by sensors and acoustic characteristics of speech. Recent works have shown the feasibility of a silent interface based on permanent magnet-articulography (PMA). This paper studies the performance of four different mapping methods based on Gaussian mixture models (GMMs), typical from the voice conversion field, when applied to PMA-to-spectrum conversion. The results show the superiority of methods based on maximum likelihood parameter generation (MLPG), especially when the parameters of the mapping function are trained by minimizing the generation error. Informal listening tests reveal that the resulting speech is moderately intelligible for the database under study.
Referência(s)