An Empirical Study on Language Model Adaptation Using a Metric of Domain Similarity

Capítulo de livro Acesso aberto Revisado por pares

An Empirical Study on Language Model Adaptation Using a Metric of Domain Similarity

2005; Springer Science+Business Media; Linguagem: Inglês

10.1007/11562214_83

ISSN

1611-3349

Autores

Wei Yuan, Jianfeng Gao, Hisami Suzuki,

Tópico(s)

Speech Recognition and Synthesis

Resumo

This paper presents an empirical study on four techniques of language model adaptation, including a maximum a posteriori (MAP) method and three discriminative training models, in the application of Japanese Kana-Kanji conversion. We compare the performance of these methods from various angles by adapting the baseline model to four adaptation domains. In particular, we attempt to interpret the results given in terms of the character error rate (CER) by correlating them with the characteristics of the adaptation domain measured using the information-theoretic notion of cross entropy. We show that such a metric correlates well with the CER performance of the adaptation methods, and also show that the discriminative methods are not only superior to a MAP-based method in terms of achieving larger CER reduction, but are also more robust against the similarity of background and adaptation domains.

Ver no editor

Altmetric

PlumX

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação

An Empirical Study on Language Model Adaptation Using a Metric of Domain Similarity