Evaluating RBMT output for -ing forms: A study of four tar-get languages

Artigo Acesso aberto Revisado por pares

Evaluating RBMT output for -ing forms: A study of four tar-get languages

2021; Volume: 8; Linguagem: Inglês

10.52034/lanstts.v8i.247

ISSN

2295-5739

Autores

Nora Aranberri-Monasterio, Sharon O’Brien,

Tópico(s)

Speech and dialogue systems

Resumo

-ing forms in English are reported to be problematic for Machine Transla-tion and are often the focus of rules in Controlled Language rule sets. We investigated how problematic -ing forms are for an RBMT system, translat-ing into four target languages in the IT domain. Constituent-based human evaluation was used and the results showed that, in general, -ing forms do not deserve their bad reputation. A comparison with the results of five automated MT evaluation metrics showed promising correlations. Some issues prevail, however, and can vary from target language to target lan-guage. We propose different strategies for dealing with these problems, such as Controlled Language rules, semi-automatic post-editing, source text tagging and “post-editing” the source text.

Ver no editor

Altmetric

PlumX

Entrar

Lembrar minha senha

Receber meu e-mail de confirmação

Evaluating RBMT output for -ing forms: A study of four tar-get languages