Artigo Acesso aberto Revisado por pares

Building a Bidirectional English-Vietnamese Statistical Machine Translation System by Using MOSES

2016; Volume: 8; Issue: 2 Linguagem: Inglês

10.17706/ijcee.2016.8.2.161-168

ISSN

1793-8163

Autores

Quang-Phuoc Nguyen, Ying-Xiu Quan, Cheol-Young Ock,

Tópico(s)

Handwritten Text Recognition Techniques

Resumo

In this paper, we describe an attempt at constructing a bidirectional English-Vietnamese statistical machine translation system using MOSES, an open-source toolkit for statistical machine translation.The quality of a statistical machine translation system depends on the bilingual sentence alignment data called parallel corpus.However, Vietnamese is an under-resourced language that is largely left out of initial corpus building efforts.Therefore, we concentrate on building Vietnamese corpora which consist over 880,000 English-Vietnamese sentence pairs and over 11,000,000 Vietnamese monolingual sentences to train the statistical translation model and the language model respectively.According to the obtained BLEU scores, the proposed system outperforms the Google translator and the Microsoft Bing translator in both English to Vietnamese and Vietnamese to English translating.

Referência(s)