Building a Bidirectional English-Vietnamese Statistical Machine Translation System by Using MOSES
2016; Volume: 8; Issue: 2 Linguagem: Inglês
10.17706/ijcee.2016.8.2.161-168
ISSN1793-8163
AutoresQuang-Phuoc Nguyen, Ying-Xiu Quan, Cheol-Young Ock,
Tópico(s)Handwritten Text Recognition Techniques
ResumoIn this paper, we describe an attempt at constructing a bidirectional English-Vietnamese statistical machine translation system using MOSES, an open-source toolkit for statistical machine translation.The quality of a statistical machine translation system depends on the bilingual sentence alignment data called parallel corpus.However, Vietnamese is an under-resourced language that is largely left out of initial corpus building efforts.Therefore, we concentrate on building Vietnamese corpora which consist over 880,000 English-Vietnamese sentence pairs and over 11,000,000 Vietnamese monolingual sentences to train the statistical translation model and the language model respectively.According to the obtained BLEU scores, the proposed system outperforms the Google translator and the Microsoft Bing translator in both English to Vietnamese and Vietnamese to English translating.
Referência(s)