Statistical and Neural Machine Translation for Manipuri-English on Intelligence Domain
2021; Springer Science+Business Media; Language: English
DOI: 10.1007/978-981-33-6987-0_21
ISSN: 1876-1119
Authors: Laishram Rahul, Loitongbam Sanayai Meetei, H. S. Jayanna
Topic(s): Translation Studies and Practices
Abstract: This paper describes the development and results of a Manipuri-English machine translation system built for the intelligence domain. Manipuri is an under-resourced Tibeto-Burman language spoken mainly in the North-Eastern states of India. A total of 56,678 Manipuri-English parallel corpus entries from the intelligence domain, based on open-source intelligence (OSINT) keywords and phrases, are collected for the experiment. Statistical machine translation (SMT) and neural machine translation (NMT) are evaluated in terms of BLEU score. The SMT-based approach achieves a BLEU score of 23.91 and is outperformed by the NMT-based system, which achieves a BLEU score of 40.67. Further, a language-specific morphological analysis based on suffixes is investigated. Incorporating the morphological analysis yields a BLEU score of 25.03 with SMT and 44 with NMT, both of which are reported as significant improvements.
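The size of the reported improvements can be read directly off the four BLEU scores in the abstract. The following is a minimal sketch of that arithmetic only; the names `reported` and `bleu_gain` are illustrative and do not come from the paper, and the NMT score given as "44" is taken as 44.0.

```python
# BLEU scores as reported in the abstract: baseline system vs. the same
# system with suffix-based morphological analysis. "44" is taken as 44.0.
reported = {
    "SMT": {"baseline": 23.91, "morph": 25.03},
    "NMT": {"baseline": 40.67, "morph": 44.0},
}


def bleu_gain(baseline, improved):
    """Return (absolute gain in BLEU points, relative gain in percent)."""
    absolute = improved - baseline
    relative = 100.0 * absolute / baseline
    return round(absolute, 2), round(relative, 1)


# Hypothetical helper output: one (absolute, relative) pair per system.
gains = {name: bleu_gain(s["baseline"], s["morph"]) for name, s in reported.items()}

for name, (abs_gain, rel_gain) in gains.items():
    print(f"{name}: +{abs_gain} BLEU ({rel_gain}% relative)")
```

Under these figures, morphological analysis adds about 1.12 BLEU points for SMT and 3.33 for NMT. Whether those differences are statistically significant cannot be determined from the abstract alone.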