Artigo Revisado por pares

Qualitative: Open Source Python Tool for Quality Estimation over Multiple Machine Translation Outputs

2014; De Gruyter Open; Volume: 102; Issue: 1 Linguagem: Inglês

10.2478/pralin-2014-0009

ISSN

1804-0462

Autores

Eleftherios Avramidis, Lukas Poustka, Sven Schmeier,

Tópico(s)

Computational Physics and Python Applications

Resumo

Abstract “Qualitative” is a python toolkit for ranking and selection of sentence-level output by different MT systems using Quality Estimation. The toolkit implements a basic pipeline for annotating the given sentences with black-box features. Consequently, it applies a machine learning mechanism in order to rank data based on models pre-trained on human preferences. The preprocessing pipeline includes support for language models, PCFG parsing, language checking tools and various other pre-processors and feature generators. The code follows the principles of object-oriented programming to allow modularity and extensibility. The tool can operate by processing both batch-files and single sentences. An XML-RPC interface is provided for hooking up with web-services and a graphical animated web-based interface demonstrates its potential on-line use.

Referência(s)