Capítulo de livro Revisado por pares

Automatic Construction of a Morphological Dictionary of Multi-Word Units

2010; Springer Science+Business Media; Linguagem: Inglês

10.1007/978-3-642-14770-8_26

ISSN

1611-3349

Autores

Cvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić,

Tópico(s)

Natural Language Processing Techniques

Resumo

The development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation of the proposed procedure on several different sets of data. Finally, we discuss some implementation issues and present how the same procedure is used for other languages.

Referência(s)