Automatic Construction of a Morphological Dictionary of Multi-Word Units
2010; Springer Science+Business Media; Linguagem: Inglês
10.1007/978-3-642-14770-8_26
ISSN1611-3349
AutoresCvetana Krstev, Ranka Stanković, Ivan Obradović, Duško Vitas, Miloš Utvić,
Tópico(s)Natural Language Processing Techniques
ResumoThe development of a comprehensive morphological dictionary of multi-word units for Serbian is a very demanding task, due to the complexity of Serbian morphology. Manual production of such a dictionary proved to be extremely time-consuming. In this paper we present a procedure that automatically produces dictionary lemmas for a given list of multi-word units. To accomplish this task the procedure relies on data in e-dictionaries of Serbian simple words, which are already well developed. We also offer an evaluation of the proposed procedure on several different sets of data. Finally, we discuss some implementation issues and present how the same procedure is used for other languages.
Referência(s)