J2-9180 — Annual report 2008
1.
Machine learning of lemmatisers

Lemmatisation is one of the basic language technology components. In this paper we present a supervised machine learning method that learns lemmatisation models from morphological lexica. We show its advantages over previously developed methods.

COBISS.SI-ID: 2159338
2.
Morphosyntactic tagging of Slovene language with a meta-tagger

Morphosyntactic tagging is one of the basic language technology components. In this paper we introduce a method that enables increasing the accuracy of morphosyntactic tagging by combining the outputs of multiple taggers.

COBISS.SI-ID: 22416423