Propria (příjmení na -č) - problém automatické morfologické analýzy

Title in English Propria (Family Names on -č) - the Problem of the Automatic Morphological Analysis
Authors

OSOLSOBĚ Klára

Year of publication 2008
Type Article in Proceedings
Conference Jazyk a jeho proměny
MU Faculty or unit

Faculty of Arts

Citation
Field Linguistics
Keywords corpus; proprium; family name; automatic morphological analysis
Description The aim of this paper is to demonstrate how data mined from corpora can be used to prepare the linguistic basis for NLP (natural language processing) applications. Family names (animate masculine nouns ending in -č) were identified in three representative corpora of literary Czech (SYN2000, SYN2005, SYN2006PUB). The possibility that these names are verbally motivated was then analyzed. In this way, the list of possible overgenerations arising from the application of formal word-formation rules (Osolsobě 2008) was enlarged.