Detecting Annotation Errors in a Corpus by Induction of Syntactic Patterns
| Authors | |
|---|---|
| Year of publication | 2003 |
| Type | Article in Proceedings |
| Conference | Text, Speech and Dialogue: Sixth International Conference, TSD 2003 |
| MU Faculty or unit | |
| Citation | |
| Field | Informatics |
| Keywords | error detection; morphological tagging; relational rule induction; syntactic patterns |
| Description | This paper brings a new method for acquisition of syntactic patterns capable of detecting errors in annotated corpora. These patterns are acquired semi-automatically, by means of an inductive logic programming (relational data mining) system followed by a human expert supervision. The patterns acquired have been used for automatic detection and subsequent manual correction of the annotation errors found in DESAM, a morphologically annotated corpus of written Czech. |
| Related projects: |