A Distributional Multi-word Thesaurus in Sketch Engine
| Authors | |
|---|---|
| Year of publication | 2019 |
| Type | Article in Proceedings |
| Conference | Proceedings of the Thirteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2019 |
| MU Faculty or unit | |
| Citation | |
| web | https://nlp.fi.muni.cz/raslan/2019/paper17-jakubicek.pdf |
| Keywords | text corpus; Sketch Engine; MWE; multi-word expressions; thesaurus |
| Description | In this paper we present an extension of the current distribu-tional thesaurus as available in the Sketch Engine corpus managementsystem towards multi-word units. We explain how multi-word sketches areused to generate multi-word unit candidates, thus preserving access to theunderlying corpus texts. Finally we present sample results on the BritishNational Corpus and discuss future development as well as difficulties inevaluation. |
| Related projects: |