When Word Pairs Matter - Analysis of the English-Slovak Evaluation Dataset
| Authors | |
|---|---|
| Year of publication | 2021 |
| Type | Article in proceedings |
| Conference | Proceedings of the Fifteenth Workshop on Recent Advances in Slavonic Natural Languages Processing, RASLAN 2021 |
| MU Faculty or unit | |
| Citation | |
| www | |
| Keywords | Cross-lingual word embeddings; Ground truth dictionary; Evaluation; English; Slovak |
| Description | Cross-lingual word embeddings facilitate the transfer of lexical knowledge across languages, and they are mainly used for finding translation equivalents. Translation equivalents obtained in this way are usually evaluated with the help of ground truth dictionaries. However, the evaluation process, including the ground truth dictionaries, differs from model to model, impeding the correct interpretation of the results. Therefore, in this paper, we provide a thorough analysis of the English-Slovak ground truth dictionary and employ our analysis in evaluating two cross-lingual word embedding models. We show that the choice of word pairs is an important factor in accurately reflecting the model’s performance. |
| Related projects: |