Resource-based WordNet Augmentation and Enrichment

Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, Cvetana Krstev


Abstract
In this paper we present an approach to support production of synsets for Serbian WordNet (SerWN) by adjusting Princeton WordNet (PWN) synsets using several bilingual English-Serbian resources. PWN synset definitions were automatically translated and post-edited, if needed, while candidate literals for Serbian synsets were obtained automatically from a list of translational equivalents compiled form bilingual resources. Preliminary results obtained from a set of 1248 selected PWN synsets show that the produced Serbian synsets contain 4024 literals, out of which 2278 were offered by the system we present in this paper, whereas experts added the remaining 1746. Approximately one half of synset definitions obtained automatically were accepted with no or minor corrections. These first results are encouraging, since the efficiency of synset production for SerWN was increased. There is also space for further improvement of this approach to wordnet enrichment.
Anthology ID:
2018.clib-1.14
Volume:
Proceedings of the Third International Conference on Computational Linguistics in Bulgaria (CLIB 2018)
Month:
May
Year:
2018
Address:
Sofia, Bulgaria
Venue:
CLIB
SIG:
Publisher:
Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences
Note:
Pages:
104–114
Language:
URL:
https://aclanthology.org/2018.clib-1.14/
DOI:
Bibkey:
Cite (ACL):
Ranka Stanković, Miljana Mladenović, Ivan Obradović, Marko Vitas, and Cvetana Krstev. 2018. Resource-based WordNet Augmentation and Enrichment. In Proceedings of the Third International Conference on Computational Linguistics in Bulgaria (CLIB 2018), pages 104–114, Sofia, Bulgaria. Department of Computational Linguistics, Institute for Bulgarian Language, Bulgarian Academy of Sciences.
Cite (Informal):
Resource-based WordNet Augmentation and Enrichment (Stanković et al., CLIB 2018)
Copy Citation:
PDF:
https://aclanthology.org/2018.clib-1.14.pdf