Multilingual Comparative Analysis of Deep-Learning Dependency Parsing Results Using Parallel Corpora

Diego Alves, Marko Tadić, Božo Bekavac


Abstract
This article presents a comparative analysis of dependency parsing results for a set of 16 languages, coming from a large variety of linguistic families and genera, whose parallel corpora were used to train a deep-learning tool. Results are analyzed in comparison to an innovative way of classifying languages concerning the head directionality parameter used to perform a quantitative syntactic typological classification of languages. It has been shown that, despite using parallel corpora, there is a large discrepancy in terms of LAS results. The obtained results show that this heterogeneity is mainly due to differences in the syntactic structure of the selected languages, where Indo-European ones, especially Romance languages, have the best scores. It has been observed that the differences in the size of the representation of each language in the language model used by the deep-learning tool also play a major role in the dependency parsing efficacy. Other factors, such as the number of dependency parsing labels may also have an influence on results with more complex labeling systems such as the Polish language.
Anthology ID:
2022.bucc-1.5
Volume:
Proceedings of the BUCC Workshop within LREC 2022
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff
Venue:
BUCC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
33–42
Language:
URL:
https://aclanthology.org/2022.bucc-1.5
DOI:
Bibkey:
Cite (ACL):
Diego Alves, Marko Tadić, and Božo Bekavac. 2022. Multilingual Comparative Analysis of Deep-Learning Dependency Parsing Results Using Parallel Corpora. In Proceedings of the BUCC Workshop within LREC 2022, pages 33–42, Marseille, France. European Language Resources Association.
Cite (Informal):
Multilingual Comparative Analysis of Deep-Learning Dependency Parsing Results Using Parallel Corpora (Alves et al., BUCC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.bucc-1.5.pdf
Data
Universal Dependencies