KLEJ: Comprehensive Benchmark for Polish Language Understanding

Piotr Rybak, Robert Mroczkowski, Janusz Tracz, Ireneusz Gawlik


Abstract
In recent years, a series of Transformer-based models unlocked major improvements in general natural language understanding (NLU) tasks. Such a fast pace of research would not be possible without general NLU benchmarks, which allow for a fair comparison of the proposed methods. However, such benchmarks are available only for a handful of languages. To alleviate this issue, we introduce a comprehensive multi-task benchmark for the Polish language understanding, accompanied by an online leaderboard. It consists of a diverse set of tasks, adopted from existing datasets for named entity recognition, question-answering, textual entailment, and others. We also introduce a new sentiment analysis task for the e-commerce domain, named Allegro Reviews (AR). To ensure a common evaluation scheme and promote models that generalize to different NLU tasks, the benchmark includes datasets from varying domains and applications. Additionally, we release HerBERT, a Transformer-based model trained specifically for the Polish language, which has the best average performance and obtains the best results for three out of nine tasks. Finally, we provide an extensive evaluation, including several standard baselines and recently proposed, multilingual Transformer-based models.
Anthology ID:
2020.acl-main.111
Volume:
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Month:
July
Year:
2020
Address:
Online
Editors:
Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1191–1201
Language:
URL:
https://aclanthology.org/2020.acl-main.111
DOI:
10.18653/v1/2020.acl-main.111
Bibkey:
Cite (ACL):
Piotr Rybak, Robert Mroczkowski, Janusz Tracz, and Ireneusz Gawlik. 2020. KLEJ: Comprehensive Benchmark for Polish Language Understanding. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1191–1201, Online. Association for Computational Linguistics.
Cite (Informal):
KLEJ: Comprehensive Benchmark for Polish Language Understanding (Rybak et al., ACL 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.acl-main.111.pdf
Video:
 http://slideslive.com/38929260
Code
 allegro/klejbenchmark-baselines
Data
Allegro ReviewsKLEJGLUEOpenSubtitlesPSCPolEmo 2.0SentEval