A New Massive Multilingual Dataset for High-Performance Language Technologies Ona de Gibert author Graeme Nail author Nikolay Arefyev author Marta Bañón author Jelmer van der Linde author Shaoxiong Ji author Jaume Zaragoza-Bernabeu author Mikko Aulamo author Gema Ramírez-Sánchez author Andrey Kutuzov author Sampo Pyysalo author Stephan Oepen author Jörg Tiedemann author 2024-05 text Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) Nicoletta Calzolari editor Min-Yen Kan editor Veronique Hoste editor Alessandro Lenci editor Sakriani Sakti editor Nianwen Xue editor ELRA and ICCL Torino, Italia conference publication de-gibert-etal-2024-new https://aclanthology.org/2024.lrec-main.100/ 2024-05 1116 1128