CorpusNÓS: A massive Galician corpus for training large language models Iria de-Dios-Flores author Silvia Paniagua Suárez author Cristina Carbajal Pérez author Daniel Bardanca Outeiriño author Marcos Garcia author Pablo Gamallo author 2024-03 text Proceedings of the 16th International Conference on Computational Processing of Portuguese - Vol. 1 Pablo Gamallo editor Daniela Claro editor António Teixeira editor Livy Real editor Marcos Garcia editor Hugo Gonçalo Oliveira editor Raquel Amaro editor Association for Computational Lingustics Santiago de Compostela, Galicia/Spain conference publication de-dios-flores-etal-2024-corpusnos https://aclanthology.org/2024.propor-1.66/ 2024-03 593 599