Using MT for multilingual covid-19 case load prediction from social media texts

Maja Popovic, Vasudevan Nedumpozhimana, Meegan Gower, Sneha Rautmare, Nishtha Jain, John Kelleher


Abstract
In the context of an epidemiological study involving multilingual social media, this paper reports on the ability of machine translation systems to preserve content relevant for a document classification task designed to determine whether the social media text is related to covid. The results indicate that machine translation does provide a feasible basis for scaling epidemiological social media surveillance to multiple languages. Moreover, a qualitative error analysis revealed that the majority of classification errors are not caused by MT errors.
Anthology ID:
2023.eamt-1.45
Volume:
Proceedings of the 24th Annual Conference of the European Association for Machine Translation
Month:
June
Year:
2023
Address:
Tampere, Finland
Editors:
Mary Nurminen, Judith Brenner, Maarit Koponen, Sirkku Latomaa, Mikhail Mikhailov, Frederike Schierl, Tharindu Ranasinghe, Eva Vanmassenhove, Sergi Alvarez Vidal, Nora Aranberri, Mara Nunziatini, Carla Parra Escartín, Mikel Forcada, Maja Popovic, Carolina Scarton, Helena Moniz
Venue:
EAMT
SIG:
Publisher:
European Association for Machine Translation
Note:
Pages:
461–470
Language:
URL:
https://aclanthology.org/2023.eamt-1.45
DOI:
Bibkey:
Cite (ACL):
Maja Popovic, Vasudevan Nedumpozhimana, Meegan Gower, Sneha Rautmare, Nishtha Jain, and John Kelleher. 2023. Using MT for multilingual covid-19 case load prediction from social media texts. In Proceedings of the 24th Annual Conference of the European Association for Machine Translation, pages 461–470, Tampere, Finland. European Association for Machine Translation.
Cite (Informal):
Using MT for multilingual covid-19 case load prediction from social media texts (Popovic et al., EAMT 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.eamt-1.45.pdf