Twitter Data Augmentation for Monitoring Public Opinion on COVID-19 Intervention Measures

Lin Miao, Mark Last, Marina Litvak


Abstract
The COVID-19 outbreak is an ongoing worldwide pandemic that was announced as a global health crisis in March 2020. Due to the enormous challenges and high stakes of this pandemic, governments have implemented a wide range of policies aimed at containing the spread of the virus and its negative effect on multiple aspects of our life. Public responses to various intervention measures imposed over time can be explored by analyzing the social media. Due to the shortage of available labeled data for this new and evolving domain, we apply data distillation methodology to labeled datasets from related tasks and a very small manually labeled dataset. Our experimental results show that data distillation outperforms other data augmentation methods on our task.
Anthology ID:
2020.nlpcovid19-2.19
Volume:
Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020
Month:
December
Year:
2020
Address:
Online
Editors:
Karin Verspoor, Kevin Bretonnel Cohen, Michael Conway, Berry de Bruijn, Mark Dredze, Rada Mihalcea, Byron Wallace
Venue:
NLP-COVID19
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
Language:
URL:
https://aclanthology.org/2020.nlpcovid19-2.19
DOI:
10.18653/v1/2020.nlpcovid19-2.19
Bibkey:
Cite (ACL):
Lin Miao, Mark Last, and Marina Litvak. 2020. Twitter Data Augmentation for Monitoring Public Opinion on COVID-19 Intervention Measures. In Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020, Online. Association for Computational Linguistics.
Cite (Informal):
Twitter Data Augmentation for Monitoring Public Opinion on COVID-19 Intervention Measures (Miao et al., NLP-COVID19 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.nlpcovid19-2.19.pdf
Video:
 https://slideslive.com/38939859