MIN_PT: An European Portuguese Lexicon for Minorities Related Terms

Paula Fortuna, Vanessa Cortez, Miguel Sozinho Ramalho, Laura Pérez-Mayos


Abstract
Hate speech-related lexicons have been proved to be useful for many tasks such as data collection and classification. However, existing Portuguese lexicons do not distinguish between European and Brazilian Portuguese, and do not include neutral terms that are potentially useful to detect a broader spectrum of content referring to minorities. In this work, we present MIN_PT, a new European Portuguese Lexicon for Minorities-Related Terms specifically designed to tackle the limitations of existing resources. We describe the data collection and annotation process, discuss the limitation and ethical concerns, and prove the utility of the resource by applying it to a use case for the Portuguese 2021 presidential elections.
Anthology ID:
2021.woah-1.8
Volume:
Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021)
Month:
August
Year:
2021
Address:
Online
Editors:
Aida Mostafazadeh Davani, Douwe Kiela, Mathias Lambert, Bertie Vidgen, Vinodkumar Prabhakaran, Zeerak Waseem
Venue:
WOAH
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
76–80
Language:
URL:
https://aclanthology.org/2021.woah-1.8
DOI:
10.18653/v1/2021.woah-1.8
Bibkey:
Cite (ACL):
Paula Fortuna, Vanessa Cortez, Miguel Sozinho Ramalho, and Laura Pérez-Mayos. 2021. MIN_PT: An European Portuguese Lexicon for Minorities Related Terms. In Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021), pages 76–80, Online. Association for Computational Linguistics.
Cite (Informal):
MIN_PT: An European Portuguese Lexicon for Minorities Related Terms (Fortuna et al., WOAH 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.woah-1.8.pdf
Video:
 https://aclanthology.org/2021.woah-1.8.mp4