Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language Technologies Anaelia Ovalle author Ninareh Mehrabi author Palash Goyal author Jwala Dhamala author Kai-Wei Chang author Richard Zemel author Aram Galstyan author Yuval Pinter author Rahul Gupta author 2024-06 text Findings of the Association for Computational Linguistics: NAACL 2024 Kevin Duh editor Helena Gomez editor Steven Bethard editor Association for Computational Linguistics Mexico City, Mexico conference publication ovalle-etal-2024-tokenization 10.18653/v1/2024.findings-naacl.113 https://aclanthology.org/2024.findings-naacl.113/ 2024-06 1739 1756