MOTIF: Contextualized Images for Complex Words to Improve Human Reading

Xintong Wang, Florian Schneider, Özge Alacam, Prateek Chaudhury, Chris Biemann


Abstract
MOTIF (MultimOdal ConTextualized Images For Language Learners) is a multimodal dataset that consists of 1125 comprehension texts retrieved from Wikipedia Simple Corpus. Allowing multimodal processing or enriching the context with multimodal information has proven imperative for many learning tasks, specifically for second language (L2) learning. In this respect, several traditional NLP approaches can assist L2 readers in text comprehension processes, such as simplifying text or giving dictionary descriptions for complex words. As nicely stated in the well-known proverb, sometimes “a picture is worth a thousand words” and an image can successfully complement the verbal message by enriching the representation, like in Pictionary books. This multimodal support can also assist on-the-fly text reading experience by providing a multimodal tool that chooses and displays the most relevant images for the difficult words, given the text context. This study mainly focuses on one of the key components to achieving this goal; collecting a multimodal dataset enriched with complex word annotation and validated image match.
Anthology ID:
2022.lrec-1.263
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
2468–2477
Language:
URL:
https://aclanthology.org/2022.lrec-1.263
DOI:
Bibkey:
Cite (ACL):
Xintong Wang, Florian Schneider, Özge Alacam, Prateek Chaudhury, and Chris Biemann. 2022. MOTIF: Contextualized Images for Complex Words to Improve Human Reading. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 2468–2477, Marseille, France. European Language Resources Association.
Cite (Informal):
MOTIF: Contextualized Images for Complex Words to Improve Human Reading (Wang et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.263.pdf
Data
MS COCO