Representing Multiword Term Variation in a Terminological Knowledge Base: a Corpus-Based Study

Pilar León-Araúz, Arianne Reimerink, Melania Cabezas-García


Abstract
In scientific and technical communication, multiword terms (MWTs) are the most frequent type of lexical unit. Rendering them in another language is not an easy task due to their cognitive complexity, the proliferation of different forms, and their unsystematic representation in terminographic resources. This often results in a broad spectrum of translations for MWTs, which themselves foster term variation because they consist of two or more constituents. In this study, we carried out a quantitative and qualitative analysis of Spanish translation variants of a set of environment-related concepts by evaluating equivalents in three parallel corpora, two comparable corpora and two terminological resources. Our results showed that MWTs exhibit a significant degree of term variation with different characteristics, and these characteristics were used to establish a set of criteria according to which term variants should be selected, organized and described in terminological knowledge bases.
Anthology ID:
2020.lrec-1.287
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
2358–2367
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.287
Cite (ACL):
Pilar León-Araúz, Arianne Reimerink, and Melania Cabezas-García. 2020. Representing Multiword Term Variation in a Terminological Knowledge Base: a Corpus-Based Study. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 2358–2367, Marseille, France. European Language Resources Association.
Cite (Informal):
Representing Multiword Term Variation in a Terminological Knowledge Base: a Corpus-Based Study (León-Araúz et al., LREC 2020)
PDF:
https://aclanthology.org/2020.lrec-1.287.pdf