CEO: Corpus-based Open-Domain Event Ontology Induction

Nan Xu, Hongming Zhang, Jianshu Chen


Abstract
Existing event-centric NLP models often only apply to the pre-defined ontology, which significantly restricts their generalization capabilities.This paper presents CEO, a novel Corpus-based Event Ontology induction model to relax the restriction imposed by pre-defined event ontologies. Without direct supervision, CEO leverages distant supervision from available summary datasets to detect corpus-wise salient events and exploits external event knowledge to force events within a short distance to have close embeddings. Experiments on three popular event datasets show that the schema induced by CEO has better coverage and higher accuracy than previous methods. Moreover, CEO is the first event ontology induction model that can induce a hierarchical event ontology with meaningful names on eleven open-domain corpora, making the induced schema more trustworthy and easier to be further curated. We anonymously release our dataset, codes, and induced ontology.
Anthology ID:
2024.findings-eacl.64
Volume:
Findings of the Association for Computational Linguistics: EACL 2024
Month:
March
Year:
2024
Address:
St. Julian’s, Malta
Editors:
Yvette Graham, Matthew Purver
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
946–964
Language:
URL:
https://aclanthology.org/2024.findings-eacl.64
DOI:
Bibkey:
Cite (ACL):
Nan Xu, Hongming Zhang, and Jianshu Chen. 2024. CEO: Corpus-based Open-Domain Event Ontology Induction. In Findings of the Association for Computational Linguistics: EACL 2024, pages 946–964, St. Julian’s, Malta. Association for Computational Linguistics.
Cite (Informal):
CEO: Corpus-based Open-Domain Event Ontology Induction (Xu et al., Findings 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.findings-eacl.64.pdf