Abstract
Existing event-centric NLP models often only apply to the pre-defined ontology, which significantly restricts their generalization capabilities.This paper presents CEO, a novel Corpus-based Event Ontology induction model to relax the restriction imposed by pre-defined event ontologies. Without direct supervision, CEO leverages distant supervision from available summary datasets to detect corpus-wise salient events and exploits external event knowledge to force events within a short distance to have close embeddings. Experiments on three popular event datasets show that the schema induced by CEO has better coverage and higher accuracy than previous methods. Moreover, CEO is the first event ontology induction model that can induce a hierarchical event ontology with meaningful names on eleven open-domain corpora, making the induced schema more trustworthy and easier to be further curated. We anonymously release our dataset, codes, and induced ontology.- Anthology ID:
- 2024.findings-eacl.64
- Volume:
- Findings of the Association for Computational Linguistics: EACL 2024
- Month:
- March
- Year:
- 2024
- Address:
- St. Julian’s, Malta
- Editors:
- Yvette Graham, Matthew Purver
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 946–964
- Language:
- URL:
- https://aclanthology.org/2024.findings-eacl.64
- DOI:
- Cite (ACL):
- Nan Xu, Hongming Zhang, and Jianshu Chen. 2024. CEO: Corpus-based Open-Domain Event Ontology Induction. In Findings of the Association for Computational Linguistics: EACL 2024, pages 946–964, St. Julian’s, Malta. Association for Computational Linguistics.
- Cite (Informal):
- CEO: Corpus-based Open-Domain Event Ontology Induction (Xu et al., Findings 2024)
- PDF:
- https://preview.aclanthology.org/proper-vol2-ingestion/2024.findings-eacl.64.pdf