Capturing the Content of a Document through Complex Event Identification

Zheng Qi, Elior Sulem, Haoyu Wang, Xiaodong Yu, Dan Roth


Abstract
Granular events, instantiated in a document by predicates, can usually be grouped into more general events, called complex events. Together, they capture the major content of the document. Recent work grouped granular events by defining event regions, filtering out sentences that are irrelevant to the main content. However, this approach assumes that a given complex event is always described in consecutive sentences, which does not always hold in practice. In this paper, we introduce the task of complex event identification. We address this task with a pipeline that first predicts whether two granular events mentioned in the text belong to the same complex event, independently of their position in the text, and then uses these predictions to cluster the granular events into complex events. Because it is difficult to predict in isolation whether two granular events belong to the same complex event, we propose a context-augmented representation learning approach, CONTEXTRL, that adds context to better model the pairwise relation between granular events. We show that our approach outperforms strong baselines on the complex event identification task and further present a promising case study exploring the effectiveness of using complex events as input for document-level argument extraction.
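The two-stage pipeline described in the abstract (pairwise prediction, then clustering) can be illustrated with a minimal sketch. The abstract does not say which clustering algorithm is used, so this example assumes a simple stand-in: connected components over thresholded pairwise scores, computed with union-find. The names `cluster_events`, `toy_score`, and `threshold`, as well as the toy events and their topic tags, are hypothetical and are not taken from the paper; `toy_score` merely stands in for a learned pairwise model such as CONTEXTRL.

```python
from itertools import combinations


def cluster_events(events, same_complex_event_score, threshold=0.5):
    """Group events by linking every pair whose score reaches the threshold.

    Assumption: connected components stand in for the clustering step,
    which the abstract leaves unspecified.
    """
    parent = list(range(len(events)))

    def find(i):
        # Walk up to the root of i's component, halving the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Stage 1: score every pair, regardless of position in the document.
    for i, j in combinations(range(len(events)), 2):
        if same_complex_event_score(events[i], events[j]) >= threshold:
            parent[find(i)] = find(j)  # Stage 2: merge the two components.

    # Collect members per component, ordered by first appearance.
    components = {}
    for i in range(len(events)):
        components.setdefault(find(i), []).append(i)
    return [[events[i] for i in members] for members in components.values()]


def toy_score(a, b):
    # Hypothetical scorer: events that share a topic tag score 1.0.
    return 1.0 if a[1] == b[1] else 0.0


# Toy document order: related events are NOT in consecutive positions.
events = [
    ("attack", "conflict"),
    ("election", "politics"),
    ("bombing", "conflict"),
    ("vote", "politics"),
]
clusters = cluster_events(events, toy_score)
```

In this toy input, related events are interleaved rather than adjacent, which is the situation the abstract says region-based grouping handles poorly. A pairwise formulation can still group them because it does not depend on sentence order.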
Anthology ID:
2022.starsem-1.29
Volume:
Proceedings of the 11th Joint Conference on Lexical and Computational Semantics
Month:
July
Year:
2022
Address:
Seattle, Washington
Venue:
*SEM
SIG:
SIGSEM
Publisher:
Association for Computational Linguistics
Pages:
331–340
URL:
https://aclanthology.org/2022.starsem-1.29
DOI:
10.18653/v1/2022.starsem-1.29
Cite (ACL):
Zheng Qi, Elior Sulem, Haoyu Wang, Xiaodong Yu, and Dan Roth. 2022. Capturing the Content of a Document through Complex Event Identification. In Proceedings of the 11th Joint Conference on Lexical and Computational Semantics, pages 331–340, Seattle, Washington. Association for Computational Linguistics.
Cite (Informal):
Capturing the Content of a Document through Complex Event Identification (Qi et al., *SEM 2022)
PDF:
https://preview.aclanthology.org/auto-file-uploads/2022.starsem-1.29.pdf