Semantic Novelty Detection and Characterization in Factual Text Involving Named Entities

Nianzu Ma, Sahisnu Mazumder, Alexander Politowicz, Bing Liu, Eric Robertson, Scott Grigsby


Abstract
Much of the existing work on text novelty detection has been studied at the topic level, i.e., identifying whether the topic of a document or a sentence is novel or not. Little work has been done at the fine-grained semantic level (or contextual level). For example, given that we know Elon Musk is the CEO of a technology company, the sentence “Elon Musk acted in the sitcom The Big Bang Theory” is novel and surprising because normally a CEO would not be an actor. Existing topic-based novelty detection methods work poorly on this problem because they do not perform semantic reasoning involving relations between named entities in the text and their background knowledge. This paper proposes an effective model (called PAT-SND) to solve the problem, which can also characterize the novelty. An annotated dataset is also created. Evaluation shows that PAT-SND outperforms 10 baselines by large margins.
Anthology ID:
2022.emnlp-main.627
Volume:
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates
Editors:
Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
9225–9252
Language:
URL:
https://preview.aclanthology.org/build-pipeline-with-new-library/2022.emnlp-main.627/
DOI:
10.18653/v1/2022.emnlp-main.627
Bibkey:
Cite (ACL):
Nianzu Ma, Sahisnu Mazumder, Alexander Politowicz, Bing Liu, Eric Robertson, and Scott Grigsby. 2022. Semantic Novelty Detection and Characterization in Factual Text Involving Named Entities. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 9225–9252, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):
Semantic Novelty Detection and Characterization in Factual Text Involving Named Entities (Ma et al., EMNLP 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/build-pipeline-with-new-library/2022.emnlp-main.627.pdf