AI-Monitors at SemEval-2026 Task 4: A Hybrid Embedding and LLM Ensemble for Narrative Similarity

Vishnu Tripathi; Azad -; Prakhar Joshi; Pragyananda Sahoo; Gaurav Kumar; Piyush Arora; Neel Mani

Abstract

Narrative similarity requires reasoning over the deeper structural properties of stories - shared themes, causal progression, and outcomes - rather than surface-level lexical overlap. We describe AI-Monitors, our system for SemEval-2026 Task 4 (Track A), which determines which of two candidate stories is more narratively similar to a given anchor. We explore a progression of approaches - from embedding-based similarity to structured LLM prompting and ensemble construction - guided by four hypotheses about where narrative reasoning gains can be found. The final system achieves 75% test accuracy on 400 instances, ranking 3rd out of 47 systems and approaching the individual human annotator ceiling of 78%. Our key findings are: i) structured few-shot prompting substantially outperforms dense embedding similarity; ii) selecting ensemble components by how differently they make errors - rather than by accuracy alone - produces stronger predictions; and iii) the way an example is described to the model affects its predictions.
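Finding ii) can be illustrated with a minimal sketch. The abstract does not state how error diversity was measured or how ensemble members were chosen, so everything below is an assumption made for illustration: the Jaccard overlap of error sets as the diversity measure, an exhaustive search over three-member ensembles, plain majority voting, and the helper names (`error_set`, `error_overlap`, `pick_trio`). The predictions are toy data, not results from the paper.

```python
from itertools import combinations

def error_set(preds, gold):
    """Indices where a model's prediction differs from the gold label."""
    return {i for i, (p, g) in enumerate(zip(preds, gold)) if p != g}

def error_overlap(a, b):
    """Jaccard overlap of two error sets (0.0 means the errors never coincide)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def majority_vote(pred_lists):
    """Per-instance majority label over an odd number of binary predictors."""
    return [int(sum(votes) * 2 > len(votes)) for votes in zip(*pred_lists)]

def accuracy(preds, gold):
    return sum(p == g for p, g in zip(preds, gold)) / len(gold)

def pick_trio(model_preds, gold):
    """Pick the three models with the lowest mean pairwise error overlap.

    Hypothetical selection criterion, not the authors' procedure.
    """
    errs = {name: error_set(p, gold) for name, p in model_preds.items()}

    def mean_overlap(trio):
        pairs = list(combinations(trio, 2))
        return sum(error_overlap(errs[a], errs[b]) for a, b in pairs) / len(pairs)

    return min(combinations(sorted(model_preds), 3), key=mean_overlap)

def with_errors(gold, wrong_idxs):
    """Build toy predictions that are wrong exactly at wrong_idxs."""
    return [1 - g if i in wrong_idxs else g for i, g in enumerate(gold)]

# Toy dev set: label 0 or 1 says which candidate story is closer to the anchor.
gold = [0, 1, 0, 1, 1, 0, 1, 0]
toy_preds = {
    "A": with_errors(gold, {0, 1}),     # accurate, but errs where B errs
    "B": with_errors(gold, {0, 1}),
    "C": with_errors(gold, {2, 3}),
    "D": with_errors(gold, {4, 5, 6}),  # least accurate, but errs elsewhere
}

diverse = pick_trio(toy_preds, gold)
top3 = ("A", "B", "C")  # the three most accurate models individually
diverse_acc = accuracy(majority_vote([toy_preds[n] for n in diverse]), gold)
top3_acc = accuracy(majority_vote([toy_preds[n] for n in top3]), gold)
print(diverse, diverse_acc, top3_acc)
```

In this toy setup the weakest individual model, D, is worth including because its mistakes fall on instances the other members get right, whereas A and B outvote C on the instances where they share the same errors.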

Anthology ID: 2026.semeval-1.271
Volume: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month: July
Year: 2026
Address: San Diego, California, USA
Editors: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues: SemEval | WS
Publisher: Association for Computational Linguistics
Pages: 2139–2148
URL: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.271/
Cite (ACL): Vishnu Tripathi, Azad -, Prakhar Joshi, Pragyananda Sahoo, Gaurav Kumar, Piyush Arora, and Neel Mani. 2026. AI-Monitors at SemEval-2026 Task 4: A Hybrid Embedding and LLM Ensemble for Narrative Similarity. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 2139–2148, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal): AI-Monitors at SemEval-2026 Task 4: A Hybrid Embedding and LLM Ensemble for Narrative Similarity (Tripathi et al., SemEval 2026)
PDF: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.271.pdf
