CUNI at SemEval-2026 Task 4: Multi-Head Narrative Aspect Disentanglement via Entangled Synthetic Dataset

Jan Mitka, Jindrich Helcl


Abstract
We participate in Track B of the SemEval 2026 Task 4 on narrative similarity, focusing on narrative representation learning. We introduce a synthetic dataset designed to disentangle core narrative aspects-abstract theme, course of action, and outcome-and propose a multi-head multi-positive extension of the InfoNCE objective to train aspect-specific embeddings. Our best model achieves 64.25\% accuracy on the test set. A nearest-centroid analysis indicates partial aspect-specific structure in the submitted checkpoint, while the training dynamics reveal a partial misalignment between the contrastive objective and the triplet-based evaluation protocol.
Anthology ID:
2026.semeval-1.274
Volume:
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2163–2178
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.274/
DOI:
Bibkey:
Cite (ACL):
Jan Mitka and Jindrich Helcl. 2026. CUNI at SemEval-2026 Task 4: Multi-Head Narrative Aspect Disentanglement via Entangled Synthetic Dataset. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 2163–2178, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
CUNI at SemEval-2026 Task 4: Multi-Head Narrative Aspect Disentanglement via Entangled Synthetic Dataset (Mitka & Helcl, SemEval 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.274.pdf
Supplementarymaterial:
 2026.semeval-1.274.SupplementaryMaterial.zip