One and Only at SemEval-2026 Task 2: Evaluating Zero-Shot Autonomous LLM Agents and Heuristic Proxies in Ecological Affect Forecasting

Nam Dinh

Abstract

This paper presents team One and Only’s system for SemEval-2026 Task 2: Predicting Variation in Emotional Valence and Arousal over Time (Soni et al., 2026). We investigate whether zero-shot LLM reasoning can replace fine-tuning for ecological affect forecasting by combining deterministic statistical priors with frozen LLMs (Gemini 3 Pro, Claude Opus 4.6, GPT-5.2). For short-term state changes (Subtask 2A), an OLS mean-reversion anchor is paired with LLM-generated impulses; for long-term disposition changes (Subtask 2B), a Chain-of-Thought prompt drives direct numeric prediction. Our system underperforms fine-tuned approaches on both subtasks. However, post-submission ablation across three LLMs reveals a task-dependent pattern: CoT reasoning substantially improves disposition forecasting (r_V: −0.185 → +0.129; MAE_V: 0.899 → 0.422), while uncalibrated LLM impulses degrade state-change prediction due to variance collapse (σ_pred = 0.41 vs. σ_gold = 1.73). We provide a detailed diagnostic analysis of these failure modes and release all prompts and outputs for reproducibility.
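To make the Subtask 2A idea concrete, the following is a minimal sketch of an OLS mean-reversion anchor combined with an additive impulse, plus a simple variance-collapse check. It is not the author's implementation: the abstract does not give the regression form, so the sketch assumes the state change is regressed on the current level, and the names `fit_ols`, `predict_change`, and `std_ratio` are hypothetical.

```python
from statistics import mean, pstdev


def fit_ols(xs, ys):
    """Ordinary least squares for y = intercept + slope * x (one regressor)."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("xs must not all be equal")
    slope = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sxx
    return y_bar - slope * x_bar, slope


def predict_change(level, intercept, slope, impulse=0.0):
    """Anchor (assumed form: change regressed on current level) plus an impulse.

    A negative slope pulls high levels down and low levels up, i.e. mean
    reversion. `impulse` stands in for an LLM-generated adjustment.
    """
    return intercept + slope * level + impulse


def std_ratio(pred, gold):
    """Ratio of predicted to gold standard deviation; values far below 1
    indicate variance collapse."""
    return pstdev(pred) / pstdev(gold)


# Toy data (invented for illustration, not from the task).
levels = [2.0, 1.0, 0.0, -1.0, -2.0]
changes = [-1.0, -0.5, 0.0, 0.5, 1.0]
intercept, slope = fit_ols(levels, changes)
print(predict_change(2.0, intercept, slope, impulse=0.25))
```

For reference, the standard deviations reported in the abstract (0.41 predicted vs. 1.73 gold) correspond to a ratio of roughly 0.24 under this diagnostic.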

Anthology ID: 2026.semeval-1.162
Volume: Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month: July
Year: 2026
Address: San Diego, California, USA
Editors: Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues: SemEval | WS
Publisher: Association for Computational Linguistics
Pages: 1205–1211
URL: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.162/
Cite (ACL): Nam Dinh. 2026. One and Only at SemEval-2026 Task 2: Evaluating Zero-Shot Autonomous LLM Agents and Heuristic Proxies in Ecological Affect Forecasting. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 1205–1211, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal): One and Only at SemEval-2026 Task 2: Evaluating Zero-Shot Autonomous LLM Agents and Heuristic Proxies in Ecological Affect Forecasting (Dinh, SemEval 2026)
PDF: https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.162.pdf
Supplementary material: 2026.semeval-1.162.SupplementaryMaterial.zip
Supplementary material: 2026.semeval-1.162.SupplementaryMaterial.tex