McMaster NLP at SemEval-2026 Task 2: A Lightweight Multi-Feature System for Predicting Emotional Valence and Arousal over Time

Hongyi Zhang, Daniel Hu, Allison Lahnala


Abstract
We present a lightweight, feature-based regression system for predicting \textbf{valence} (pleasantness) and \textbf{arousal} (activation) from longitudinal language data. The language data ranges from longer free-form ecological essays to short affect-word, organized by user and time, reflecting natural variation in affective expression and experience. Our approach combines three complementary signals: (i) sentence-level semantic embeddings, (ii) psycholinguistic category features capturing affect- and function-related word usage, (iii) similarity measures between the language data with archetypal sentences, and (iv) trainable user-embeddings to account for between-user differences. The resulting feature vector is passed to a multi-layer perceptron trained to jointly predict valence and arousal. Our design provides a strong and interpretable baseline by making it possible to isolate the contribution of semantic, psycholinguistic, similarity, and user-specific signals. We further analyze our model’s predictions to identify which feature groups are most informative and where errors are concentrated across users and input types.
Anthology ID:
2026.semeval-1.98
Volume:
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
686–698
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.98/
DOI:
Bibkey:
Cite (ACL):
Hongyi Zhang, Daniel Hu, and Allison Lahnala. 2026. McMaster NLP at SemEval-2026 Task 2: A Lightweight Multi-Feature System for Predicting Emotional Valence and Arousal over Time. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 686–698, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
McMaster NLP at SemEval-2026 Task 2: A Lightweight Multi-Feature System for Predicting Emotional Valence and Arousal over Time (Zhang et al., SemEval 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.98.pdf
Supplementarymaterial:
 2026.semeval-1.98.SupplementaryMaterial.zip
Supplementarymaterial:
 2026.semeval-1.98.SupplementaryMaterial.zip