Team TüLK at SemEval-2026 Task 1: Humor Generation with Qwen and Group Relative Policy Optimization

Konrad Brüggemann, Luting Hou


Abstract
This paper addresses the challenge of computational humor generation proposed in SemEval-2026 Task 1: Humor Generation. Our approach leverages Group Relative Policy Optimization, with an LLM serving as the policy and a custom joke rating model providing a reward signal. We demonstrate that this framework is an effective and computationally efficient approach, reliably producing genuinely funny content that adheres to task constraints.
Anthology ID:
2026.semeval-1.67
Volume:
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
463–474
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.67/
DOI:
Bibkey:
Cite (ACL):
Konrad Brüggemann and Luting Hou. 2026. Team TüLK at SemEval-2026 Task 1: Humor Generation with Qwen and Group Relative Policy Optimization. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 463–474, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Team TüLK at SemEval-2026 Task 1: Humor Generation with Qwen and Group Relative Policy Optimization (Brüggemann & Hou, SemEval 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.67.pdf