Luting Hou

2026

Team TüLK at SemEval-2026 Task 1: Humor Generation with Qwen and Group Relative Policy Optimization
Konrad Brüggemann | Luting Hou
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)

This paper addresses the challenge of computational humor generation proposed in SemEval-2026 Task 1: Humor Generation. Our approach leverages Group Relative Policy Optimization, with an LLM serving as the policy and a custom joke rating model providing a reward signal. We demonstrate that this framework is an effective and computationally efficient approach, reliably producing genuinely funny content that adheres to task constraints.

Co-authors

Konrad Brüggemann 1

Venues

SemEval1
WS1

Fix author