Luting Hou
2026
Team TüLK at SemEval-2026 Task 1: Humor Generation with Qwen and Group Relative Policy Optimization
Konrad Brüggemann | Luting Hou
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Konrad Brüggemann | Luting Hou
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
This paper addresses the challenge of computational humor generation proposed in SemEval-2026 Task 1: Humor Generation. Our approach leverages Group Relative Policy Optimization, with an LLM serving as the policy and a custom joke rating model providing a reward signal. We demonstrate that this framework is an effective and computationally efficient approach, reliably producing genuinely funny content that adheres to task constraints.