CUETClashing at SemEval-2026 Task 1: Multilingual Joke Generation Under Lexical and Topical Constraints Using Small Instruction-Tuned LLMs

Madiha Ahmed Chowdhury, Lamia Khan, Faozia Fariha, Symom Hossain Shohan, Mohammed Moshiul Hoque


Abstract
Generating humorous text is one of the most challenging tasks in natural language generation, because a model must balance creativity, cultural understanding, and explicit constraints at the same time. This paper presents our system for Subtask A of SemEval-2026 Task 1, MWAHAHA (Models Write Automatic Humor And Humans Annotate), which asks for single-sentence jokes in English, Spanish, and Chinese under two constraints: certain words must be included, and the joke must relate to a news headline. Our method uses instruction-tuned language models, Qwen2.5-3B-Instruct for English and Chinese and Salamandra-2B-Instruct for Spanish, paired with language-specific prompts, a tailored sampling configuration, and a robust cleaning step applied after generation. Without additional task-specific training, the system generates jokes that adhere to the constraints in all three languages, showing that simple prompt design and small instruction-tuned models can be a strong, efficient way to generate humorous text across multiple languages.
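The abstract does not spell out the cleaning or constraint-checking logic, so the following is only a minimal sketch of what such a post-generation step might look like. The function names (`clean_joke`, `contains_required_words`, `select_valid`), the `MODEL_BY_LANGUAGE` table, and the specific cleaning rules are all assumptions made for illustration; only the two model names come from the abstract.

```python
import re

# Hypothetical routing table: the model names are taken from the abstract,
# but the dictionary and the language codes are illustrative assumptions.
MODEL_BY_LANGUAGE = {
    "en": "Qwen2.5-3B-Instruct",
    "zh": "Qwen2.5-3B-Instruct",
    "es": "Salamandra-2B-Instruct",
}


def clean_joke(raw):
    """Keep the first non-empty line, collapse whitespace, strip wrapping quotes.

    These rules are an assumed stand-in for the paper's cleaning process.
    """
    lines = [line.strip() for line in raw.splitlines() if line.strip()]
    if not lines:
        return ""
    text = re.sub(r"\s+", " ", lines[0])
    return text.strip("\"'\u201c\u201d\u2018\u2019 ")


def contains_required_words(joke, required_words):
    """Case-insensitive substring check that every required word appears."""
    lowered = joke.casefold()
    return all(word.casefold() in lowered for word in required_words)


def select_valid(candidates, required_words):
    """Clean each sampled candidate and keep those meeting the word constraint."""
    cleaned = (clean_joke(candidate) for candidate in candidates)
    return [
        joke
        for joke in cleaned
        if joke and contains_required_words(joke, required_words)
    ]
```

A substring check is used here because it also works for Chinese, where words are not separated by spaces; for English or Spanish it can over-match (for example, "cat" inside "location"), so a word-boundary check may be preferable there. The topical constraint (relating the joke to a headline) is not covered by this sketch.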
Anthology ID:
2026.semeval-1.359
Volume:
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Ekaterina Kochmar, Debanjan Ghosh, Kai North, Mamoru Komachi
Venues:
SemEval | WS
Publisher:
Association for Computational Linguistics
Pages:
2860–2865
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.359/
Cite (ACL):
Madiha Ahmed Chowdhury, Lamia Khan, Faozia Fariha, Symom Hossain Shohan, and Mohammed Moshiul Hoque. 2026. CUETClashing at SemEval-2026 Task 1: Multilingual Joke Generation Under Lexical and Topical Constraints Using Small Instruction-Tuned LLMs. In Proceedings of the 20th International Workshop on Semantic Evaluation (2026), pages 2860–2865, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
CUETClashing at SemEval-2026 Task 1: Multilingual Joke Generation Under Lexical and Topical Constraints Using Small Instruction-Tuned LLMs (Chowdhury et al., SemEval 2026)
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.semeval-1.359.pdf