Warda Yousaf
2026
SLPGFJWUWarda at SemEval-2026 Task 1: A Multimodal Vision-Language Approach for Humor Generation Using Fine-Tuned BLIP
Warda Yousaf
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Warda Yousaf
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
We present a BLIP-based multimodal system for image-based humor generation submitted to SemEval-2026 Task 1 (MWAHAHA), focusing on Task B1. Our approach fine-tunes a vision–language model on meme-style captions and handles animated GIFs via representative frame extraction to generate culturally grounded humorous captions.