CaRVE: Critiquing and Refining Visual Elaborations for Figurative Language Illustrations
Manishit Kundu, Tejomay Kishor Padole, Sumit Shekhar, Biplab Banerjee, Pushpak Bhattacharyya
Abstract
Illustrating figurative language remains challenging due to its non-literal semantics, and existing text-to-image frameworks rely heavily on proprietary models or human supervision to achieve adequate alignment. We introduce CaRVE, a lightweight and fully open-source critique-driven framework that employs VLM feedback to refine visual elaborations for figurative image generation. CaRVE bridges the semantic alignment gap even in sub-4B models by correcting visual and conceptual misalignments, reducing over-literalization, and improving robustness to complex figurative expressions. Using only open-source models, CaRVE achieves a 6.49% improvement over prior baselines on intrinsic automatic evaluations and a +0.37 average rank gain in human preference. We further release MetaCaRVE, an enhanced figurative image dataset constructed by refining HAIVMet using CaRVE.- Anthology ID:
- 2026.findings-acl.2122
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 42760–42777
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2122/
- DOI:
- Cite (ACL):
- Manishit Kundu, Tejomay Kishor Padole, Sumit Shekhar, Biplab Banerjee, and Pushpak Bhattacharyya. 2026. CaRVE: Critiquing and Refining Visual Elaborations for Figurative Language Illustrations. In Findings of the Association for Computational Linguistics: ACL 2026, pages 42760–42777, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- CaRVE: Critiquing and Refining Visual Elaborations for Figurative Language Illustrations (Kundu et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2122.pdf