FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation

Liqiang Jing, Viet Dac Lai, Seunghyun Yoon, Trung Bui, Xinya Du


Abstract
Video Multimodal Large Language Models (VideoMLLMs) have achieved remarkable progress in both Video-to-Text and Text-to-Video tasks. However, they often suffer from hallucinations, generating content that contradicts the visual input. Existing evaluation methods are limited to one task (V2T) and also fail to assess hallucinations in open-ended, free-form responses. To address this gap, we propose FIFA, a unified FaIthFulness evAluation framework that extracts comprehensive descriptive facts, models their semantic dependencies via a Spatio-Temporal Semantic Dependency Graph, and verifies them using VideoQA models. We further introduce , a tool-based correction framework that revises hallucinated content. Extensive experiments demonstrate that FIFA aligns more closely with human judgment than existing evaluation methods, and that   effectively improves factual consistency in both text and video generation.
Anthology ID:
2026.findings-acl.555
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
11418–11456
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.555/
DOI:
Bibkey:
Cite (ACL):
Liqiang Jing, Viet Dac Lai, Seunghyun Yoon, Trung Bui, and Xinya Du. 2026. FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 11418–11456, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation (Jing et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.555.pdf
Checklist:
 2026.findings-acl.555.checklist.pdf