@inproceedings{zhu-etal-2025-multichartqa,
    title = "{M}ulti{C}hart{QA}: Benchmarking Vision-Language Models on Multi-Chart Problems",
    author = "Zhu, Zifeng  and
      Jia, Mengzhao  and
      Zhang, Zhihan  and
      Li, Lang  and
      Jiang, Meng",
    editor = "Chiruzzo, Luis  and
      Ritter, Alan  and
      Wang, Lu",
    booktitle = "Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)",
    month = apr,
    year = "2025",
    address = "Albuquerque, New Mexico",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2025.naacl-long.566/",
    doi = "10.18653/v1/2025.naacl-long.566",
    pages = "11341--11359",
    ISBN = "979-8-89176-189-6",
    abstract = "Multimodal Large Language Models (MLLMs) have demonstrated impressive abilities across various tasks, including visual question answering and chart comprehension, yet existing benchmarks for chart-related tasks fall short in capturing the complexity of real-world multi-chart scenarios. Current benchmarks primarily focus on single-chart tasks, neglecting the multi-hop reasoning required to extract and integrate information from multiple charts, which is essential in practical applications. To fill this gap, we introduce MultiChartQA, a benchmark that evaluates MLLMs' capabilities in four key areas: direct question answering, parallel question answering, comparative reasoning, and sequential reasoning. Our evaluation of a wide range of MLLMs reveals significant performance gaps compared to humans. These results highlight the challenges in multi-chart comprehension and the potential of MultiChartQA to drive advancements in this field. Our code and data are available at https://github.com/Zivenzhu/Multi-chart-QA."
}Markdown (Informal)
[MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems](https://preview.aclanthology.org/ingest-emnlp/2025.naacl-long.566/) (Zhu et al., NAACL 2025)
ACL
- Zifeng Zhu, Mengzhao Jia, Zhihan Zhang, Lang Li, and Meng Jiang. 2025. MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 11341–11359, Albuquerque, New Mexico. Association for Computational Linguistics.