Agent for Numerical Data Retrieval and Understanding by Code Generation and Multimodal Reasoning

Florian Baud, Feda Almuhisen, Dorian Midou


Abstract
Numerical data from sensors and time series are widely used in scientific research fields such as nuclear fusion experiments, which generate vast amounts of complex, high-dimensional data. Therefore, efficient numerical data analysis tools are crucial to accelerate experimental research. Large language models (LLMs) have emerged as promising solutions to analyze numerical data with natural language queries. However, LLMs have difficulties treating this type of data as they have been designed for text in the first place. To overcome these limitations, we propose a model-agnostic and data-agnostic agent that processes numerical data by code generation and multimodal reasoning. Our agent demonstrates competitive performance against baselines on benchmark data on numerical data tasks such as sensor data classification and time series understanding. While outperforming them on information retrieval benchmarks, also we have successfully applied our agent in the context of nuclear fusion research, where physicists and Tokamak operators interact with it to plan and analyze fusion experiments.
Anthology ID:
2026.findings-acl.1924
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
38630–38653
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1924/
DOI:
Bibkey:
Cite (ACL):
Florian Baud, Feda Almuhisen, and Dorian Midou. 2026. Agent for Numerical Data Retrieval and Understanding by Code Generation and Multimodal Reasoning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 38630–38653, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Agent for Numerical Data Retrieval and Understanding by Code Generation and Multimodal Reasoning (Baud et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1924.pdf
Checklist:
 2026.findings-acl.1924.checklist.pdf