Figure It Out: Improve the Frontier of Reasoning with Executable Visual States

Meiqi Chen, Fandong Meng, Jie Zhou


Abstract
Complex reasoning problems often involve implicit spatial and geometric relationships that are not explicitly encoded in text. While recent reasoning models perform well across many domains, purely text-based reasoning struggles to capture structural constraints in complex settings. In this paper, we introduce FIGR, which integrates executable visual construction into multi-turn reasoning via end-to-end reinforcement learning. Rather than relying solely on textual chains of thought, FIGR externalizes intermediate hypotheses by generating executable code that constructs diagrams within the reasoning loop. An adaptive reward mechanism selectively regulates when visual construction is invoked, enabling more consistent reasoning over latent global properties that are difficult to infer from text alone. Experiments on seven challenging mathematical benchmarks demonstrate that FIGR outperforms strong text-only chain-of-thought baselines, improving the base model by 13.12% on AIME 2025 and 11.00% on BeyondAIME. These results highlight the effectiveness of precise, controllable figure construction of FIGR in enhancing complex reasoning ability.
Anthology ID:
2026.acl-long.1827
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
39375–39388
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1827/
DOI:
Bibkey:
Cite (ACL):
Meiqi Chen, Fandong Meng, and Jie Zhou. 2026. Figure It Out: Improve the Frontier of Reasoning with Executable Visual States. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 39375–39388, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Figure It Out: Improve the Frontier of Reasoning with Executable Visual States (Chen et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1827.pdf
Checklist:
 2026.acl-long.1827.checklist.pdf