SpiralThinker: Latent Reasoning through an Iterative Process with Text–Latent Interleaving

Shengmin Piao, Sanghyun Park


Abstract
Recent advances in large reasoning models have been driven by reinforcement learning and test-time scaling, accompanied by growing interest in latent rather than purely textual reasoning. However, existing latent reasoning methods lack mechanisms to ensure stable reasoning dynamics in latent space and a systematic way to interleave implicit and explicit reasoning. We introduce SpiralThinker, a unified framework that performs iterative updates over latent representations while enabling interleaved reasoning across latent and textual steps. At its core, SpiralThinker employs a progressive alignment objective and structured annotations to stabilize latent reasoning and maintain coherence with textual reasoning. Across mathematical, logical, and commonsense reasoning tasks, SpiralThinker achieves state-of-the-art performance among latent reasoning baselines. Detailed analyses reveal that both iteration and alignment are indispensable, the numbers of latent tokens and iterations exhibit dataset-specific optima, and appropriate alignment proves critical for an effective iterative process. Overall, SpiralThinker bridges iterative computation and latent reasoning, demonstrating that aligned iterative updates can reliably steer reasoning in the latent space.
Anthology ID:
2026.findings-acl.1605
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
32072–32088
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1605/
DOI:
Bibkey:
Cite (ACL):
Shengmin Piao and Sanghyun Park. 2026. SpiralThinker: Latent Reasoning through an Iterative Process with Text–Latent Interleaving. In Findings of the Association for Computational Linguistics: ACL 2026, pages 32072–32088, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
SpiralThinker: Latent Reasoning through an Iterative Process with Text–Latent Interleaving (Piao & Park, Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1605.pdf
Checklist:
 2026.findings-acl.1605.checklist.pdf