AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Junyu Zhang; Runpei Dong; Han Wang (王涵); Xuying Ning; Haoran Geng; Peihao Li; Xialin He; Yutong Bai; Jitendra Malik; Saurabh Gupta; Huan Zhang (张欢)

Abstract

This paper presents AlphaOne (𝛼1), a universal framework for modulating reasoning progress in large reasoning models (LRMs) at test time. 𝛼1 first introduces the 𝛼 moment, which represents the scaled thinking phase with a universal parameter 𝛼. Within this scaled pre-𝛼 moment phase, it dynamically schedules slow thinking transitions by modeling the insertion of reasoning transition tokens as a Bernoulli stochastic process. After the 𝛼 moment, 𝛼1 deterministically terminates slow thinking with the end-of-thinking token, thereby fostering fast reasoning and efficient answer generation. This approach unifies and generalizes existing monotonic scaling methods by enabling flexible and dense slow-to-fast reasoning modulation. Extensive empirical studies on various challenging benchmarks across mathematical, coding, and scientific domains demonstrate 𝛼1's superior reasoning capability and efficiency. Project page: https://alphaone-project.github.io/.
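The two phases described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify how the 𝛼 moment is computed, how the insertion probability is scheduled, or which token strings are used. The sketch assumes the 𝛼 moment is `alpha` times a hypothetical base thinking budget, that the Bernoulli probability decays linearly to zero, and that the transition and end-of-thinking tokens are `"wait"` and `"</think>"`. The `next_token` callable is a stand-in for a real model's decoding step.

```python
import random
from typing import Callable, List


def alpha_one_sketch(
    next_token: Callable[[List[str]], str],  # hypothetical stand-in for a model
    alpha: float,
    base_thinking_len: int,                  # assumed base thinking budget
    p_start: float = 0.4,                    # assumed initial insertion probability
    transition_token: str = "wait",          # assumed slow-thinking transition token
    end_think_token: str = "</think>",       # assumed end-of-thinking token
    answer_len: int = 4,
    seed: int = 0,
) -> List[str]:
    rng = random.Random(seed)
    # Assumption: the alpha moment is the base budget scaled by alpha.
    alpha_moment = int(alpha * base_thinking_len)
    tokens: List[str] = []

    # Pre-alpha-moment phase: at each step, a Bernoulli trial decides
    # whether a transition token is inserted instead of a model token.
    for t in range(alpha_moment):
        p_t = p_start * (1.0 - t / alpha_moment)  # assumed linear decay
        if rng.random() < p_t:
            tokens.append(transition_token)
        else:
            tokens.append(next_token(tokens))

    # Alpha moment: slow thinking is ended deterministically.
    tokens.append(end_think_token)

    # Post-alpha-moment phase: plain generation of the answer.
    for _ in range(answer_len):
        tokens.append(next_token(tokens))
    return tokens


if __name__ == "__main__":
    out = alpha_one_sketch(lambda ctx: "step", alpha=1.5, base_thinking_len=10)
    print(out)
```

With `alpha=1.5` and a base budget of 10, the thinking phase spans 15 positions, the end-of-thinking token sits at index 15, and any `"wait"` tokens appear only before it. Setting `p_start=0` disables insertion entirely, which reduces the sketch to a fixed-length thinking budget.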

Anthology ID: 2025.emnlp-main.570
Volume: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month: November
Year: 2025
Address: Suzhou, China
Editors: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue: EMNLP
Publisher: Association for Computational Linguistics
Pages: 11340–11365
URL: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.570/
Cite (ACL): Junyu Zhang, Runpei Dong, Han Wang, Xuying Ning, Haoran Geng, Peihao Li, Xialin He, Yutong Bai, Jitendra Malik, Saurabh Gupta, and Huan Zhang. 2025. AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 11340–11365, Suzhou, China. Association for Computational Linguistics.
Cite (Informal): AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time (Zhang et al., EMNLP 2025)
PDF: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.570.pdf
Checklist: 2025.emnlp-main.570.checklist.pdf
