Entropy-based Exploration Conduction for Multi-step Reasoning
Jinghan Zhang, Xiting Wang, Fengran Mo, Yeyang Zhou, Wanfu Gao, Kunpeng Liu
Abstract
Multi-step reasoning processes via large language models (LLMs) have proven effective for solving complex reasoning tasks. However, the depth of exploration in the reasoning procedure can significantly affect task performance, and existing methods for automatically deciding this depth often incur high cost and lack flexibility. To address these issues, we propose Entropy-based Exploration Depth Conduction (Entro-duction), a novel method that dynamically adjusts the exploration depth during multi-step reasoning by monitoring the LLM’s output entropy and variance entropy. These two signals capture the model’s uncertainty at the current step and the fluctuation of uncertainty across consecutive reasoning steps. Based on the observed entropy changes, the LLM probabilistically selects whether to deepen, expand, or stop exploration, which facilitates the trade-off between reasoning accuracy and exploration effectiveness. Experimental results across four benchmark datasets demonstrate the efficacy of Entro-duction.
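To make the mechanism described in the abstract concrete, the Python sketch below illustrates one plausible reading: compute a per-step output entropy from token log-probabilities, track the variance of entropy over recent steps, and map the two signals to a probability over deepen/expand/stop. The helper names (`step_entropy`, `entropy_variance`, `choose_action`), the reference scales, and the softmax-style weighting are illustrative assumptions, not the paper's exact formulation.

```python
import math
import random
from typing import List

def step_entropy(token_logprobs: List[List[float]]) -> float:
    """Mean Shannon entropy (in nats) of the next-token distributions
    produced while generating one reasoning step.

    token_logprobs[i] holds the log-probabilities of the candidate
    tokens considered at position i (assumed already normalized)."""
    entropies = [-sum(math.exp(lp) * lp for lp in logps)
                 for logps in token_logprobs]
    return sum(entropies) / max(len(entropies), 1)

def entropy_variance(step_entropies: List[float], window: int = 3) -> float:
    """Variance of per-step entropy over the most recent `window` steps,
    capturing how much the model's uncertainty fluctuates."""
    recent = step_entropies[-window:]
    mean = sum(recent) / len(recent)
    return sum((h - mean) ** 2 for h in recent) / len(recent)

def choose_action(h: float, v: float,
                  h_ref: float = 1.0, v_ref: float = 0.1) -> str:
    """Map current entropy h and entropy variance v to a probability
    over {deepen, expand, stop} and sample an action."""
    scores = {
        "deepen": h / h_ref,   # high uncertainty: reason this branch further
        "expand": v / v_ref,   # unstable uncertainty: branch out to alternatives
        "stop":   1.0,         # baseline preference to terminate exploration
    }
    z = sum(math.exp(s) for s in scores.values())
    probs = {a: math.exp(s) / z for a, s in scores.items()}
    return random.choices(list(probs), weights=list(probs.values()))[0]
```

In a real implementation, the window size and the reference scales would be tuned per task; the key point is only that both the level of uncertainty and its recent fluctuation feed the exploration decision.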
- Anthology ID:
- 2025.findings-acl.201
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2025
- Month:
- July
- Year:
- 2025
- Address:
- Vienna, Austria
- Editors:
- Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
- Venues:
- Findings | WS
- Publisher:
- Association for Computational Linguistics
- Pages:
- 3895–3906
- URL:
- https://preview.aclanthology.org/ingestion-acl-25/2025.findings-acl.201/
- Cite (ACL):
- Jinghan Zhang, Xiting Wang, Fengran Mo, Yeyang Zhou, Wanfu Gao, and Kunpeng Liu. 2025. Entropy-based Exploration Conduction for Multi-step Reasoning. In Findings of the Association for Computational Linguistics: ACL 2025, pages 3895–3906, Vienna, Austria. Association for Computational Linguistics.
- Cite (Informal):
- Entropy-based Exploration Conduction for Multi-step Reasoning (Zhang et al., Findings 2025)
- PDF:
- https://preview.aclanthology.org/ingestion-acl-25/2025.findings-acl.201.pdf