ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization

Sunzhu Li; Zhiyu Lin; Jiale Zhao; Shuling Yang; Chen Wei

ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization

Sunzhu Li, Zhiyu Lin, Jiale Zhao, Shuling Yang, Chen Wei

Abstract

Large Reasoning Models (LRMs) are powerful, but they still suffer from inefficient and off-target reasoning. Currently, training-free methods are limited to either rigid heuristics or descriptive, non-actionable analyses. In this paper, we introduce ThinkPilot, a training-free framework that automatically optimizes LRMs reasoning. It uses an evolutionary process to generate think-prefixes, namely instructions that evolve driven by a taxonomy of reasoning behaviors to guide models toward superior performance. Extensive experiments demonstrate ThinkPilot’s broad effectiveness: it significantly improves the accuracy-length trade-off for efficient reasoning, drastically improves safety (e.g., cutting the StrongREJECT score of DeepSeek-R1-Distill-Qwen-32B from 27.0% to 0.7%), and enhances instruction following. It also synergizes with existing training-based methods. Specially, our analysis reveals that think-prefixes can reliably control LRMs’ reasoning behaviors, and that different tasks have strong preferences for specific behavioral distributions. By automatically identifying and eliciting these behaviors, ThinkPilot provides a generalizable framework for aligning LRMs reasoning with task demands.

Anthology ID:: 2026.findings-eacl.185
Volume:: Findings of the Association for Computational Linguistics: EACL 2026
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3573–3592
Language:
URL:: https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.185/
DOI:
Bibkey:
Cite (ACL):: Sunzhu Li, Zhiyu Lin, Jiale Zhao, Shuling Yang, and Chen Wei. 2026. ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization. In Findings of the Association for Computational Linguistics: EACL 2026, pages 3573–3592, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization (Li et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-eacl/2026.findings-eacl.185.pdf
Checklist:: 2026.findings-eacl.185.checklist.pdf

PDF Cite Search Checklist Fix data