ConMA : Confidence-Guided Kernel Sampling with Multi-Stage Aggregation for LLM Reasoning

Yinuo Wang, Qingjie Li, Wenyao Cui, Qiuchi Li, Zhang Huaping


Abstract
Test-time scaling (TTS) enhances LLM reasoning capabilities by sampling and aggregating diverse solution trajectories. However, existing approaches often rely on external verifiers and one-shot independent sampling, which results in inefficient budget allocation and underutilizes interim high-quality trajectories. We propose ConMA, a training-free, verifier-free TTS framework that reallocates a fixed inference budget into iterative sample–filter–diversify–select cycles: it filters answer groups based on intrinsic token-probability confidence, enriches candidates through diversity-aware expansion, and employs repeated single-choice selection for multi-stage refinement. Across multiple benchmarks, ConMA consistently improves accuracy under fixed budgets. With a maximum budget of N=64, ConMA boosts Qwen3-4B to 80% accuracy on AIME25, significantly outperforming strong baselines while converging early with only 18 samples on average, substantially reducing inference cost.
Anthology ID:
2026.findings-acl.1475
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
29500–29518
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1475/
DOI:
Bibkey:
Cite (ACL):
Yinuo Wang, Qingjie Li, Wenyao Cui, Qiuchi Li, and Zhang Huaping. 2026. ConMA : Confidence-Guided Kernel Sampling with Multi-Stage Aggregation for LLM Reasoning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 29500–29518, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
ConMA : Confidence-Guided Kernel Sampling with Multi-Stage Aggregation for LLM Reasoning (Wang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1475.pdf
Checklist:
 2026.findings-acl.1475.checklist.pdf