ConMA : Confidence-Guided Kernel Sampling with Multi-Stage Aggregation for LLM Reasoning
Yinuo Wang, Qingjie Li, Wenyao Cui, Qiuchi Li, Zhang Huaping
Abstract
Test-time scaling (TTS) enhances LLM reasoning capabilities by sampling and aggregating diverse solution trajectories. However, existing approaches often rely on external verifiers and one-shot independent sampling, which results in inefficient budget allocation and underutilizes interim high-quality trajectories. We propose ConMA, a training-free, verifier-free TTS framework that reallocates a fixed inference budget into iterative sample–filter–diversify–select cycles: it filters answer groups based on intrinsic token-probability confidence, enriches candidates through diversity-aware expansion, and employs repeated single-choice selection for multi-stage refinement. Across multiple benchmarks, ConMA consistently improves accuracy under fixed budgets. With a maximum budget of N=64, ConMA boosts Qwen3-4B to 80% accuracy on AIME25, significantly outperforming strong baselines while converging early with only 18 samples on average, substantially reducing inference cost.- Anthology ID:
- 2026.findings-acl.1475
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 29500–29518
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1475/
- DOI:
- Cite (ACL):
- Yinuo Wang, Qingjie Li, Wenyao Cui, Qiuchi Li, and Zhang Huaping. 2026. ConMA : Confidence-Guided Kernel Sampling with Multi-Stage Aggregation for LLM Reasoning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 29500–29518, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- ConMA : Confidence-Guided Kernel Sampling with Multi-Stage Aggregation for LLM Reasoning (Wang et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1475.pdf