@inproceedings{guo-etal-2025-benchmarking,
title = "Benchmarking Uncertainty Metrics for {LLM} Target-Aware Search",
author = "Guo, Pei-Fu and
Tsai, Yun-Da and
Lin, Shou-De",
editor = "Christodoulopoulos, Christos and
Chakraborty, Tanmoy and
Rose, Carolyn and
Peng, Violet",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2025",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.226/",
doi = "10.18653/v1/2025.findings-emnlp.226",
pages = "4230--4238",
ISBN = "979-8-89176-335-7",
abstract = "LLM search methods, such as Chain of Thought (CoT) and Tree of Thought (ToT), enhance LLM reasoning by exploring multiple reasoning paths. When combined with search algorithms like MCTS and Bandit methods, their effectiveness relies heavily on uncertainty estimation to prioritize paths that align with specific search objectives. \textit{However, it remains unclear whether existing LLM uncertainty metrics adequately capture the diverse types of uncertainty required to guide different search objectives.} In this work, we introduce a framework for uncertainty benchmarking, identifying four distinct uncertainty types: Answer, Correctness, Aleatoric, and Epistemic Uncertainty. Each type serves different optimization goals in search. Our experiments demonstrate that current metrics often align with only a subset of these uncertainty types, limiting their effectiveness for objective-aligned search in some cases. These findings highlight the need for additional target-aware uncertainty estimators that can adapt to various optimization goals in LLM search."
}