Hui Ma
2026
WebUncertainty: Dual-Level Uncertainty Driven Planning and Reasoning For Autonomous Web Agent
Lingfeng Zhang | Yongan Sun | Jinpeng Hu | Hui Ma | Ying Yang | Kuien Liu | Zenglin Shi | Meng Wang
Findings of the Association for Computational Linguistics: ACL 2026
Lingfeng Zhang | Yongan Sun | Jinpeng Hu | Hui Ma | Ying Yang | Kuien Liu | Zenglin Shi | Meng Wang
Findings of the Association for Computational Linguistics: ACL 2026
Recent advancements in large language models (LLMs) have empowered autonomous web agents to execute natural language instructions directly on real-world webpages. However, existing agents often struggle with complex tasks involving dynamic interactions and long-horizon execution due to rigid planning strategies and hallucination-prone reasoning. To address these limitations, we propose WebUncertainty, a novel autonomous agent framework designed to tackle dual-level uncertainty in planning and reasoning. Specifically, we design a Task Uncertainty-Driven Adaptive Planning Mechanism that adaptively selects planning modes to navigate unknown environments. Furthermore, we introduce an Action Uncertainty-Driven Monte Carlo tree search (MCTS) Reasoning Mechanism. This mechanism incorporates the Confidence-induced Action Uncertainty (ConActU) strategy to quantify both aleatoric uncertainty (AU) and epistemic uncertainty (EU), thereby optimizing the search process and guiding robust decision-making. Experimental results on the WebArena and WebVoyager benchmarks demonstrate that WebUncertainty achieves superior performance compared to state-of-the-art baselines.
2025
Efficient Tuning of Large Language Models for Knowledge-Grounded Dialogue Generation
Bo Zhang | Hui Ma | Dailin Li | Jian Ding | Jian Wang | Bo Xu | HongFei Lin
Transactions of the Association for Computational Linguistics, Volume 13
Bo Zhang | Hui Ma | Dailin Li | Jian Ding | Jian Wang | Bo Xu | HongFei Lin
Transactions of the Association for Computational Linguistics, Volume 13
Large language models (LLMs) demonstrate remarkable text comprehension and generation capabilities but often lack the ability to utilize up-to-date or domain-specific knowledge not included in their training data. To address this gap, we introduce KEDiT, an efficient method for fine-tuning LLMs for knowledge-grounded dialogue generation. KEDiT operates in two main phases. First, it employs an information bottleneck to compress retrieved knowledge into learnable parameters, retaining essential information while minimizing computational overhead. Second, a lightweight knowledge-aware adapter integrates these compressed knowledge vectors into the LLM during fine-tuning, updating less than 2% of the model parameters. The experimental results on the Wizard of Wikipedia and a newly constructed PubMed-Dialog dataset demonstrate that KEDiT excels in generating contextually relevant and informative responses, outperforming competitive baselines in automatic, LLM-based, and human evaluations. This approach effectively combines the strengths of pretrained LLMs with the adaptability needed for incorporating dynamic knowledge, presenting a scalable solution for fields such as medicine.1