Hui Liu
Other people with similar names: Hui Liu (CUHK), Hui Liu, Hui Liu (MSU), Hui Liu (UCAS, Tencent)
Unverified author pages with similar names: Hui Liu
2026
A Reward-Guided Dual-Phase Framework for Adaptive Inference-Time Reasoning
Yingqian Cui | Zhenwei Dai | Pengfei He | Bing He | Hui Liu | Zhan Shi | Xianfeng Tang | Jingying Zeng | Suhang Wang | Yue Xing | Jiliang Tang | Benoit Dumoulin
Findings of the Association for Computational Linguistics: ACL 2026
Yingqian Cui | Zhenwei Dai | Pengfei He | Bing He | Hui Liu | Zhan Shi | Xianfeng Tang | Jingying Zeng | Suhang Wang | Yue Xing | Jiliang Tang | Benoit Dumoulin
Findings of the Association for Computational Linguistics: ACL 2026
Large Language Models (LLMs) have made strong progress in reasoning. To enhance the reasoning performance, a common inference-time approach is tree-based search, which decomposes the reasoning process into multiple steps, expands multiple reasoning paths, and uses reward models to prune and select candidates. However, based on our exploration, the simple decomposition may lead to suboptimal searching efficiency: while planning is generally harder, it is the execution errors that are more likely to propagate to later steps. This indicates that planning and execution play different roles in reasoning and should be treated differently during tree-based search. Given this, to enhance the searching efficiency, we propose a dual-phase test-time scaling framework that separates reasoning into planning and execution, and performs search over each phase independently. To further refine the algorithm, we also introduce a dynamic budget allocation mechanism that adaptively redistributes sampling effort based on reward feedback, allowing early stopping on confident steps and reallocation of computation to more challenging steps. Experiments on both math reasoning and code generation benchmarks demonstrate that our approach consistently improves accuracy while reducing redundant computation.
Anchoring the Cache: Mitigating Contextual Hallucination in KV-Compressed Long-Context Summarization
Yu Fu | Chen Luo | Josef Valvoda | Xin Zhang | Xuejing Lei | Xiao Pan | Hui Liu | Yue Dong
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yu Fu | Chen Luo | Josef Valvoda | Xin Zhang | Xuejing Lei | Xiao Pan | Hui Liu | Yue Dong
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Key-Value (KV) cache compression techniques have improved the efficiency of long-context summarization in Large Language Models (LLMs), but their impact on model hallucination remains underexplored. In this paper, we present the first systematic study of how KV cache compression affects hallucination in long-context summarization, demonstrating that aggressive compression can increase hallucination scores by up to 3.36× compared to the baseline. To mitigate this issue, we propose HalluKV, a decoding-phase strategy that selectively removes generated KV pairs from retrieval heads responsible for retrieving critical information from source context, thereby anchoring their attention on the preserved source information. Our approach maintains computational efficiency while significantly reducing hallucination across multiple models and datasets, achieving up to 5.48 average point reductions on Llama-3-8B-Instruct, enabling more trustworthy long-context summarization.
Do VLMs Have a Moral Backbone? A Study on the Fragile Morality of Vision-Language Models
Zhining Liu | Tianyi Wang | Xiao Lin | Penghao Ouyang | Gaotang Li | Ze Yang | Hui Liu | Sumit Keswani | Vishwa Pardeshi | Huijun Zhao | Wei Fan | Hanghang Tong
Findings of the Association for Computational Linguistics: ACL 2026
Zhining Liu | Tianyi Wang | Xiao Lin | Penghao Ouyang | Gaotang Li | Ze Yang | Hui Liu | Sumit Keswani | Vishwa Pardeshi | Huijun Zhao | Wei Fan | Hanghang Tong
Findings of the Association for Computational Linguistics: ACL 2026
Despite substantial efforts toward improving the moral alignment of Vision-Language Models (VLMs), it remains unclear whether their ethical judgments are stable in realistic settings. This work studies moral robustness in VLMs, defined as the ability to preserve moral judgments under textual and visual perturbations that do not alter the underlying moral context. We systematically probe VLMs with a diverse set of model-agnostic multimodal perturbations and find that their moral stances are highly fragile, frequently flipping under simple manipulations. Our analysis reveals systematic vulnerabilities across perturbation types, moral domains, and model scales, including a sycophancy trade-off where stronger instruction-following models are more susceptible to persuasion. We further show that lightweight inference-time interventions can partially restore moral stability. These results demonstrate that moral alignment alone is insufficient and that moral robustness is a necessary criterion for the responsible deployment of VLMs.
Graph-Assisted Large Language Models: A Perspective on Mitigating Intrinsic Limitations
Haitong Luo | Fali Wang | Weiyao Zhang | Xianren Zhang | Zhiwei Zhang | Tianxiang Zhao | Minhua Lin | Jiahao Zhang | Hui Liu | Xianfeng Tang | Qi He | Suhang Wang | Xuying Meng | Yujun Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Haitong Luo | Fali Wang | Weiyao Zhang | Xianren Zhang | Zhiwei Zhang | Tianxiang Zhao | Minhua Lin | Jiahao Zhang | Hui Liu | Xianfeng Tang | Qi He | Suhang Wang | Xuying Meng | Yujun Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Large language models (LLMs) have made progress in knowledge-intensive tasks, reasoning and planning, and collaborative problem solving, yet they exhibit intrinsic limitations such as knowledge cutoff, single-threaded reasoning that hinders finer-grained branch and aggregation, and rigid collaboration mechanisms that struggle to coordinate specialized capabilities. Graphs, with their ability to represent relational knowledge and complex dependencies, offer a natural means to address these limitations: they provide structured, high-density knowledge for augmenting or correcting LLMs’ generation; enable revisitable inference by organizing intermediate steps as graphs; and support dynamic coordination among experts or agents in collaborative settings. Motivated by these developments, we present the first systematic survey of graph-assisted LLMs from the perspective of how graph structures mitigate LLMs’ limitations. We introduce a taxonomy spanning *Graph-Assisted Knowledge Augmentation*, *Graph-Assisted Reasoning and Planning*, and *Graph-Assisted LLM Collaboration*, and analyze representative methods, summarize common design patterns, and outline open challenges and future directions for advancing LLMs with graph-based enhancements. The collected papers are available in [link here](https://github.com/FairyFali/Graph4LLM-Survey).
Search
Fix author
Co-authors
- Xianfeng Tang 2
- Suhang Wang 2
- Yingqian Cui 1
- Zhenwei Dai 1
- Yue Dong 1
- Benoit Dumoulin 1
- Wei Fan 1
- Yu Fu 1
- Pengfei He 1
- Bing He 1
- Qi He 1
- Sumit Keswani 1
- Xuejing Lei 1
- Gaotang Li 1
- Xiao Lin 1
- Minhua Lin 1
- Zhining Liu 1
- Chen Luo 1
- Haitong Luo 1
- Xuying Meng 1
- Penghao Ouyang 1
- Xiao Pan 1
- Vishwa Pardeshi 1
- Zhan Shi 1
- Jiliang Tang 1
- Hanghang Tong 1
- Josef Valvoda 1
- Tianyi Wang 1
- Fali Wang 1
- Yue Xing 1
- Ze Yang 1
- Jingying Zeng 1
- Xin Zhang 1
- Weiyao Zhang 1
- Xianren Zhang 1
- Zhiwei Zhang 1
- Jiahao Zhang 1
- Yujun Zhang 1
- Huijun Zhao 1
- Tianxiang Zhao 1