Haoran Luo
Other people with similar names: Haoran Luo, Haoran Luo
Unverified author pages with similar names: Haoran Luo
2026
MUR: Momentum Uncertainty guided Reasoning for Large Language Models
Hang Yan | Fangzhi Xu | Rongman Xu | Yifei Li | Jian Zhang | Haoran Luo | Xiaobao Wu | Anh Tuan Luu | Haiteng Zhao | Qika Lin | Jun Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Hang Yan | Fangzhi Xu | Rongman Xu | Yifei Li | Jian Zhang | Haoran Luo | Xiaobao Wu | Anh Tuan Luu | Haiteng Zhao | Qika Lin | Jun Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Large Language Models (LLMs) have achieved impressive performance on reasoning-intensive tasks, yet optimizing their reasoning efficiency remains an open challenge. While Test-Time Scaling (TTS) improves reasoning quality, it often leads to overthinking—wasting tokens on redundant computations. This work investigates how to efficiently and adaptively guide LLM TTS without additional training. Inspired by the concept of momentum in physics, we propose Momentum Uncertainty-guided Reasoning (MUR), which dynamically allocates thinking budgets to critical reasoning steps by tracking and aggregating step-wise uncertainty over time. To support flexible inference-time control, we introduce -control, a simple mechanism that tunes the reasoning budget via a single hyperparameter. We provide in-depth theoretical proof to support the superiority of MUR in terms of stability and biases. MUR is comprehensively evaluated against various TTS methods across four challenging benchmarks (MATH-500, AIME24, AIME25, and GPQA-diamond) using different sizes of recent Qwen3 models (1.7B, 4B, and 8B). Results demonstrate that MUR reduces computation by over 45% on average while improving accuracy by 0.33–3.46%.
TemplateRL: Structured Template-Guided Reinforcement Learning for LLM Reasoning
Jinyang Wu | Chonghua Liao | Mingkuan Feng | Shuai Zhang | Zhengqi Wen | Haoran Luo | Ling Yang | Huazhe Xu | Jianhua Tao
Findings of the Association for Computational Linguistics: ACL 2026
Jinyang Wu | Chonghua Liao | Mingkuan Feng | Shuai Zhang | Zhengqi Wen | Haoran Luo | Ling Yang | Huazhe Xu | Jianhua Tao
Findings of the Association for Computational Linguistics: ACL 2026
Reinforcement learning (RL) has emerged as an effective paradigm for enhancing model reasoning. However, existing RL methods like GRPO often rely on unstructured self-sampling to fit scalar rewards, often producing inefficient rollouts that fail to capture transferable problem-solving strategies. To address these limitations, we propose **TemplateRL**, a structured template-guided RL framework that augments policy optimization with explicit template guidance. Our approach first constructs a problem-solving template library via MCTS on a small seed set, then seamlessly integrates this high-level structured guidance into RL training. By guiding rollout generation to align with proven template structures, TemplateRL significantly improves high-quality trajectory hit rates while reducing ineffective exploration. This structure-guided design steers the policy toward validated strategic patterns, stabilizing training dynamics, and enhancing RL sampling efficiency. Notably, the explicit template library is interpretable, editable, and supports online updates-enabling continuous updates during both training and inference. Extensive experiments demonstrate that TemplateRL outperforms GRPO by 99% on AIME and 41% on AMC, with superior stability on weak models and remarkable cross-domain generalization, highlighting its potential for broader tasks.
LexGenius: An Expert-Level Benchmark for Large Language Models in Legal General Intelligence
Wenjin Liu | Haoran Luo | Xin Feng | Xiang Ji | Lijuan Zhou | Rui Mao | Jiapu Wang | Shirui Pan | Erik Cambria
Findings of the Association for Computational Linguistics: ACL 2026
Wenjin Liu | Haoran Luo | Xin Feng | Xiang Ji | Lijuan Zhou | Rui Mao | Jiapu Wang | Shirui Pan | Erik Cambria
Findings of the Association for Computational Linguistics: ACL 2026
Legal general intelligence (GI) refers to artificial intelligence (AI) that encompasses legal understanding, reasoning, and decision-making, simulating the expertise of legal experts across domains. However, existing benchmarks are result-oriented and fail to systematically evaluate the legal intelligence of large language models (LLMs), hindering the development of legal GI. To address this, we propose LexGenius, an expert-level Chinese legal benchmark for evaluating legal GI in LLMs. It follows a Dimension-Task-Ability framework, covering seven dimensions, eleven tasks, and twenty abilities. We use the recent legal cases and exam questions to create multiple-choice questions with a combination of manual and LLM reviews to reduce data leakage risks, ensuring accuracy and reliability through multiple rounds of checks. We evaluate 12 state-of-the-art LLMs using LexGenius and conduct an in-depth analysis. We find significant disparities across legal intelligence abilities for LLMs, with even the best LLMs lagging behind human legal professionals. We believe LexGenius can assess the legal intelligence abilities of LLMs and enhance legal GI development.Our project is available at https://github.com/QwenQKing/LexGenius.
MAXS: Meta-Adaptive Exploration with LLM Agents
Jian Zhang | Zhiyuan Wang | Zhangqi Wang | Yu He | Haoran Luo | li Yuan | Lingling Zhang | Rui Mao | Qika Lin | Jun Liu
Findings of the Association for Computational Linguistics: ACL 2026
Jian Zhang | Zhiyuan Wang | Zhangqi Wang | Yu He | Haoran Luo | li Yuan | Lingling Zhang | Rui Mao | Qika Lin | Jun Liu
Findings of the Association for Computational Linguistics: ACL 2026
Large Language Model (LLM) Agents exhibit inherent reasoning abilities through the collaboration of multiple tools.However, during agent inference, existing methods often suffer from (i) locally myopic generation, due to the absence of lookahead, and (ii) trajectory instability, where minor early errors can escalate into divergent reasoning paths. These issues make it difficult to balance global effectiveness and computational efficiency. To address these two issues, we propose meta-adaptive exploration with LLM agents (MAXS)[<https://github.com/exoskeletonzj/MAXS>], a meta-adaptive reasoning framework based on LLM Agents that flexibly integrates tool execution and reasoning planning. MAXS employs a lookahead strategy to extend reasoning paths a few steps ahead, estimating the advantage value of tool usage, and combines step consistency variance and inter-step trend slopes to jointly select stable, consistent, and high-value reasoning steps. Additionally, we introduce a trajectory convergence mechanism that controls computational cost by halting further rollouts once path consistency is achieved, enabling a balance between resource efficiency and global effectiveness in multi-tool reasoning. We conduct extensive empirical studies across three base models (MiMo-VL-7B, Qwen2.5-VL-7B, Qwen2.5-VL-32B) and five datasets, demonstrating that MAXS consistently outperforms existing methods in both performance and inference efficiency. Further analysis confirms the effectiveness of our lookahead strategy and tool usage.
Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning
Wenjin Liu | Haoran Luo | Xueyuan Lin | Haoming Liu | Tiesunlong Shen | Jiapu Wang | Rui Mao | Erik Cambria
Findings of the Association for Computational Linguistics: ACL 2026
Wenjin Liu | Haoran Luo | Xueyuan Lin | Haoming Liu | Tiesunlong Shen | Jiapu Wang | Rui Mao | Erik Cambria
Findings of the Association for Computational Linguistics: ACL 2026
Recently, various excellent and powerful large language models (LLMs) have been utilized to solve a wide range of human problems. However, when faced with complex problems, most users are often unable to provide accurate and effective prompts to interact with LLMs, thus limiting their performance. To address this challenge, we propose Prompt-R1, an end-to-end reinforcement learning framework that utilizes a small-scale LLM (as agent) to collaborate with large-scale LLMs (as environment), replacing users to interact better. This collaboration is presented as a multi-turn interaction, where the small-scale LLM thinks and generates prompts, and the large-scale LLM performs complex reasoning. A double-constrained reward is designed to optimize correctness and quality of generation. Prompt-R1 provides a plug-and-play framework that supports both inference and training with various large-scale LLMs. Experimental results on twelve datasets show that Prompt-R1 significantly outperforms baseline LLMs across various tasks.Our code is available at https://github.com/QwenQKing/Prompt-R1.
Beyond Examples: Towards Automated Thought-level In-Context Reasoning for Large Language Models
Jinyang Wu | Mingkuan Feng | Shuai Zhang | Feihu Che | Zhengqi Wen | Chonghua Liao | Ling Yang | Haoran Luo | Zheng Lian | Jianhua Tao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Jinyang Wu | Mingkuan Feng | Shuai Zhang | Feihu Che | Zhengqi Wen | Chonghua Liao | Ling Yang | Haoran Luo | Zheng Lian | Jianhua Tao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
In-context learning (ICL) leverages demonstrations to enhance the performance of large language models (LLMs). However, traditional ICL struggles with complex reasoning mainly due to superficial, example-level implicit imitation. To address these limitations, we introduce **ThoughtICR**, an automated **Thought**-level **I**n-**C**ontext **R**easoning paradigm that shifts from surface-level examples to more guidance-oriented thought patterns. Specifically, we first define atomic reasoning actions and construct thought patterns on small-scale seed data using Monte Carlo Tree Search (MCTS). During inference, we dynamically select appropriate thought patterns based on target problem attributes, providing explicit guidance for model reasoning. Thanks to its automated and strategic design, our method enables seamless plug-and-play integration with various post-training techniques. Experimental results demonstrate that our method improves performance across different model sizes and generalizes effectively across reasoning domains. Using only small-scale seed data, we achieve 80.6% accuracy on MATH and 62.5% on AMC, surpassing GPT-4o’s 77.2% and 57.5%, respectively. Moreover, compared to test-time scaling methods, our approach reduces computational costs by over 10. Our code is available at https://github.com/jinyangwu/ThoughtICR.
2025
A Cognitive Writing Perspective for Constrained Long-Form Text Generation
Kaiyang Wan | Honglin Mu | Rui Hao | Haoran Luo | Tianle Gu | Xiuying Chen
Findings of the Association for Computational Linguistics: ACL 2025
Kaiyang Wan | Honglin Mu | Rui Hao | Haoran Luo | Tianle Gu | Xiuying Chen
Findings of the Association for Computational Linguistics: ACL 2025
Like humans, Large Language Models (LLMs) struggle to generate high-quality long-form text that adheres to strict requirements in a single pass. This challenge is unsurprising, as successful human writing, according to the Cognitive Writing Theory, is a complex cognitive process involving iterative planning, translating, reviewing, and monitoring. Motivated by these cognitive principles, we aim to equip LLMs with human-like cognitive writing capabilities through CogWriter, a novel training-free framework that transforms LLM constrained long-form text generation into a systematic cognitive writing paradigm. Our framework consists of two key modules: (1) a Planning Agent that performs hierarchical planning to decompose the task, and (2) multiple Generation Agents that execute these plans in parallel. The system maintains quality via continuous monitoring and reviewing mechanisms, which evaluate outputs against specified requirements and trigger necessary revisions. CogWriter demonstrates exceptional performance on LongGenBench, a benchmark for complex constrained long-form text generation. Even when using Qwen-2.5-14B as its backbone, CogWriter surpasses GPT-4o by 22% in complex instruction completion accuracy while reliably generating texts exceeding 10,000 words. We hope this cognitive science-inspired approach provides a paradigm for LLM writing advancements: https://anonymous.4open.science/r/CogWriter-8DFE.
Complex Numerical Reasoning with Numerical Semantic Pre-training Framework
Jun Zhang | Haihong E | Tianyi Hu | Yifan Zhu | Meina Song | Haoran Luo
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Jun Zhang | Haihong E | Tianyi Hu | Yifan Zhu | Meina Song | Haoran Luo
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Multi-hop complex reasoning over incomplete knowledge graphs (KGs) has been extensively studied, but research on numerical knowledge graphs (NKGs) remains relatively limited. Recent approaches focus on separately encoding entities and numerical values, using neural networks to process query encodings for reasoning. However, in complex multi-hop reasoning tasks, numerical values are not merely symbols, and they carry specific semantics and logical relationships that must be accurately represented. The CNR-NST framework can perform binary operations on numerical attributes in NKGs, enabling it to infer new numerical attributes from existing knowledge. Our approach effectively handles up to 102 types of complex numerical reasoning queries. On three public datasets, CNR-NST demonstrates SOTA performance in complex numerical queries, achieving an average improvement of over 40% compared to existing methods. Notably, this work expands the query types for complex multi-hop numerical reasoning and introduces a new evaluation metric for numerical answers, which has been validated through comprehensive experiments.
2024
ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models
Haoran Luo | Haihong E | Zichen Tang | Shiyao Peng | Yikai Guo | Wentai Zhang | Chenghao Ma | Guanting Dong | Meina Song | Wei Lin | Yifan Zhu | Anh Tuan Luu
Findings of the Association for Computational Linguistics: ACL 2024
Haoran Luo | Haihong E | Zichen Tang | Shiyao Peng | Yikai Guo | Wentai Zhang | Chenghao Ma | Guanting Dong | Meina Song | Wei Lin | Yifan Zhu | Anh Tuan Luu
Findings of the Association for Computational Linguistics: ACL 2024
Knowledge Base Question Answering (KBQA) aims to answer natural language questions over large-scale knowledge bases (KBs), which can be summarized into two crucial steps: knowledge retrieval and semantic parsing. However, three core challenges remain: inefficient knowledge retrieval, mistakes of retrieval adversely impacting semantic parsing, and the complexity of previous KBQA methods. To tackle these challenges, we introduce ChatKBQA, a novel and simple generate-then-retrieve KBQA framework, which proposes first generating the logical form with fine-tuned LLMs, then retrieving and replacing entities and relations with an unsupervised retrieval method, to improve both generation and retrieval more directly. Experimental results show that ChatKBQA achieves new state-of-the-art performance on standard KBQA datasets, WebQSP, and CWQ. This work can also be regarded as a new paradigm for combining LLMs with knowledge graphs (KGs) for interpretable and knowledge-required question answering.
2023
HAHE: Hierarchical Attention for Hyper-Relational Knowledge Graphs in Global and Local Level
Haoran Luo | Haihong E | Yuhao Yang | Yikai Guo | Mingzhi Sun | Tianyu Yao | Zichen Tang | Kaiyang Wan | Meina Song | Wei Lin
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Haoran Luo | Haihong E | Yuhao Yang | Yikai Guo | Mingzhi Sun | Tianyu Yao | Zichen Tang | Kaiyang Wan | Meina Song | Wei Lin
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Link Prediction on Hyper-relational Knowledge Graphs (HKG) is a worthwhile endeavor. HKG consists of hyper-relational facts (H-Facts), composed of a main triple and several auxiliary attribute-value qualifiers, which can effectively represent factually comprehensive information. The internal structure of HKG can be represented as a hypergraph-based representation globally and a semantic sequence-based representation locally. However, existing research seldom simultaneously models the graphical and sequential structure of HKGs, limiting HKGs’ representation. To overcome this limitation, we propose a novel Hierarchical Attention model for HKG Embedding (HAHE), including global-level and local-level attention. The global-level attention can model the graphical structure of HKG using hypergraph dual-attention layers, while the local-level attention can learn the sequential structure inside H-Facts via heterogeneous self-attention layers. Experiment results indicate that HAHE achieves state-of-the-art performance in link prediction tasks on HKG standard datasets. In addition, HAHE addresses the issue of HKG multi-position prediction for the first time, increasing the applicability of the HKG link prediction task. Our code is publicly available.
Search
Fix author
Co-authors
- Haihong E 3
- Rui Mao 3
- Meina Song 3
- Erik Cambria 2
- Mingkuan Feng 2
- Yikai Guo 2
- Chonghua Liao 2
- Qika Lin 2
- Jun Liu 2
- Wenjin Liu 2
- Zichen Tang 2
- Jianhua Tao 2
- Luu Anh Tuan 2
- Kaiyang Wan 2
- Jiapu Wang 2
- Zhengqi Wen 2
- Jinyang Wu 2
- Ling Yang 2
- Jian Zhang 2
- Shuai Zhang 2
- Yifan Zhu 2
- Feihu Che 1
- Xiuying Chen 1
- Guanting Dong 1
- Xin Feng 1
- Tianle Gu 1
- Rui Hao 1
- Yu He 1
- Tianyi Hu 1
- Xiang Ji 1
- Yifei Li 1
- Zheng Lian 1
- Wei Lin 1
- Xueyuan Lin 1
- Wei Lin 1
- Haoming Liu 1
- Chenghao Ma 1
- Honglin Mu 1
- Shirui Pan 1
- Shiyao Peng 1
- Tiesunlong Shen 1
- Mingzhi Sun 1
- Zhiyuan Wang 1
- Zhangqi Wang 1
- Xiaobao Wu 1
- Fangzhi Xu 1
- Rongman Xu 1
- Huazhe Xu 1
- Hang Yan 1
- Yuhao Yang 1
- Tianyu Yao 1
- Li Yuan 1
- Lingling Zhang 1
- Wentai Zhang 1
- Jun Zhang 1
- Haiteng Zhao 1
- Lijuan Zhou 1