Yongkang Xiao
2026
RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs in Medicine
Jiatan Huang | Mingchen Li | Zonghai Yao | Dawei Li | Yuxin Zhang | Zhichao Yang | Yongkang Xiao | Feiyun Ouyang | Xiaohan Li | Shuo Han | Hong yu
Findings of the Association for Computational Linguistics: ACL 2026
Jiatan Huang | Mingchen Li | Zonghai Yao | Dawei Li | Yuxin Zhang | Zhichao Yang | Yongkang Xiao | Feiyun Ouyang | Xiaohan Li | Shuo Han | Hong yu
Findings of the Association for Computational Linguistics: ACL 2026
Answering complex real-world questions in the medical domain often requires accurate retrieval from medical Textual Knowledge Graphs (medical TKGs), as the relational path information from TKGs could enhance the inference ability of Large Language Models (LLMs). However, the main bottlenecks lie in the scarcity of existing medical TKGs, the limited expressiveness of their topological structures, and the lack of comprehensive evaluations of current retrievers for medical TKGs. To address these challenges, we first develop a dataset for LLMs Complex Reasoning over medical Textual Knowledge Graphs (RiTeK), covering a broad range of topological structures. Specifically, we synthesize realistic user queries integrating diverse topological structures, relational information, and complex textual descriptions. We conduct a rigorous medical expert evaluation process to assess and validate the quality of our synthesized queries. RiTeK also serves as a comprehensive benchmark dataset for evaluating the capabilities of retrieval systems built upon LLMs. By assessing 11 representative retrievers on this benchmark, we observe that existing methods struggle to perform well, revealing notable limitations in current LLM-driven retrieval approaches. These findings highlight the pressing need for more effective retrieval systems tailored for semi-structured data in the medical domain.
2025
The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit
Huixue Zhou | Hengrui Gu | Zaifu Zhan | Xi Liu | Kaixiong Zhou | Yongkang Xiao | Mingfu Liang | Srinivas Prasad Govindan | Piyush Chawla | Jiyan Yang | Xiangfei Meng | Huayu Li | Buyun Zhang | Liang Luo | Wen-Yen Chen | Yiping Han | Bo Long | Rui Zhang | Tianlong Chen
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Huixue Zhou | Hengrui Gu | Zaifu Zhan | Xi Liu | Kaixiong Zhou | Yongkang Xiao | Mingfu Liang | Srinivas Prasad Govindan | Piyush Chawla | Jiyan Yang | Xiangfei Meng | Huayu Li | Buyun Zhang | Liang Luo | Wen-Yen Chen | Yiping Han | Bo Long | Rui Zhang | Tianlong Chen
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
The deployment of Large Language Models (LLMs) in recommender systems for Click-Through Rate (CTR) prediction requires a careful balance between computational efficiency and predictive accuracy. This paper introduces OptiRAG-Rec, a comprehensive framework that integrates Retrieval-Augmented Generation (RAG) with a novel multi-head early exit architecture to address both challenges. By leveraging Graph Convolutional Networks (GCNs) as efficient retrieval mechanisms, the framework significantly reduces data retrieval times while maintaining high model performance. Additionally, the multi-head early exit strategy dynamically terminates inference based on real-time predictive confidence assessments, enhancing responsiveness without sacrificing accuracy. Experimental results demonstrate that OptiRAG-Rec reduces computation time while preserving the precision required for reliable recommendations, establishing a new benchmark for efficient and accurate LLM deployment in recommendation.
DrKGC: Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion across General and Biomedical Domains
Yongkang Xiao | Sinian Zhang | Yi Dai | Huixue Zhou | Jue Hou | Jie Ding | Rui Zhang
Findings of the Association for Computational Linguistics: EMNLP 2025
Yongkang Xiao | Sinian Zhang | Yi Dai | Huixue Zhou | Jue Hou | Jie Ding | Rui Zhang
Findings of the Association for Computational Linguistics: EMNLP 2025
Knowledge graph completion (KGC) aims to predict missing triples in knowledge graphs (KGs) by leveraging existing triples and textual information. Recently, generative large language models (LLMs) have been increasingly employed for graph tasks. However, current approaches typically encode graph context in textual form, which fails to fully exploit the potential of LLMs for perceiving and reasoning about graph structures. To address this limitation, we propose DrKGC (Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion). DrKGC employs a flexible lightweight model training strategy to learn structural embeddings and logical rules within the KG. It then leverages a novel bottom-up graph retrieval method to extract a subgraph for each query guided by the learned rules. Finally, a graph convolutional network (GCN) adapter uses the retrieved subgraph to enhance the structural embeddings, which are then integrated into the prompt for effective LLM fine-tuning. Experimental results on two general domain benchmark datasets and two biomedical datasets demonstrate the superior performance of DrKGC. Furthermore, a realistic case study in the biomedical domain highlights its interpretability and practical utility.
Search
Fix author
Co-authors
- Rui Zhang 2
- Huixue Zhou 2
- Piyush Chawla 1
- Wen-Yen Chen 1
- Tianlong Chen 1
- Yi Dai 1
- Jie Ding 1
- Srinivas Prasad Govindan 1
- Hengrui Gu 1
- Yiping Han 1
- Shuo Han 1
- Jue Hou 1
- Jiatan Huang 1
- Huayu Li 1
- Mingchen Li 1
- Dawei Li 1
- Xiaohan Li 1
- Mingfu Liang 1
- Xi Liu 1
- Bo Long 1
- Liang Luo 1
- Xiangfei Meng 1
- Feiyun Ouyang 1
- Jiyan Yang 1
- Zhichao Yang 1
- Zonghai Yao 1
- Hong Yu 1
- Zaifu Zhan 1
- Buyun Zhang 1
- Sinian Zhang 1
- Yuxin Zhang 1
- Kaixiong Zhou 1