Liangwei Yang
2026
PersonaAgent: Bridging Memory and Action for Personalized LLM Agents
Weizhi Zhang | Xinyang Zhang | Chenwei Zhang | Liangwei Yang | Jingbo Shang | Zhepei Wei | Henry Peng Zou | Zijie Huang | Zhengyang Wang | Yifan Gao | Xiaoman Pan | Lian Xiong | Jingguo Liu | Philip S. Yu | Xian Li
Findings of the Association for Computational Linguistics: ACL 2026
Weizhi Zhang | Xinyang Zhang | Chenwei Zhang | Liangwei Yang | Jingbo Shang | Zhepei Wei | Henry Peng Zou | Zijie Huang | Zhengyang Wang | Yifan Gao | Xiaoman Pan | Lian Xiong | Jingguo Liu | Philip S. Yu | Xian Li
Findings of the Association for Computational Linguistics: ACL 2026
Large Language Model (LLM) empowered agents have recently emerged as advanced paradigms that exhibit impressive capabilities in a wide range of domains and tasks. Despite their potential, current LLM agents often adopt a one-size-fits-all approach, lacking the flexibility to respond to users’ varying needs and preferences. This limitation motivates us to develop PersonaAgent, the first personalized LLM agent framework designed to address versatile personalization tasks. Specifically, PersonaAgent integrates two complementary components: a personalized memory module that includes episodic and semantic memory mechanisms; a personalized action module that enables the agent to perform tool actions tailored to the user. At the core, the persona (defined as unique system prompt for each user) functions as an intermediary: it leverages insights from personalized memory to control agent actions, while the outcomes of these actions in turn refine the memory. Based on the framework, we propose a test-time user-preference alignment strategy that simulate the latest n interactions to optimize the persona prompt, ensuring real-time user preference alignment through textual loss feedback between simulated and ground-truth responses. Experimental evaluations demonstrate that PersonaAgent significantly outperforms other baseline methods by not only personalizing the action space effectively but also scaling during test-time real-world applications. These results underscore the feasibility and potential of our approach in delivering tailored, dynamic user experiences.
2025
Taxonomy-Guided Zero-Shot Recommendations with LLMs
Yueqing Liang | Liangwei Yang | Chen Wang | Xiongxiao Xu | Philip S. Yu | Kai Shu
Proceedings of the 31st International Conference on Computational Linguistics
Yueqing Liang | Liangwei Yang | Chen Wang | Xiongxiao Xu | Philip S. Yu | Kai Shu
Proceedings of the 31st International Conference on Computational Linguistics
With the emergence of large language models (LLMs) and their ability to perform a variety of tasks, their application in recommender systems (RecSys) has shown promise. However, we are facing significant challenges when deploying LLMs into RecSys, such as limited prompt length, unstructured item information, and un-constrained generation of recommendations, leading to sub-optimal performance. To address these issues, we propose a novel Taxonomy-guided Recommendation (TaxRec) framework to empower LLM with category information in a systematic approach. Specifically, TaxRec features a two-step process: one-time taxonomy categorization and LLM-based recommendation. In the one-time taxonomy categorization phase, we organize and categorize items, ensuring clarity and structure of item information. In the LLM-based recommendation phase, we feed the structured items into LLM prompts, achieving efficient token utilization and controlled feature generation. This enables more accurate, contextually relevant, and zero-shot recommendations without the need for domain-specific fine-tuning. Experimental results demonstrate that TaxRec significantly enhances recommendation quality compared to traditional zero-shot approaches, showcasing its efficacy as a personal recommender with LLMs. Code is available at: https://github.com/yueqingliang1/TaxRec.
LLMInit: A Free Lunch from Large Language Models for Selective Initialization of Recommendation
Weizhi Zhang | Liangwei Yang | Wooseong Yang | Henry Peng Zou | Yuqing Liu | Ke Xu | Sourav Medya | Philip S. Yu
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Weizhi Zhang | Liangwei Yang | Wooseong Yang | Henry Peng Zou | Yuqing Liu | Ke Xu | Sourav Medya | Philip S. Yu
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Collaborative filtering (CF) is widely adopted in industrial recommender systems (RecSys) for modeling user-item interactions across numerous applications, but often struggles with cold-start and data-sparse scenarios. Recent advancements in pre-trained large language models (LLMs) with rich semantic knowledge, offer promising solutions to these challenges. However, deploying LLMs at scale is hindered by their significant computational demands and latency. In this paper, we propose a novel and scalable LLM-RecSys framework, LLMInit, designed to integrate pretrained LLM embeddings into CF models through selective initialization strategies. Specifically, we identify the embedding collapse issue observed when CF models scale and match the large embedding sizes in LLMs and avoid the problem by introducing efficient sampling methods, including, random, uniform, and variance-based selections. Comprehensive experiments conducted on multiple real-world datasets demonstrate that LLMInit significantly improves recommendation performance while maintaining low computational costs, offering a practical and scalable solution for industrial applications. To facilitate industry adoption and promote future research, we provide open-source access to our implementation at https://github.com/DavidZWZ/LLMInit.
PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data
Juntao Tan | Liangwei Yang | Zuxin Liu | Zhiwei Liu | Rithesh R N | Tulika Manoj Awalgaonkar | Jianguo Zhang | Weiran Yao | Ming Zhu | Shirley Kokane | Silvio Savarese | Huan Wang | Caiming Xiong | Shelby Heinecke
Findings of the Association for Computational Linguistics: ACL 2025
Juntao Tan | Liangwei Yang | Zuxin Liu | Zhiwei Liu | Rithesh R N | Tulika Manoj Awalgaonkar | Jianguo Zhang | Weiran Yao | Ming Zhu | Shirley Kokane | Silvio Savarese | Huan Wang | Caiming Xiong | Shelby Heinecke
Findings of the Association for Computational Linguistics: ACL 2025
Personalization is essential for AI assistants, especially in private AI settings where models are expected to interpret users’ personal data (e.g., conversations, app usage) to understand their background, preferences, and social context. However, due to privacy concerns, existing academic research lacks direct access to such data, making benchmarking difficult. To fill this gap, we propose a synthetic data pipeline that generates realistic user profiles and private documents, enabling the creation of PersonaBench—a benchmark for evaluating models’ ability to understand personal information. Using this benchmark, we assess Retrieval-Augmented Generation (RAG) pipelines on personalized questions and find that current models struggle to accurately extract and answer questions even when provided with the full set of user documents, highlighting the need for improved personalization methods.
2024
PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Zhiwei Liu | Weiran Yao | Jianguo Zhang | Rithesh Murthy | Liangwei Yang | Zuxin Liu | Tian Lan | Ming Zhu | Juntao Tan | Shirley Kokane | Thai Hoang | Juan Carlos Niebles | Shelby Heinecke | Huan Wang | Silvio Savarese | Caiming Xiong
Proceedings of the 28th Conference on Computational Natural Language Learning
Zhiwei Liu | Weiran Yao | Jianguo Zhang | Rithesh Murthy | Liangwei Yang | Zuxin Liu | Tian Lan | Ming Zhu | Juntao Tan | Shirley Kokane | Thai Hoang | Juan Carlos Niebles | Shelby Heinecke | Huan Wang | Silvio Savarese | Caiming Xiong
Proceedings of the 28th Conference on Computational Natural Language Learning
We introduce the Principled Reasoning and Acting (PRAct) framework, a novel method for learning and enforcing action principles from trajectory data. Central to our approach is the use of text gradients from a reflection and optimization engine to derive these action principles. To adapt action principles to specific task requirements, we propose a new optimization framework, Reflective Principle Optimization (RPO). After execution, RPO employs a reflector to critique current action principles and an optimizer to update them accordingly.We investigate the RPO framework under two scenarios: Reward-RPO, which uses environmental rewards for reflection, and Self-RPO, which conducts self-reflection without external rewards. Additionally, we developed two RPO methods, RPO-Traj and RPO-Batch, to adapt to different settings.Experimental results across four environments demonstrate that the PRAct agent, leveraging the RPO framework, can effectively learn and apply action principles to enhance performance.
Search
Fix author
Co-authors
- Philip S. Yu 3
- Shelby Heinecke 2
- Shirley Kokane 2
- Zuxin Liu 2
- Silvio Savarese 2
- Juntao Tan 2
- Huan Wang 2
- Caiming Xiong 2
- Weiran Yao 2
- Weizhi Zhang 2
- Jianguo Zhang 2
- Ming Zhu 2
- Henry Peng Zou 2
- Tulika Manoj Awalgaonkar 1
- Yifan Gao 1
- Thai Hoang 1
- Zijie Huang 1
- Tian Lan 1
- Xian Li 1
- Yueqing Liang 1
- Jingguo Liu 1
- Yuqing Liu 1
- Zhiwei Liu 1
- Zhiwei Liu 1
- Sourav Medya 1
- Rithesh Murthy 1
- Juan Carlos Niebles 1
- Xiaoman Pan 1
- Rithesh R N 1
- Jingbo Shang 1
- Kai Shu 1
- Zhengyang Wang 1
- Chen Wang 1
- Zhepei Wei 1
- Lian Xiong 1
- Xiongxiao Xu 1
- Ke Xu 1
- Wooseong Yang 1
- Xinyang Zhang 1
- Chenwei Zhang 1