Xu Pan
2026
User-Assistant Bias in LLMs
Xu Pan | Jingxuan Fan | Zidi Xiong | Ely Hahami | Jorin Overwiening | Ziqian Xie
Findings of the Association for Computational Linguistics: ACL 2026
Xu Pan | Jingxuan Fan | Zidi Xiong | Ely Hahami | Jorin Overwiening | Ziqian Xie
Findings of the Association for Computational Linguistics: ACL 2026
Modern large language models (LLMs) are typically trained and deployed using structured role tags (e.g. system, user, assistant, tool) that explicitly mark the source of each piece of context. While these tags are essential for instruction following and controllability, asymmetries in the training data associated with different role tags can potentially introduce inductive biases. In this paper, we study this phenomenon by formalizing user–assistant bias, defined as the tendency of an LLM to preferentially rely on information from either the user or assistant role when they provide incompatible information about the same entity in the context history. We introduce a task-agnostic benchmark UserAssist and evaluate such bias in 52 frontier models. We observe that most of the instruction-tuned models exhibit strong user bias, whereas base and reasoning models are close to neutral. Using controlled fine-tuning experiments, we isolate which post-training recipes drive the observed user–assistant bias. We find that human-preference alignment amplifies user bias, while reasoning fine-tuning reduces it. Finally, we show that user–assistant bias can be bidirectionally controlled via direct preference optimization (DPO) on UserAssist-train, and that the resulting bias reliably generalizes to two realistic multi-turn debate datasets spanning philosophical opinions and natural argumentative exchanges on factual/policy topics. These results reveal an underexplored consequence of role-tagged training and provide a principled framework to diagnose and control tag-induced biases in modern LLMs.
2020
Multi-Instance Multi-Label Learning Networks for Aspect-Category Sentiment Analysis
Yuncong Li | Cunxiang Yin | Sheng-hua Zhong | Xu Pan
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Yuncong Li | Cunxiang Yin | Sheng-hua Zhong | Xu Pan
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Aspect-category sentiment analysis (ACSA) aims to predict sentiment polarities of sentences with respect to given aspect categories. To detect the sentiment toward a particular aspect category in a sentence, most previous methods first generate an aspect category-specific sentence representation for the aspect category, then predict the sentiment polarity based on the representation. These methods ignore the fact that the sentiment of an aspect category mentioned in a sentence is an aggregation of the sentiments of the words indicating the aspect category in the sentence, which leads to suboptimal performance. In this paper, we propose a Multi-Instance Multi-Label Learning Network for Aspect-Category sentiment analysis (AC-MIMLLN), which treats sentences as bags, words as instances, and the words indicating an aspect category as the key instances of the aspect category. Given a sentence and the aspect categories mentioned in the sentence, AC-MIMLLN first predicts the sentiments of the instances, then finds the key instances for the aspect categories, finally obtains the sentiments of the sentence toward the aspect categories by aggregating the key instance sentiments. Experimental results on three public datasets demonstrate the effectiveness of AC-MIMLLN.
A Joint Model for Aspect-Category Sentiment Analysis with Shared Sentiment Prediction Layer
Yuncong Li | Zhe Yang | Cunxiang Yin | Xu Pan | Lunan Cui | Qiang Huang | Ting Wei
Proceedings of the 19th Chinese National Conference on Computational Linguistics
Yuncong Li | Zhe Yang | Cunxiang Yin | Xu Pan | Lunan Cui | Qiang Huang | Ting Wei
Proceedings of the 19th Chinese National Conference on Computational Linguistics
Aspect-category sentiment analysis (ACSA) aims to predict the aspect categories mentioned in texts and their corresponding sentiment polarities. Some joint models have been proposed to address this task. Given a text, these joint models detect all the aspect categories mentioned in the text and predict the sentiment polarities toward them at once. Although these joint models obtain promising performances, they train separate parameters for each aspect category and therefore suffer from data deficiency of some aspect categories. To solve this problem, we propose a novel joint model which contains a shared sentiment prediction layer. The shared sentiment prediction layer transfers sentiment knowledge between aspect categories and alleviates the problem caused by data deficiency. Experiments conducted on SemEval-2016 Datasets demonstrate the effectiveness of our model.