Emma Spiro

2026

Persona-Assigned Large Language Models Exhibit Human-Like Motivated Reasoning
Saloni Dash | Amélie Reymond | Emma Spiro | Aylin Caliskan
Findings of the Association for Computational Linguistics: ACL 2026

Reasoning in humans is prone to biases due to underlying motivations like identity protection, that undermine rational decision-making and judgment. This motivated reasoning at a collective level can be detrimental to society when debating critical issues such as human-driven climate change or vaccine safety, and can further aggravate political polarization. Prior studies have reported that large language models (LLMs) are also susceptible to human-like cognitive biases, however, the extent to which LLMs selectively reason toward identity-congruent conclusions remains largely unexplored. Here, we investigate whether assigning 8 personas across 4 political and socio-demographic attributes induces motivated reasoning in LLMs. Testing 8 LLMs (open source and proprietary) across two reasoning tasks from human-subject studies — veracity discernment of misinformation headlines and evaluation of numeric scientific evidence — we find that persona-assigned LLMs have up to 9% reduced veracity discernment relative to models without personas. Political personas specifically are up to 90% more likely to correctly evaluate scientific evidence on gun control when the ground truth is congruent with their induced political identity. Prompt-based debiasing methods are largely ineffective at mitigating these effects. Taken together, our empirical findings are the first to suggest that persona-assigned LLMs exhibit human-like motivated reasoning that is hard to mitigate through conventional debiasing prompts — raising concerns of exacerbating identity-congruent reasoning in both LLMs and humans.

2017

pdf bib abs

Non-lexical Features Encode Political Affiliation on Twitter
Rachael Tatman | Leo Stewart | Amandalynne Paullada | Emma Spiro
Proceedings of the Second Workshop on NLP and Computational Social Science

Previous work on classifying Twitter users’ political alignment has mainly focused on lexical and social network features. This study provides evidence that political affiliation is also reflected in features which have been previously overlooked: users’ discourse patterns (proportion of Tweets that are retweets or replies) and their rate of use of capitalization and punctuation. We find robust differences between politically left- and right-leaning communities with respect to these discourse and sub-lexical features, although they are not enough to train a high-accuracy classifier.

Co-authors

Rachael Tatman 1

Venues

Findings1
NLP+CSS1

Fix author