Boglarka Nyul
2026
Real Men are Tough: Evaluating Gender Bias and Sensitivity to Masculinity Norms in LLMs
Elisa Leonardelli | Camilla Casula | Boglarka Nyul | Sara Tonelli
Findings of the Association for Computational Linguistics: ACL 2026
Elisa Leonardelli | Camilla Casula | Boglarka Nyul | Sara Tonelli
Findings of the Association for Computational Linguistics: ACL 2026
Large language models (LLMs) are known to exhibit gender bias, yet most evaluations focus on downstream stereotypes rather than the normative frameworks that shape model inference. We investigate whether LLMs rely on traditional masculinity norms (e.g. "real men are tough") as latent priors in gender-biased inference. We ground our evaluation in the Male Role Norms Inventory (MRNI), a validated psychological framework of prescriptive male role norms.Anchored in MRNI items, we probe models using two complementary approaches: (i) explicit Likert-style agreement with masculinity norms, and (ii) a newly crafted English-Italian scenario-based inference dataset (MRNI-BB), in which gender information and evidential support are systematically varied. Across models, explicit endorsement of masculinity norms is generally low. In contrast, in scenario-based inference tasks, models systematically attribute MRNI-aligned behaviors to male agents, even when evidence is ambiguous or absent. This effect disappears when gender markers are removed, suggesting that masculinity norms are treated as gender-specific expectations about male agents. Increasing model scale reduces explicit norm endorsement but is associated with stronger male-directed bias under uncertainty.