Jack W. Stokes
2025
Group Preference Alignment: Customizing LLM Responses from In-Situ Conversations Only When Needed
Ishani Mondal
|
Jack W. Stokes
|
Sujay Kumar Jauhar
|
Longqi Yang
|
Mengting Wan
|
Xiaofeng Xu
|
Xia Song
|
Jordan Lee Boyd-Graber
|
Jennifer Neville
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
LLMs often fail to meet specialized needs of distinct user groups due to their one-size-fits-all approach, and there is limited understanding of what personalization each group expects.To address this, we propose GPA a group-aware personalization framework that captures context-specific preference variations and steers LLMs accordingly.Our approach involves: (1) Group-Aware Preference Extraction, which distills divergent preferences from real-world conversation logs into interpretable rubrics, and (2) Tailored Response Generation, using (a) GPA-CT, which adapts responses using learnt rubrics, and (b) GPA-FT, which finetunes models using rubric-guided synthetic data.Automatic and Human evaluations confirm that GPA improves group alignment without compromising perfomance on standard instruction-following benchmarks.
Search
Fix author
Co-authors
- Jordan Lee Boyd-Graber 1
- Sujay Kumar Jauhar 1
- Ishani Mondal 1
- Jennifer Neville 1
- Xia Song 1
- show all...