Controllable Pareto Trade-off between Fairness and Accuracy

Yongkang Du, Jieyu Zhao, Yijun Yang, Tianyi Zhou


Abstract
The fairness-accuracy trade-off is a key challenge in NLP tasks. Current work focuses on finding a single optimal solution to balance the two objectives, which is limited considering the diverse solutions on the Pareto front.This work intends to provide controllable trade-offs according to the user’s preference of the two objectives, which is defined as a reference vector. To achieve this goal, we apply multi-objective optimization (MOO), which can find solutions from various regions of the Pareto front. However, it is challenging to precisely control the trade-off due to the stochasticity of the training process and the high dimensional gradient vectors.Thus, we propose Controllable Pareto Trade-off (CPT) that can effectively train models to perform different trade-offs according to users’ preferences.CPT 1) stabilizes the fairness update with a moving average of stochastic gradients to determine the update direction, and 2) prunes the gradients by only keeping the gradients of the critical parameters. We evaluate CPT on hate speech detection and occupation classification tasks. Experiments show that CPT can achieve a higher-quality set of solutions on the Pareto front than the baseline methods. It also exhibits better controllability and can precisely follow the human-defined reference vectors.
Anthology ID:
2026.trustnlp-main.8
Volume:
Proceedings of the 6th Workshop on Trustworthy NLP (TrustNLP 2026)
Month:
July
Year:
2026
Address:
San Diego, California
Editors:
Kai-Wei Chang, Ninareh Mehrabi, Satyapriya Krishna, Anubrata Das, Jwala Dhamala, Yang Trista Cao, Tharindu Kumarage, Anil Ramakrishna, Christos Christodoulopoulos, Yixin Wan, Aram Galystan, Anoop Kumar, Rahul Gupta
Venues:
TrustNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
108–120
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.trustnlp-main.8/
DOI:
Bibkey:
Cite (ACL):
Yongkang Du, Jieyu Zhao, Yijun Yang, and Tianyi Zhou. 2026. Controllable Pareto Trade-off between Fairness and Accuracy. In Proceedings of the 6th Workshop on Trustworthy NLP (TrustNLP 2026), pages 108–120, San Diego, California. Association for Computational Linguistics.
Cite (Informal):
Controllable Pareto Trade-off between Fairness and Accuracy (Du et al., TrustNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.trustnlp-main.8.pdf