A Trip Towards Fairness: Bias and De-Biasing in Large Language Models

Leonardo Ranaldi, Elena Ruzzetti, Davide Venditti, Dario Onorati, Fabio Zanzotto


Abstract
Cheap-to-Build Very Large-Language Models (CtB-LLMs) with affordable training are emerging as the next big revolution in natural language processing and understanding. These CtB-LLMs are democratizing access to trainable Very Large-Language Models (VLLMs) and, thus, may represent the building blocks of many NLP systems solving downstream tasks. Hence, a little or a large bias in CtB-LLMs may cause huge harm. In this paper, we performed a large investigation of the bias of three families of CtB-LLMs, and we showed that debiasing techniques are effective and usable. Indeed, according to current tests, the LLaMA and the OPT families have an important bias in gender, race, religion, and profession. In contrast to the analysis for other LMMs, we discovered that bias depends not on the number of parameters but on the perplexity. Finally, the debiasing of OPT using LORA reduces bias up to 4.12 points in the normalized stereotype score.
Anthology ID:
2024.starsem-1.30
Volume:
Proceedings of the 13th Joint Conference on Lexical and Computational Semantics (*SEM 2024)
Month:
June
Year:
2024
Address:
Mexico City, Mexico
Editors:
Danushka Bollegala, Vered Shwartz
Venue:
*SEM
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
372–384
Language:
URL:
https://aclanthology.org/2024.starsem-1.30
DOI:
Bibkey:
Cite (ACL):
Leonardo Ranaldi, Elena Ruzzetti, Davide Venditti, Dario Onorati, and Fabio Zanzotto. 2024. A Trip Towards Fairness: Bias and De-Biasing in Large Language Models. In Proceedings of the 13th Joint Conference on Lexical and Computational Semantics (*SEM 2024), pages 372–384, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
A Trip Towards Fairness: Bias and De-Biasing in Large Language Models (Ranaldi et al., *SEM 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-5/2024.starsem-1.30.pdf