A Novel Interpretability Metric for Explaining Bias in Language Models: Applications on Multilingual Models from Southeast Asia - ACL Anthology

This is an internal, incomplete preview of a proposed change to the ACL Anthology. For efficiency reasons, we don't generate MODS or Endnote formats, and the preview may be incomplete in other ways, or contain mistakes. Do not treat this content as an official publication.

A Novel Interpretability Metric for Explaining Bias in Language Models: Applications on Multilingual Models from Southeast Asia

Lance Calvin Lim Gamboa, Mark Lee

Anthology ID:: 2024.paclic-1.29
Volume:: Proceedings of the 38th Pacific Asia Conference on Language, Information and Computation
Month:: December
Year:: 2024
Address:: Tokyo, Japan
Editors:: Nathaniel Oco, Shirley N. Dita, Ariane Macalinga Borlongan, Jong-Bok Kim
Venue:: PACLIC
SIG:
Publisher:: Tokyo University of Foreign Studies
Note:
Pages:: 296–305
Language:
URL:: https://preview.aclanthology.org/fix-sig-urls/2024.paclic-1.29/
DOI:
Bibkey:
Cite (ACL):: Lance Calvin Lim Gamboa and Mark Lee. 2024. A Novel Interpretability Metric for Explaining Bias in Language Models: Applications on Multilingual Models from Southeast Asia. In Proceedings of the 38th Pacific Asia Conference on Language, Information and Computation, pages 296–305, Tokyo, Japan. Tokyo University of Foreign Studies.
Cite (Informal):: A Novel Interpretability Metric for Explaining Bias in Language Models: Applications on Multilingual Models from Southeast Asia (Gamboa & Lee, PACLIC 2024)
Copy Citation:
PDF:: https://preview.aclanthology.org/fix-sig-urls/2024.paclic-1.29.pdf

PDF Cite Search Fix data