VoiceBBQ: Investigating Effect of Content and Acoustics in Social Bias of Spoken Language Model

Junhyuk Choi; Ro-hoon Oh; Jihwan Seol; Bugeun Kim

VoiceBBQ: Investigating Effect of Content and Acoustics in Social Bias of Spoken Language Model

Junhyuk Choi, Ro-hoon Oh, Jihwan Seol, Bugeun Kim

Abstract

We introduce VoiceBBQ, a spoken extension of the BBQ (Bias Benchmark for Question answering) - a dataset that measures social bias by presenting ambiguous or disambiguated contexts followed by questions that may elicit stereotypical responses. Due to the nature of speech modality, social bias in Spoken Language Models (SLMs) can emerge from two distinct sources: 1) content aspect and 2) acoustic aspect. The dataset converts every BBQ context into controlled voice conditions, enabling per-axis accuracy, bias, and consistency scores that remain comparable to the original text benchmark. Using VoiceBBQ, we evaluate two SLMs—LLaMA-Omni and Qwen2-Audio—and observe architectural contrasts: LLaMA-Omni retains strong acoustic sensitivity, amplifying gender and accent bias, whereas Qwen2-Audio substantially dampens these cues while preserving content fidelity. VoiceBBQ thus provides a compact, drop-in testbed for jointly diagnosing content and acoustic bias across spoken language models.

Anthology ID:: 2025.emnlp-main.1461
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 28713–28724
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1461/
DOI:
Bibkey:
Cite (ACL):: Junhyuk Choi, Ro-hoon Oh, Jihwan Seol, and Bugeun Kim. 2025. VoiceBBQ: Investigating Effect of Content and Acoustics in Social Bias of Spoken Language Model. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 28713–28724, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: VoiceBBQ: Investigating Effect of Content and Acoustics in Social Bias of Spoken Language Model (Choi et al., EMNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1461.pdf
Checklist:: 2025.emnlp-main.1461.checklist.pdf

PDF Cite Search Checklist Fix data