Aligning Backchannel and Dialogue Context Representations via Contrastive LLM Fine-Tuning

Livia Qian; Gabriel Skantze

Aligning Backchannel and Dialogue Context Representations via Contrastive LLM Fine-Tuning

Abstract

Backchannels (e.g., ‘yeah’, ‘mhm’, and ‘right’) are short, non-interruptive feedback signals whose lexical form and prosody jointly convey pragmatic meaning. While prior computational research has largely focused on predicting backchannel timing, the relationship between lexico-prosodic form and meaning remains underexplored. We propose a two-stage framework: first, fine-tuning large language models on dialogue transcripts to derive rich contextual representations; and second, learning a joint embedding space for dialogue contexts and backchannel realizations. We evaluate alignment with human perception via triadic similarity judgments (prosodic and cross-lexical) and a context–backchannel suitability task. Our results demonstrate that the learned projections substantially improve context-backchannel retrieval compared to previous methods. In addition, they reveal that backchannel form is highly sensitive to extended conversational context and that the learned embeddings align more closely with human judgments than raw WavLM features.

Anthology ID:: 2026.acl-long.629
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 13817–13833
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.629/
DOI:
Bibkey:
Cite (ACL):: Livia Qian and Gabriel Skantze. 2026. Aligning Backchannel and Dialogue Context Representations via Contrastive LLM Fine-Tuning. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 13817–13833, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Aligning Backchannel and Dialogue Context Representations via Contrastive LLM Fine-Tuning (Qian & Skantze, ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.629.pdf
Checklist:: 2026.acl-long.629.checklist.pdf

PDF Cite Search Checklist Fix data