F-Actor: Controllable Conversational Behavior in Full-Duplex Models

Maike Z\"ufle, Ondrej Klejch, Nicholas Sanders, Jan Niehues, Alexandra Birch, Tsz Kin Lam


Abstract
Spoken conversational systems require more than accurate speech generation to have human-like conversations: to feel natural and engaging, they must produce conversational behaviour that adapts dynamically to the context. Current spoken conversational systems, however, rarely allow such customization, limiting their naturalness and usability. In this work, we present the first open, instruction-following full-duplex conversational speech model that can be trained efficiently under typical academic resource constraints. By keeping the audio encoder frozen and finetuning only the language model, our model requires just 2,000 hours of data, without relying on large-scale pretraining or multi-stage optimization. The model can follow explicit instructions to control speaker voice, conversation topic, conversational behaviour (e.g., backchanneling and interruptions), and dialogue initiation. We propose a single-stage training protocol and systematically analyze design choices. Both the model and training code is released to enable reproducible research on controllable full-duplex speech systems.
Anthology ID:
2026.findings-acl.242
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4904–4921
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.242/
DOI:
Bibkey:
Cite (ACL):
Maike Z\"ufle, Ondrej Klejch, Nicholas Sanders, Jan Niehues, Alexandra Birch, and Tsz Kin Lam. 2026. F-Actor: Controllable Conversational Behavior in Full-Duplex Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 4904–4921, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
F-Actor: Controllable Conversational Behavior in Full-Duplex Models (Z"ufle et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.242.pdf
Checklist:
 2026.findings-acl.242.checklist.pdf