F-Actor: Controllable Conversational Behavior in Full-Duplex Models
Maike Züfle, Ondrej Klejch, Nicholas Sanders, Jan Niehues, Alexandra Birch, Tsz Kin Lam
Abstract
Spoken conversational systems require more than accurate speech generation to have human-like conversations: to feel natural and engaging, they must produce conversational behaviour that adapts dynamically to the context. Current spoken conversational systems, however, rarely allow such customization, limiting their naturalness and usability. In this work, we present the first open, instruction-following full-duplex conversational speech model that can be trained efficiently under typical academic resource constraints. By keeping the audio encoder frozen and finetuning only the language model, our model requires just 2,000 hours of data, without relying on large-scale pretraining or multi-stage optimization. The model can follow explicit instructions to control speaker voice, conversation topic, conversational behaviour (e.g., backchanneling and interruptions), and dialogue initiation. We propose a single-stage training protocol and systematically analyze design choices. Both the model and training code are released to enable reproducible research on controllable full-duplex speech systems.
- Anthology ID:
- 2026.findings-acl.242
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- Publisher:
- Association for Computational Linguistics
- Pages:
- 4904–4921
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.242/
- Cite (ACL):
- Maike Züfle, Ondrej Klejch, Nicholas Sanders, Jan Niehues, Alexandra Birch, and Tsz Kin Lam. 2026. F-Actor: Controllable Conversational Behavior in Full-Duplex Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 4904–4921, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- F-Actor: Controllable Conversational Behavior in Full-Duplex Models (Züfle et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.242.pdf