On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation

Rune Birkmose, Nathan Mørkeberg Reece, Esben Hofstedt Norvin, Johannes Bjerva, Mike Zhang


Abstract
This paper investigates whether Large Language Models (LLMs), fine-tuned on synthetic but domain-representative data, can perform the twofold task of (i) slot and intent detection and (ii) natural language response generation for a smart home assistant, while running solely on resource-limited, CPU-only edge hardware. We fine-tune LLMs to produce both JSON action calls and text responses. Our experiments show that the 16-bit and 8-bit quantized variants preserve high accuracy on slot and intent detection and maintain strong semantic coherence in generated text, whereas the 4-bit model, though retaining generative fluency, suffers a noticeable drop in device-service classification accuracy. Further evaluations on noisy human (non-synthetic) prompts and out-of-domain intents confirm the models’ generalization ability, with around 80–86% accuracy. Although the average inference time of 5–6 seconds per query is acceptable for one-shot commands but suboptimal for multi-turn dialogue, our results affirm that an on-device LLM can effectively unify command interpretation and flexible response generation for home automation without relying on specialized hardware.
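
To make the dual-output setup concrete, the following minimal Python sketch illustrates what a single fine-tuned turn could produce; the field names ("device", "service", "slots") and the overall schema are assumptions for illustration only, not the paper's actual output format.

    # Hypothetical sketch of the dual output described in the abstract: a JSON
    # action call for the home-automation backend plus a natural-language reply.
    # Field names ("device", "service", "slots") are illustrative assumptions.
    import json

    user_prompt = "Turn off the living room lights"

    model_output = {
        "action": {"device": "light", "service": "turn_off",
                   "slots": {"area": "living_room"}},
        "response": "Okay, turning off the living room lights.",
    }

    print(json.dumps(model_output["action"]))  # structured call executed by the assistant
    print(model_output["response"])            # text shown or spoken back to the user
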
Anthology ID:
2025.wnut-1.7
Volume:
Proceedings of the Tenth Workshop on Noisy and User-generated Text
Month:
May
Year:
2025
Address:
Albuquerque, New Mexico, USA
Editors:
JinYeong Bak, Rob van der Goot, Hyeju Jang, Weerayut Buaphet, Alan Ramponi, Wei Xu, Alan Ritter
Venues:
WNUT | WS
Publisher:
Association for Computational Linguistics
Pages:
57–67
URL:
https://preview.aclanthology.org/landing_page/2025.wnut-1.7/
Cite (ACL):
Rune Birkmose, Nathan Mørkeberg Reece, Esben Hofstedt Norvin, Johannes Bjerva, and Mike Zhang. 2025. On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation. In Proceedings of the Tenth Workshop on Noisy and User-generated Text, pages 57–67, Albuquerque, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal):
On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation (Birkmose et al., WNUT 2025)
PDF:
https://preview.aclanthology.org/landing_page/2025.wnut-1.7.pdf