Ro-SLM: Onboard Small Language Models for Robot Task Planning and Operation Code Generation

Wenhao Wang, Yanyan Li, Long Jiao, Jiawei Yuan


Abstract
Recent advances in large language models (LLMs) provide robots with contextual reasoning abilities to comprehend human instructions. Yet, current LLM-enabled robots typically depend on cloud-based models or high-performance computing infrastructure, which limit their deployment on robots under unreliable internet environments or with constrained computational resources, such as UAVs and small ground vehicles. Thus, deploying fine-tuned small language models (SLMs) that support onboard deployment offers a promising alternative. This paper introduces Ro-SLM, a framework that enables reliable SLM-driven robot operation by distilling LLMs’ knowledge and reasoning. Ro-SLM starts from dataset synthesis by leveraging LLMs to generate diverse task instructions, produce corresponding ground truth code with minimal human assistance, and augment instructions into real-world application scenarios. Ro-SLM is then fine-tuned with the dataset, in which LLM serves as a reward function to guide the training. Extensive experiments on UAV operation tasks demonstrate that Ro-SLM improves the performance of SLM from being incapable of supporting robotic task planning and code generation to achieving performance that approaches LLM.
Anthology ID:
2026.findings-acl.1720
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
34436–34460
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1720/
DOI:
Bibkey:
Cite (ACL):
Wenhao Wang, Yanyan Li, Long Jiao, and Jiawei Yuan. 2026. Ro-SLM: Onboard Small Language Models for Robot Task Planning and Operation Code Generation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 34436–34460, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Ro-SLM: Onboard Small Language Models for Robot Task Planning and Operation Code Generation (Wang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1720.pdf
Checklist:
 2026.findings-acl.1720.checklist.pdf