Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning

JuneYoung Park, Yuri Hong, Seongwan Kim, Jaeho Lee


Abstract
On-device fine-tuning enables privacy-preserving personalization of large language models, but mobile devices impose severe memory constraints, typically 6–12GB shared across all workloads. Existing approaches force a trade-off between exact gradients with high memory (MeBP) and low memory with noisy estimates (MeZO). We propose Memory-efficient Structured Backpropagation (MeSP), which bridges this gap by manually deriving backward passes that exploit LoRA’s low-rank structure. Our key insight is that the intermediate projection h = xA can be recomputed during backward at minimal cost since rank r ≪ din, eliminating the need to store it. MeSP achieves 49% average memory reduction compared to MeBP on Qwen2.5 models (0.5B–3B) while computing mathematically identical gradients. Our analysis also reveals that MeZO’s gradient estimates show near-zero correlation with true gradients (cosine similarity 0.001), explaining its slow convergence. MeSP reduces peak memory from 361MB to 136MB for Qwen2.5-0.5B, enabling fine-tuning scenarios previously infeasible on memory-constrained devices.
Anthology ID:
2026.acl-industry.62
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Yunyao Li, Georg Rehm, Mei Tu
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
906–916
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-industry.62/
DOI:
Bibkey:
Cite (ACL):
JuneYoung Park, Yuri Hong, Seongwan Kim, and Jaeho Lee. 2026. Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 906–916, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning (Park et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-industry.62.pdf