GMFL: Efficient Global Masking for Federated LLM Fine-tuning

Xin Huang, Yan Hu, Yue-Jiao Gong, Xinglin Zhang


Abstract
Low-Rank Adaptation (LoRA) has emerged as a prominent solution to mitigate the communication and computation costs in federated fine-tuning of Large Language Models (LLMs). However, we observe that even within low-rank adapters, a substantial portion of parameters manifest negligible updates during federated training, leading to redundant communication and wasted local computation. To address this, we propose GMFL, a plug-and-play layer freezing mechanism designed to seamlessly integrate with existing federated fine-tuning frameworks. Specifically, the server monitors the global update magnitude of each LoRA layer to dynamically generate freezing masks. These masks are updated periodically with a fixed freezing rate, ensuring stable convergence by robustly identifying “saturated” layers. Theoretical analysis confirms the convergence of GMFL, where the freezing mechanism yields a bounded error that scales with client heterogeneity. Extensive experiments across multiple tasks (GLUE, Commonsense Reasoning, Math Reasoning and General Generation) demonstrate that GMFL reduces communication overhead and lowers computational costs while preserving the performance of the underlying federated fine-tuning methods. Our work provides a practical, versatile solution for deploying large-scale federated LLM fine-tuning in resource-constrained environments. Our code is available at: https://github.com/tunx-cyber/GMFL.
Anthology ID:
2026.acl-long.1160
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
25293–25312
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1160/
DOI:
Bibkey:
Cite (ACL):
Xin Huang, Yan Hu, Yue-Jiao Gong, and Xinglin Zhang. 2026. GMFL: Efficient Global Masking for Federated LLM Fine-tuning. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 25293–25312, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
GMFL: Efficient Global Masking for Federated LLM Fine-tuning (Huang et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1160.pdf
Checklist:
 2026.acl-long.1160.checklist.pdf