Zhenjia Li
2026
HyperAdaLoRA: Accelerating LoRA Rank Allocation During Training via Hypernetworks without Sacrificing Performance
Hao Zhang | Zhenjia Li | Yifan Gao | Xi Xiao | Heng Zhang | Shuyang Zhang | Xiaoxincc | Bo Huang | Yuhang Wu | Tianyang Wang | Hao Xu
Findings of the Association for Computational Linguistics: ACL 2026
Parameter-Efficient Fine-Tuning (PEFT), especially Low-Rank Adaptation (LoRA), has emerged as a promising approach to fine-tuning large language models (LLMs) while reducing computational and memory overhead. However, LoRA assumes a uniform rank r for each incremental matrix and does not account for the varying significance of weight matrices across modules and layers. AdaLoRA parameterizes updates in the form of a Singular Value Decomposition (SVD) and prunes singular values to allocate rank dynamically, thereby enhancing adaptability. During training, however, it often suffers from slow convergence and high computational overhead. To address these issues, we propose HyperAdaLoRA, a novel framework that accelerates the convergence of AdaLoRA by leveraging a hypernetwork. Instead of directly optimizing the SVD components (P, 𝛬, Q), HyperAdaLoRA employs an attention-based hypernetwork to generate these parameters dynamically. Dynamic rank allocation is achieved by pruning the outputs of the hypernetwork that generates the singular values. Comprehensive experiments on various datasets and models demonstrate that our method achieves faster convergence without sacrificing performance. Moreover, our method generalizes well to other LoRA-based approaches.
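The SVD-form update and singular-value pruning described in this abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the hypernetwork that generates P, 𝛬, and Q is omitted, the function names `prune_singular_values` and `delta_w` are hypothetical, and ranking singular values by absolute magnitude is a simplifying assumption standing in for whatever importance score the method actually uses.

```python
import numpy as np


def prune_singular_values(lam, budget):
    """Keep the `budget` largest-magnitude entries of lam and zero the rest.

    Assumption: magnitude is used as a stand-in importance score.
    """
    lam = np.asarray(lam, dtype=float)
    if budget >= lam.size:
        return lam.copy()
    if budget <= 0:
        return np.zeros_like(lam)
    # Indices of the entries with the largest absolute value.
    keep = np.argsort(-np.abs(lam), kind="stable")[:budget]
    pruned = np.zeros_like(lam)
    pruned[keep] = lam[keep]
    return pruned


def delta_w(P, lam, Q):
    """Incremental update in SVD form: P @ diag(lam) @ Q."""
    # Scaling the columns of P by lam is equivalent to P @ diag(lam).
    return (P * lam) @ Q


# Toy example: a 6x5 update with 4 candidate singular values.
rng = np.random.default_rng(0)
P = rng.standard_normal((6, 4))
Q = rng.standard_normal((4, 5))
lam = np.array([0.9, -0.05, 0.4, 0.01])

lam_pruned = prune_singular_values(lam, budget=2)
update = delta_w(P, lam_pruned, Q)
effective_rank = int(np.count_nonzero(lam_pruned))
```

Zeroing an entry of `lam` removes the corresponding rank-one component from the update, so the number of surviving singular values bounds the rank of that matrix's update; this is how pruning translates into rank allocation.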
2025
Sensitivity-LoRA: Low-Load Sensitivity-Based Fine-Tuning for Large Language Models
Hao Zhang | Bo Huang | Zhenjia Li | Xi Xiao | Hui Yi Leong | Zumeng Zhang | Xinwei Long | Tianyang Wang | Hao Xu
Findings of the Association for Computational Linguistics: EMNLP 2025
Large Language Models (LLMs) have transformed both everyday life and scientific research. However, adapting LLMs from general-purpose models to specialized tasks remains challenging, particularly in resource-constrained environments. Low-Rank Adaptation (LoRA), a prominent method within Parameter-Efficient Fine-Tuning (PEFT), has emerged as a promising approach to adapting LLMs by approximating model weight updates with a low-rank decomposition. However, LoRA is limited by its uniform allocation of rank r to each incremental matrix, and existing rank allocation techniques that address this issue remain computationally inefficient, complex, and unstable, which hinders practical application. To address these limitations, we propose Sensitivity-LoRA, an efficient fine-tuning method that dynamically allocates ranks to weight matrices based on both their global and local sensitivities. It leverages the second-order derivatives (Hessian matrix) of the loss function to capture weight sensitivity, enabling optimal rank allocation with minimal computational overhead. Our experimental results demonstrate the effectiveness, efficiency, and stability of Sensitivity-LoRA across diverse tasks and benchmarks.
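As a rough illustration of sensitivity-based rank allocation, the sketch below distributes a fixed total rank budget across weight matrices in proportion to a per-matrix sensitivity score. Everything here is an assumption made for illustration: the abstract does not specify the allocation rule, the scores are taken as given (for example, some aggregate of Hessian-based sensitivities computed elsewhere), and the function name `allocate_ranks`, the `min_rank` floor, and the largest-remainder rounding are all hypothetical choices.

```python
import numpy as np


def allocate_ranks(sensitivities, total_rank, min_rank=1):
    """Split total_rank across matrices in proportion to their sensitivity.

    Assumptions: scores are non-negative, every matrix receives at least
    min_rank, and fractional shares are rounded by largest remainder so
    the ranks sum exactly to total_rank.
    """
    s = np.asarray(sensitivities, dtype=float)
    n = s.size
    if np.any(s < 0) or s.sum() == 0:
        raise ValueError("sensitivities must be non-negative and not all zero")
    if total_rank < n * min_rank:
        raise ValueError("total_rank is too small for the requested min_rank")

    spare = total_rank - n * min_rank      # budget left after the floor
    shares = spare * s / s.sum()           # ideal fractional allocation
    ranks = np.floor(shares).astype(int)
    leftover = spare - int(ranks.sum())    # units lost to rounding down
    # Hand the leftover units to the largest fractional remainders.
    order = np.argsort(-(shares - ranks), kind="stable")
    ranks[order[:leftover]] += 1
    return ranks + min_rank


# Toy example: four weight matrices sharing a total rank budget of 16.
scores = [4.0, 2.0, 1.0, 1.0]
ranks = allocate_ranks(scores, total_rank=16)
```

Under this rule, matrices with higher sensitivity receive more rank while the total number of trainable rank-one components stays fixed, in contrast to the uniform rank that plain LoRA assigns to every matrix.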