Long Chen
Other people with similar names: Long Chen, Long Chen, Long Chen
Unverified author pages with similar names: Long Chen
2026
BalanceSFT: Improving LLM Function Calling with Balanced Training Signals and Data Hardness
Bingguang Hao | Zengzhuang Xu | Maolin Wang | Yuntao Wen | Yicheng Chen | Cunyin Peng | Long Chen | Xiangyu Zhao | Jinjie Gu | Chenyi Zhuang | Ji Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Bingguang Hao | Zengzhuang Xu | Maolin Wang | Yuntao Wen | Yicheng Chen | Cunyin Peng | Long Chen | Xiangyu Zhao | Jinjie Gu | Chenyi Zhuang | Ji Zhang
Findings of the Association for Computational Linguistics: ACL 2026
While Supervised Fine-Tuning (SFT) is the prevailing method for equipping Large Language Models (LLMs) with function calling capabilities, its effectiveness is often compromised by two critical challenges: 1) **Imbalanced Training Signals**, where lengthy Chain-of-Thought (CoT) reasoning tokens dominate the training signals over concise function calls in the learning objective, and 2) **Imbalanced Data Hardness**, characterized by a scarcity of hard training examples. To overcome these limitations, we propose Balanced Supervised Fine-tuning (**BalanceSFT**), a novel framework that incorporates two key components: a Self-adjusted Signal Balancing (SSB) loss that employs a learnable hyperparameter to dynamically adjust the token contributions of CoT reasoning and function calls, together with a Hard Data Re-sampling (HDR) strategy that establishes a feedback loop to selectively generate new, high-quality complex data guided by model errors. Extensive experiments demonstrate the effectiveness of our proposed BalanceSFT framework. With BalanceSFT, a 7B model achieves function calling performance that surpasses state-of-the-art models like GPT-5. Our code, models, and dataset are open-sourced.