R2IF: Aligning Reasoning with Decisions via Composite Rewards for Interpretable LLM Function Calling

Aijia Cheng, Kailong Wang, Ling Shi, Yongxin Zhao


Abstract
Function calling empowers large language models (LLMs) to interface with external tools, yet existing RL-based approaches suffer from misalignment between reasoning processes and tool-call decisions. We propose R2IF, a reasoning-aware RL framework for interpretable function calling, adopting a composite reward integrating format/correctness constraints, Chain-of-Thought Effectiveness Reward (CER), and Specification-Modification-Value (SMV) reward, optimized via GRPO. Experiments on BFCL/ACEBench show R2IF outperforms baselines by up to 34.62% (Llama3.2-3B on BFCL) with positive Average CoT Effectiveness (0.05 for Llama3.2-3B), enhancing both function-calling accuracy and interpretability for reliable tool-augmented LLM deployment.
Anthology ID:
2026.acl-long.1715
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
36995–37008
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1715/
DOI:
Bibkey:
Cite (ACL):
Aijia Cheng, Kailong Wang, Ling Shi, and Yongxin Zhao. 2026. R2IF: Aligning Reasoning with Decisions via Composite Rewards for Interpretable LLM Function Calling. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 36995–37008, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
R2IF: Aligning Reasoning with Decisions via Composite Rewards for Interpretable LLM Function Calling (Cheng et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1715.pdf
Checklist:
 2026.acl-long.1715.checklist.pdf