Select Before Use: On the Importance of Reference Model Selection in Preference Alignment

Muyang Li; Runze Wu; Xiangyu Zhao; Bo Han; Daoyi Dong; Tongliang Liu

Select Before Use: On the Importance of Reference Model Selection in Preference Alignment

Muyang Li, Runze Wu, Xiangyu Zhao, Bo Han, Daoyi Dong, Tongliang Liu

Abstract

The post-training stage of Large Language Models (LLMs) typically involves Supervised Fine-Tuning (SFT) followed by preference alignment to ensure LLM to generate safe, helpful, and instruction-aligned content. The SFT model critically serves as both the initialization and reference model for subsequent preference alignment. However, an essential yet often neglected question is the optimal selection of the SFT checkpoint for this role. We show that checkpoint selection substantially affects final performance, and that the common practice of choosing the minimum validation-loss checkpoint often fails, due to a fundamental conflict between SFT’s focus on imitation and alignment’s goal of response discriminability. To this end, we propose RewardRank, a simple, effective, training-free metrics for estimating initial implicit alignment between reference model and preference objective. Empirical evidence suggests that, using our selected model as reference can gain up to 67.6% relative increase on length-controlled win rate on the popular Zephyr recipe comparing to baselines.

Anthology ID:: 2026.acl-long.780
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 17151–17171
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.780/
DOI:
Bibkey:
Cite (ACL):: Muyang Li, Runze Wu, Xiangyu Zhao, Bo Han, Daoyi Dong, and Tongliang Liu. 2026. Select Before Use: On the Importance of Reference Model Selection in Preference Alignment. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 17151–17171, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Select Before Use: On the Importance of Reference Model Selection in Preference Alignment (Li et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.780.pdf
Checklist:: 2026.acl-long.780.checklist.pdf

PDF Cite Search Checklist Fix data