ReAlign: Structured Revision for Small Language Model Alignment
Ruijun Chen, Jiajian Guo, Hongzhan Chen, Fanqi Wan, Qifan Wang, Xiaojun Quan
Abstract
Aligning small language models with human preferences is challenging: weak policies struggle to generate informative on-policy samples and suffer from unstable gradients when trained on off-policy signals from stronger models. In this work, we propose ReAlign, a training framework that combines the stability of on-policy learning with the guidance of reviser-assisted supervision. In ReAlign, we first train a lightweight reviser to improve policy-generated responses using preference-based supervision, conditioned on both the prompt and the initial output. The policy is then optimized using a combination of standard on-policy preference pairs and reviser-enhanced pairs constructed as a structured revision task, where the latter provide richer, more learnable feedback. Experimental results on AlpacaEval-2 and Arena-Hard demonstrate that ReAlign significantly boosts alignment performance for weak policies, outperforming strong preference optimization baselines.
- Anthology ID:
- 2025.findings-emnlp.642
- Volume:
- Findings of the Association for Computational Linguistics: EMNLP 2025
- Month:
- November
- Year:
- 2025
- Address:
- Suzhou, China
- Editors:
- Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
- Venue:
- Findings
- Publisher:
- Association for Computational Linguistics
- Pages:
- 12005–12020
- URL:
- https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.642/
- DOI:
- 10.18653/v1/2025.findings-emnlp.642
- Cite (ACL):
- Ruijun Chen, Jiajian Guo, Hongzhan Chen, Fanqi Wan, Qifan Wang, and Xiaojun Quan. 2025. ReAlign: Structured Revision for Small Language Model Alignment. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 12005–12020, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal):
- ReAlign: Structured Revision for Small Language Model Alignment (Chen et al., Findings 2025)
- PDF:
- https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.642.pdf
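The two-stage pipeline described in the abstract can be sketched as follows. This is a minimal illustrative sketch, not the authors' implementation: `policy_generate`, `reviser_revise`, and the length-based scorer are hypothetical stand-ins for the small policy model, the trained reviser, and a preference scorer.

```python
# Illustrative sketch of ReAlign-style preference-pair construction.
# All functions below are hypothetical placeholders; a real system
# would sample from language models and score with a reward model.

def policy_generate(prompt, n=2):
    # Hypothetical: sample n candidate responses from the small policy.
    return [f"{prompt} :: draft-{i}" for i in range(n)]

def reviser_revise(prompt, response):
    # Hypothetical: the reviser conditions on (prompt, initial output)
    # and returns an improved response.
    return response + " [revised]"

def build_pairs(prompt, score):
    drafts = policy_generate(prompt)
    # Standard on-policy pair: best vs. worst draft under the scorer.
    ranked = sorted(drafts, key=score, reverse=True)
    on_policy_pair = (ranked[0], ranked[-1])
    # Reviser-enhanced pair: the revised output is preferred over the
    # draft it was revised from (the structured revision task).
    revised = reviser_revise(prompt, ranked[0])
    revision_pair = (revised, ranked[0])
    return [on_policy_pair, revision_pair]

pairs = build_pairs("Explain gradient clipping", score=len)
for chosen, rejected in pairs:
    print(chosen, ">", rejected)
```

Both kinds of pairs would then feed a standard preference-optimization objective (e.g. a DPO-style loss), with the reviser-enhanced pairs supplying feedback that stays close to the policy's own outputs.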