PAM: Enhancing General Alignment of Large Reasoning Models through Priority-Aware Metacognition

Zhihao Xu; Fuzhen Yang; Liang Lin; Xiting Wang

Abstract

Recent advancements in Large Reasoning Models (LRMs) have showcased strong performance across various reasoning tasks by leveraging System-2 thinking capabilities. However, existing studies indicate that this reasoning ability alone does not reliably transfer to the general alignment domain. Inspired by cognitive science and how humans solve tasks, we argue that LRMs must be equipped with metacognitive knowledge to fully utilize their System-2 capabilities. In this paper, we propose Priority-Aware Metacognition (PAM), which guides the model to first identify the top-level human preference (e.g., harmlessness) as a means of understanding the nature of the alignment task, and then to apply other kinds of metacognitive knowledge to better monitor and regulate its thinking process. We implement PAM via a two-stage pipeline: a cold-start phase that collects structured metacognitive knowledge based on Flavell’s theoretical framework, and a preference-optimization phase that further reinforces such metacognition. Extensive experiments validate the effectiveness of PAM. Under the same training pipelines, PAM consistently yields higher performance, improving general-domain alignment performance by ~10 points on the helpfulness and harmlessness benchmarks. Code is available at https://anonymous.4open.science/r/PAM-RM-02DF.

Anthology ID: 2026.acl-long.432
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 9554–9573
URL: https://preview.aclanthology.org/ingest-acl/2026.acl-long.432/
Cite (ACL): Zhihao Xu, Fuzhen Yang, Liang Lin, and Xiting Wang. 2026. PAM: Enhancing General Alignment of Large Reasoning Models through Priority-Aware Metacognition. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 9554–9573, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): PAM: Enhancing General Alignment of Large Reasoning Models through Priority-Aware Metacognition (Xu et al., ACL 2026)
PDF: https://preview.aclanthology.org/ingest-acl/2026.acl-long.432.pdf
Checklist: 2026.acl-long.432.checklist.pdf
