Ziyu Shen
2025
Governance in Motion: Co-evolution of Constitutions and AI models for Scalable Safety
Chenhao Huang
|
Ziyu Shen
|
Yicong Ren
|
Huiyuan Zheng
|
Jiazheng Zhang
|
Mingxu Chai
|
Ming Zhang
|
Shihan Dou
|
Fan Mo
|
Jie Shi
|
Tao Gui
|
Qi Zhang
|
Xuanjing Huang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Aligning large language models (LLMs) with human preferences is a central challenge for building reliable AI systems. Most existing alignment approaches rely on static signals, such as predefined principles or offline human annotations to guide model behavior toward a fixed approximation of human preferences. However, LLMs can exhibit distributional drift during training, and static alignment mechanisms lack the capacity to adaptively correct misaligned behaviors as they emerge. To address this limitation, we develop a two-stage framework that enables dynamic and continuous alignment. In the first stage, a constitution is continually revised based on observed model behaviors, and models are trained to comply with these evolving principles. In the second stage, this learned constitution is used to guide reinforcement learning, encouraging the model to align with the updated normative signals. We refer to this framework as COCOA: Co-evolution of Constitutions and AI Models. We show that COCOA enables a 7B model to greatly improve safety—raising StrongReject score from 0.741 to 0.935 and Safe-RLHF accuracy from 77.76% to 90.64% without human annotations, reaching performance close to much larger state-of-the-art models.
DocFusion: A Unified Framework for Document Parsing Tasks
Mingxu Chai
|
Ziyu Shen
|
Chong Zhang
|
Yue Zhang
|
Xiao Wang
|
Shihan Dou
|
Jihua Kang
|
Jiazheng Zhang
|
Qi Zhang
Findings of the Association for Computational Linguistics: ACL 2025
Document parsing involves layout element detection and recognition, essential for extracting information. However, existing methods often employ multiple models for these tasks, leading to increased system complexity and maintenance overhead. While some models attempt to unify detection and recognition, they often fail to address the intrinsic differences in data representations, thereby limiting performance in document processing. Our research reveals that recognition relies on discrete tokens, whereas detection relies on continuous coordinates, leading to challenges in gradient updates and optimization. To bridge this gap, we propose the Gaussian-Kernel Cross-Entropy Loss (GK-CEL), enabling generative frameworks to handle both tasks simultaneously. Building upon GK-CEL, we propose DocFusion, a unified document parsing model with only 0.28B parameters. Additionally, we construct the DocLatex-1.6M dataset to provide high-quality training support. Experimental results show that DocFusion, equipped with GK-CEL, performs competitively across four core document parsing tasks, validating the effectiveness of our unified approach.
Search
Fix author
Co-authors
- Mingxu Chai 2
- Shihan Dou 2
- Jiazheng Zhang 2
- Qi Zhang (张琦) 2
- Tao Gui 1
- show all...