Jianshe Li

2026

How to Train a Real-World Silicon Concierge? Internalizing Complex Business Workflow to Only OneModel
Yongqi Tong | Xiaoyun Feng | Lyuxin Xue | Jianshe Li | Xin Zhang | Jiang-Ming Yang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)

Traditional industrial agents rely on modular pipelines, including Router, Retriever, Planner, Executor, Responder, Reviewer and so on, which inevitably fracture into a labyrinth of ad-hoc patches, leading to cascading errors and high latency. We propose OneModel, an applicable paradigm shift from external workflows to internalized knowledge representation. Unlike modular systems that slice fluid user intents into static steps, OneModel consolidates complex business logic and SOPs directly into the model’s parameters.Through Continual Pre-training (CPT) and logic-compilation SFT, we transform fragmented business rules into the model’s intuitive reasoning within a unified attention space. Deployed in our global financial service system, OneModel effectively breaks the impossible triangle of latency, accuracy, and complexity. Online A/B testing demonstrates end-to-end latency reduction of more than 50% (18.7s → 8s) while the Intelligent Resolution Rate (IRR) jumps from 64.3% to 83.3%. The results demonstrate our paradigm OneModel effectively replaces brittle engineering logic with internalized cognitive intuition, offering a scalable and future-proof blueprint for transitioning industrial agents from complex, error-prone workflow to unified model architectures.

Co-authors

Venues

ACL1

Fix author