LoopCoder: Scaling Code Intelligence via Looped Language Models

Jian Yang; Wei Zhang; Shuyue Guo; Yizhi Li; Linzheng Chai; Zhengmao Ye; Shukai Liu; Yuyang Song; Jiajun Wu; Che Liu; Tianyu Zheng; Siwei Wu; Leo L; Xudong Ma; Chuan Hao; Ran Tao; Yan Xing; Jianzhou Wang; Mingjie Tang; Aishan Liu; Zhoujun Li; Xianglong Liu; Weifeng Lv; Bryan Dai

LoopCoder: Scaling Code Intelligence via Looped Language Models

Jian Yang, Wei Zhang, Shuyue Guo, Yizhi LI, Linzheng Chai, Zhengmao Ye, Shukai Liu, Yuyang Song, Jiajun Wu, Che Liu, Tianyu Zheng, Siwei Wu, Leo L, Xudong Ma, Chuan Hao, Ran Tao, Yan Xing, Jianzhou Wang, Mingjie Tang, Aishan Liu, Zhoujun Li, Xianglong Liu, Weifeng Lv, Bryan Dai

Abstract

While large language models (LLMs) have mastered syntax-level code generation, complex algorithmic reasoning remains a challenge, typically addressed by scaling model depth and parameter count. Universal Transformers (UT) offer a compelling alternative by introducing a recurrent inductive bias that aligns with the recursive nature of programming logic. However, training looped architectures at scale has historically been hindered by severe instability and optimization difficulties associated with backpropagation through time (BPTT). We present LoopCoder (40B-A80B) pre-trained on 12T+ code and general tokens, along with LoopCoder-Thinking and LoopCoder-Instruct variants—the first large-scale looped transformer for code, achieving comparable performance to standard dense architectures with more parameters. Unlike prior approaches that restrict recurrence to small-scale tasks, we implement a comprehensive looped training protocol spanning both pre-training and post-training phases. We initiate the model via dense-to-loop transformation, folding a pre-trained dense checkpoint to initialize a recurrent block, followed by rigorous looped pre-training and specialized post-training for instruction following and reasoning. Our results establish a robust recipe for scaling coding intelligence via recurrent computation, proving that dense checkpoints serve as an optimal foundation for evolving into dynamic, looped reasoners.

Anthology ID:: 2026.findings-acl.796
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 16209–16223
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.796/
DOI:
Bibkey:
Cite (ACL):: Jian Yang, Wei Zhang, Shuyue Guo, Yizhi LI, Linzheng Chai, Zhengmao Ye, Shukai Liu, Yuyang Song, Jiajun Wu, Che Liu, Tianyu Zheng, Siwei Wu, Leo L, Xudong Ma, Chuan Hao, Ran Tao, Yan Xing, Jianzhou Wang, Mingjie Tang, Aishan Liu, Zhoujun Li, Xianglong Liu, Weifeng Lv, and Bryan Dai. 2026. LoopCoder: Scaling Code Intelligence via Looped Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 16209–16223, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: LoopCoder: Scaling Code Intelligence via Looped Language Models (Yang et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.796.pdf
Checklist:: 2026.findings-acl.796.checklist.pdf

PDF Cite Search Checklist Fix data