ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design
Yutang Ge, Guojiang Zhao, Sihang Li, Zheng Cheng, Zifeng Zhao, Hanchen Xia, Guolin Ke, Linfeng Zhang, Zhifeng Gao, Yu Guang Wang
Abstract
Designing proteins that satisfy natural language functional requirements is a central goal in protein engineering. A straightforward baseline is to fine-tune generic instruction-tuned LLMs as direct text-to-sequence generators, but this is data- and compute-hungry. With limited supervision, LLMs can produce coherent plans in text yet fail to reliably realize them as sequences. This plan–execute gap motivates ProtoCycle, an agentic framework for protein design that uses LLMs primarily to drive a multi-round, feedback-driven decision cycle. ProtoCycle couples an LLM planner with a lightweight tool environment designed to emulate the iterative workflow of human protein engineers and uses LLM-driven reflection on tool feedback to revise plans. Trained with supervised trajectories and online reinforcement learning, ProtoCycle achieves strong language alignment while maintaining competitive foldability, and ablations show that reflection substantially improves sequence quality.
- Anthology ID:
- 2026.findings-acl.763
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- Publisher:
- Association for Computational Linguistics
- Pages:
- 15562–15586
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.763/
- Cite (ACL):
- Yutang Ge, Guojiang Zhao, Sihang Li, Zheng Cheng, Zifeng Zhao, Hanchen Xia, Guolin Ke, Linfeng Zhang, Zhifeng Gao, and Yu Guang Wang. 2026. ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design. In Findings of the Association for Computational Linguistics: ACL 2026, pages 15562–15586, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design (Ge et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.763.pdf