Dayu Yang
2025
DocAgent: A Multi-Agent System for Automated Code Documentation Generation
Dayu Yang
|
Antoine Simoulin
|
Xin Qian
|
Xiaoyi Liu
|
Yuwei Cao
|
Zhaopu Teng
|
Grey Yang
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
High-quality code documentation is crucial for software development especially in the era of AI. However, generating it automatically using Large Language Models (LLMs) remains challenging, as existing approaches often produce incomplete, unhelpful, or factually incorrect outputs. We introduce DocAgent, a novel multi-agent collaborative system using topological code processing for incremental context building. Specialized agents (Reader, Searcher, Writer, Verifier, Orchestrator) then collaboratively generate documentation. We also propose a multi-faceted evaluation framework assessing Completeness, Helpfulness, and Truthfulness. Comprehensive experiments show DocAgent significantly outperforms baselines consistently. Our ablation study confirms the vital role of the topological processing order. DocAgent offers a robust approach for reliable code documentation generation in complex and proprietary repositories.
Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs
Dayu Yang
|
Tianyang Liu
|
Daoan Zhang
|
Antoine Simoulin
|
Xiaoyi Liu
|
Yuwei Cao
|
Zhaopu Teng
|
Xin Qian
|
Grey Yang
|
Jiebo Luo
|
Julian McAuley
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Code and reasoning recently exhibit a mutually reinforcing relationship in large language models (LLMs): Code is abstract, modular, highly structured and has strong logic, guiding reasoning in training and inference. While reasoning translates high-level goals into small executable steps, enable more sophisticated code intellignece, solving real-world challenging software development problems. In this study, we examine how code serves as a structured medium for enhancing reasoning - providing verifiable execution paths, enforcing logical decomposition, and enabling runtime validation, and how advances in reasoning have transformed code intelligence from basic completion to sophisticated agent - enabling models to tackle complex software engineering tasks through deliberate planning and systematic debugging. Finally, we identify key challenges and propose future research directions may deepen the synergy, ultimately advancing LLM performance in both complex reasoning and code intelligence.
Search
Fix author
Co-authors
- Yuwei Cao 2
- Xiaoyi Liu 2
- Xin Qian 2
- Antoine Simoulin 2
- Zhaopu Teng 2
- show all...