Xiang Li
Peking
Other people with similar names: Xiang Li (East China Normal University), Xiang Li, Xiang Li, Xiang Li, Xiang Li (Qilu), Xiang Li, Xiang Li (East China Normal University), Xiang Li, Xiang Li (Peking), Xiang Li (Massachusetts), Xiang Li (Beijing University of Posts and Telecommunications), Xiang Li (North China Electric Power University), Xiang Li (Beihang), Xiang Lorraine Li
Unverified author pages with similar names: Xiang Li
2026
PairCoder: Pair Programming-Inspired Two-Agent Collaboration for Code Generation
Junhao Chen | Xiang Li | Yibin Xu | Yuehan Cui | Fangsheng Weng | Hao Zhao | Fei Ma | Qi Tian
Findings of the Association for Computational Linguistics: ACL 2026
Junhao Chen | Xiang Li | Yibin Xu | Yuehan Cui | Fangsheng Weng | Hao Zhao | Fei Ma | Qi Tian
Findings of the Association for Computational Linguistics: ACL 2026
Large Language Models (LLMs) achieve strong results on code generation, but single model inference remains brittle on tasks that require iterative refinement. Existing multi agent frameworks improve reliability, yet they often incur substantial token and latency overhead. We introduce PairCoder, a framework that brings pair programming to autonomous LLM collaboration. PairCoder assigns one model to code generation and the other to review, and switches roles when repeated errors suggest that the current interaction has stalled. Across 13 LLMs on HumanEval, PairCoder consistently improves over single model inference. On eight representative backbones, it reaches 91.0% pass@1 and improves over single model inference by up to 20.3% while reducing token usage by 40% to 70% relative to multi agent baselines. Many heterogeneous pairings also outperform both constituent models, suggesting that the framework generalizes across model families. These results position PairCoder as an effective and deployment conscious alternative to heavier multi agent systems.Code is available at https://github.com/yisuanwang/PairCoder
2025
LLMsPark: A Benchmark for Evaluating Large Language Models in Strategic Gaming Contexts
Junhao Chen | Jingbo Sun | Xiang Li | Haidong Xin | Yuhao Xue | Yibin Xu | Hao Zhao
Findings of the Association for Computational Linguistics: EMNLP 2025
Junhao Chen | Jingbo Sun | Xiang Li | Haidong Xin | Yuhao Xue | Yibin Xu | Hao Zhao
Findings of the Association for Computational Linguistics: EMNLP 2025
As large language models (LLMs) advance across diverse tasks, the need for comprehensive evaluation beyond single metrics becomes increasingly important.To fully assess LLM intelligence, it is crucial to examine their interactive dynamics and strategic behaviors.We present LLMsPark, a game theory–based evaluation platform that measures LLMs’ decision-making strategies and social behaviors in classic game-theoretic settings, providing a multi-agent environment to explore strategic depth.Our system cross-evaluates 15 leading LLMs (both commercial and open-source) using leaderboard rankings and scoring mechanisms. Higher scores reflect stronger reasoning and strategic capabilities, revealing distinct behavioral patterns and performance differences across models.This work introduces a novel perspective for evaluating LLMs’ strategic intelligence, enriching existing benchmarks and broadening their assessment in interactive, game-theoretic scenarios.The benchmark and rankings are publicly available at https://llmsparks.github.io/.