Haoxuan Li
Other people with similar names: Haoxuan Li, Haoxuan Li
Unverified author pages with similar names: Haoxuan Li
2026
SimPBL: A Multi-Agent Framework for Project-Based Learning
Daniel Zhang-Li | Joy Jia Yin Lim | Binglin Liu | Shangqing Tu | Zijun Yao | Hao Peng | Jifan Yu | Haoxuan Li | Zhanxin Hao | Ye He | Zekun Li | Jiangyi Wang | Lei Hou | Bin Xu | Xin Cong | Zhiyuan Liu | Huiqin Liu | Yu Zhang | Juanzi Li
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Daniel Zhang-Li | Joy Jia Yin Lim | Binglin Liu | Shangqing Tu | Zijun Yao | Hao Peng | Jifan Yu | Haoxuan Li | Zhanxin Hao | Ye He | Zekun Li | Jiangyi Wang | Lei Hou | Bin Xu | Xin Cong | Zhiyuan Liu | Huiqin Liu | Yu Zhang | Juanzi Li
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Project-Based Learning (PBL) is an important learning method that promotes understanding and acquiring practical skills through training learners through a project. However, effective PBL often requires sustained orchestration and collaboration, but existing LLM-based learning tools provide partial assistance without explicitly modeling these roles, and overly comprehensive help provided by LLM can reduce learner autonomy. We propose SimPBL, a multi-agent framework with an orchestrator agent that provides adaptive scaffolding from interaction logs and collaborator agents that support project work through boundary-aware collaboration. We conduct comprehensive evaluation to study the effectiveness of SimPBL, where we observe a 14% improvement in learner examination score. Results from extensive studies further highlights the ability of SimPBL to manage learning behavior and improve learning experience. Code and materials are available at https://anonymous.4open.science/r/SimPBL-D5B8.
From Knowing to Teaching: Scaffolding Pedagogical Decisions for LLM Agent
Yucheng Wang | Shen Yang | Jifan Yu | Haoxuan Li | Joy Jia Yin Lim | Daniel Zhang-Li | Huiqin Liu | Lei Hou | Juanzi Li | Bin Xu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yucheng Wang | Shen Yang | Jifan Yu | Haoxuan Li | Joy Jia Yin Lim | Daniel Zhang-Li | Huiqin Liu | Lei Hou | Juanzi Li | Bin Xu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Knowing and teaching differ fundamentally: effective instruction requires transforming knowledge into forms learners can grasp. Large language models, when asked to generate lessons (a concrete form of teaching), produce content lacking pedagogical depth. We trace this failure to three decisions that expert teachers make: selecting content by recognizing each source’s instructional role, sequencing topics so foundations precede applications, and synthesizing components into a unified whole. To scaffold these decisions, we introduce TeachCraft, a framework with three agents: Explorer classifies sources by pedagogical intent to guide selection; Planner orders objectives from foundational to advanced; Generator produces lesson materials through a schema that ensures consistency across components. To evaluate this approach, we construct LessonBench, 40 expert-designed lessons paired with two to five heterogeneous source documents, on which TeachCraft achieves 67.8% win rate in human evaluation and 79.6% in LLM-based evaluation against eight baselines, with ablations confirming that each decision contributes independently to overall lesson quality.[Source code is available at <https://anonymous.4open.science/r/TeachCraft-1672>]
Beyond Self-Report: Bridging the Intention-Behavior Gap in Critical Thinking Assessment via Interpretable Multi-Agent System
Zekun Li | Jifan Yu | Haoxuan Li | Ye He | Daniel Zhang-Li | Shangqing Tu | Joy Jia Yin Lim | Yikun Jiang | Jiaxin Yuan | Yu Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Zekun Li | Jifan Yu | Haoxuan Li | Ye He | Daniel Zhang-Li | Shangqing Tu | Joy Jia Yin Lim | Yikun Jiang | Jiaxin Yuan | Yu Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Accurate assessment of critical thinking is historically limited by the Intention Behavior Gap in psychology: the disconnect between what individuals self-reported disposition and their actual practical behaviors. We try to bridge this gap with MASA (Multi-Agent Scenario-based Assessment), a framework that operationalizes cognitive assessment into an interpretable and interactive multi-agent workflow with Assessment Chain-of-Thought (AsCoT). Validating on both large-scale simulations (N=1,161) and human participants (N=70), we find that MASA aligns better with human expert ratings (r=0.882) than traditional gold-standard inventories (r=0.720), with an average cost of only 0.41 per participant. These results suggest that by shifting from self-report inventory to behavior-grounded dialogue, MASA offers a more accurate, cost-effective, and transparent solution for real-world cognitive evaluation.
Graph of Trace: Visualizing Execution Traces of Scientific Agents
Tianci Gao | Haoxuan Li | Jian He Li | Tianxiang Zhao | Shi Runze | Weiran Wang | Zezhao Wu | Lu Mi
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Tianci Gao | Haoxuan Li | Jian He Li | Tianxiang Zhao | Shi Runze | Weiran Wang | Zezhao Wu | Lu Mi
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Scientific AI agents can autonomously carry out complex research workflows, yet these unfolded workflows often remains difficult for humans to inspect and review, limiting interpretable, controllable and effective human–AI collaboration. To address this challenge, we present a monitoring and visualization framework that records fine-grained execution events and organizes them into a directed graph that make agent workflows explicit as they proceed. The system records intermediate steps (e.g. tool calls and code executions), and renders them as real-time updated visual traces that expose workflow structure. This allows users to examine how results are produced, identify where failures emerge, and better understand agent behavior across different stages of the research process.We conduct an evaluation on complex research tasks with domain experts of interdisciplinary background in AI, neuroscience and biology. Experts report that structured traces visualization improves understanding of agent workflows, perceived interpretability, and usability for analysis and further interaction.