Zhichun Wang
Also published as: 志春 王
2026
CogBench: Benchmarking Cognitive Alignment of Large Language Models in Educational Question Answering
Tong Lu | Zhichun Wang | Yuanhao Sun | Yaoyu Zhou | Mingrui Li | Yiming Guan | Zhiyong Bai
Findings of the Association for Computational Linguistics: ACL 2026
Tong Lu | Zhichun Wang | Yuanhao Sun | Yaoyu Zhou | Mingrui Li | Yiming Guan | Zhiyong Bai
Findings of the Association for Computational Linguistics: ACL 2026
Large language models (LLMs) possess strong capabilities in language understanding and generation, as well as remarkable problem-solving abilities. In the educational domain, a representative application is to employ LLMs as learning assistants that answer students’ questions and support their learning processes. In such scenarios, it is crucial for the model to perceive a student’s cognitive level and provide explanations that are appropriate to that level. However, whether LLMs can effectively accomplish this task has not yet been thoroughly investigated. To address this gap, we introduce CogBench, an evaluation benchmark designed to assess the cognitive alignment capabilities of LLMs in educational QA. CogBench comprises 2.1K mathematics questions, each associated with multiple valid solutions that rely on knowledge and reasoning at different cognitive levels. Building on this structure, we formulate three cognition-aware evaluation tasks and propose three complementary metrics to quantify cognitive alignment from multiple perspectives. Extensive experiments on 11 representative LLMs reveal that, while models can often produce correct answers, they still struggle to consistently generate explanations that are aligned with the intended cognitive level. These results highlight substantial room for improvement and establish CogBench as a diagnostic benchmark for advancing cognitively aligned educational AI systems.
CheckRLM: Effective Knowledge–Thought Coherence Checking in Retrieval-Augmented Reasoning
Dingling Xu | Ruobing Wang | Qingfei Zhao | Yukun Yan | Zhichun Wang | Daren Zha | Shi Yu | Zhenghao Liu | Shuo Wang | Xu Han | Maosong Sun
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Dingling Xu | Ruobing Wang | Qingfei Zhao | Yukun Yan | Zhichun Wang | Daren Zha | Shi Yu | Zhenghao Liu | Shuo Wang | Xu Han | Maosong Sun
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Reasoning Language Models (RLMs) have significantly improved performance on complex tasks by extending the reasoning chain. However, these chains are prone to containing factual errors, particularly in knowledge-intensive tasks. To address this issue, we propose **CheckRLM**, a framework that improves the reliability of the reasoning process through Retrieval-Augmented Generation (RAG) by timely checking and correcting factual errors. Specifically, CheckRLM extracts factual claims from the reasoning chain to identify and localize subtle knowledge inconsistencies during inference. Upon detection of errors, a refinement mechanism performs minimal-cost yet precise corrections by leveraging external knowledge, ensuring coherence between the reasoning chain and correct knowledge. Extensive experiments demonstrate that CheckRLM substantially outperforms existing baselines, exhibiting a strong capability to mitigate error accumulation in long-horizon reasoning with lower costs. The code and data are available at https://github.com/AI9Stars/CheckRLM.
R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning
Qingfei Zhao | Ruobing Wang | Dingling Xu | Daren Zha | Ma Bowen | Zhichun Wang | Shijie Jia | Limin Liu | Xin Wang
Findings of the Association for Computational Linguistics: ACL 2026
Qingfei Zhao | Ruobing Wang | Dingling Xu | Daren Zha | Ma Bowen | Zhichun Wang | Shijie Jia | Limin Liu | Xin Wang
Findings of the Association for Computational Linguistics: ACL 2026
Large language models (LLMs) have notably progressed in multi-step and long-chain reasoning. However, extending their reasoning capabilities to encompass deep interactions with search remains a non-trivial challenge, as models often fail to identify optimal reasoning–search interaction trajectories, resulting in suboptimal responses. We propose R-Search, a novel reinforcement learning framework for Reasoning–Search integration, designed to enable LLMs to autonomously execute multi-step reasoning with deep search interaction, and learn optimal reasoning–search interaction trajectories via multi-reward signals, improving response quality in complex logic- and knowledge-intensive tasks. R-Search guides the LLM to dynamically decide when to search or reason, while globally integrating key evidence to enhance deep knowledge interaction between reasoning and search. During RL training, R-Search provides multi-type rewards to jointly optimize the reasoning–search trajectory. Experiments on seven datasets show that R-Search significantly outperforms mainstream RAG baselines.
EVGeoQA: Benchmarking LLMs on Dynamic, Multi-Objective Geo-Spatial Exploration
Jianfei Wu | Zhichun Wang | Zhensheng Wang | Zhiyu He
Findings of the Association for Computational Linguistics: ACL 2026
Jianfei Wu | Zhichun Wang | Zhensheng Wang | Zhiyu He
Findings of the Association for Computational Linguistics: ACL 2026
While Large Language Models (LLMs) demonstrate remarkable reasoning capabilities, their potential for purpose-driven exploration in dynamic geo-spatial environments remains under-investigated. Existing Geo-Spatial Question Answering (GSQA) benchmarks predominantly focus on static retrieval, failing to capture the complexity of real-world planning that involves dynamic user locations and compound constraints. To bridge this gap, we introduce EVGeoQA, a novel benchmark built upon Electric Vehicle (EV) charging scenarios that features a distinct location-anchored and dual-objective design. Specifically, each query in EVGeoQA is explicitly bound to a user’s real-time coordinate and integrates the dual objectives of a charging necessity and a co-located activity preference. To systematically assess models in such complex settings, we further propose GeoRover, a general evaluation framework based on a tool-augmented agent architecture to evaluate the LLMs’ capacity for dynamic, multi-objective exploration. Our experiments reveal that while LLMs successfully utilize tools to address sub-tasks, they struggle with long-range spatial exploration. Notably, we observe an emergent capability: LLMs can summarize historical exploration trajectories to enhance exploration efficiency. These findings establish EVGeoQA as a challenging testbed for future geo-spatial intelligence. The dataset and prompts are available at https://github.com/Hapluckyy/EVGeoQA/.
2025
DeMAC: Enhancing Multi-Agent Coordination with Dynamic DAG and Manager-Player Feedback
Yuhan Liu | Cong Xu | Lu Liu | Yihua Wang | Feiyu Chen | Qi Jia | Yaqian Zhao | Zhichun Wang | Xiang Li
Findings of the Association for Computational Linguistics: EMNLP 2025
Yuhan Liu | Cong Xu | Lu Liu | Yihua Wang | Feiyu Chen | Qi Jia | Yaqian Zhao | Zhichun Wang | Xiang Li
Findings of the Association for Computational Linguistics: EMNLP 2025
Multi-agent systems (MAS) powered by large language models (LLMs) have shown potential in tackling multifaceted problems through advanced understanding and reasoning. However, they struggle to adapt to evolving task dependencies and to handle uncertainties, such as shifting priorities or unpredictable disruptions. These constraints undermine their ability to dynamically adjust long-term strategies and inter-agent collaboration. To address these challenges, we propose DeMAC, a Dynamic Environment-Aware Manager-Player Agents Coordination framework that enhances multi-agent coordination through long-term strategic planning. DeMAC uses a dynamically updated directed acyclic graph (DAG) and a Manager-Player Dual-Feedback mechanism to align strategic and operational decisions. Moreover, DeMAC enables agents to maintain collaboration and dynamically adapt to changing environmental conditions, outperforming traditional reinforcement learning and human-agent collaboration in the Overcooked simulation. Experimental results highlight DeMAC’s ability to tackle complex coordination tasks, demonstrating its potential to advance LLM-based MAS in dynamic, complex task dependency environments.
2024
基于大模型数据增强的作文流畅性评价方法
Qianwen Peng (彭倩雯) | Yanzipeng Gao (高延子鹏) | Xiaoqing Li (李晓青) | Fanke Min (闵凡珂) | Mingrui Li (李明锐) | Zhichun Wang (王志春) | Tianyun Liu (刘天昀)
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 3: Evaluations)
Qianwen Peng (彭倩雯) | Yanzipeng Gao (高延子鹏) | Xiaoqing Li (李晓青) | Fanke Min (闵凡珂) | Mingrui Li (李明锐) | Zhichun Wang (王志春) | Tianyun Liu (刘天昀)
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 3: Evaluations)
“CCL2024-Eval任 务7为 中 小 学 生 作 文 流 畅 性 评 价 (Chinese Essay Fluency Evalua-tion,CEFE),该任务定义了三项重要且富有挑战性的问题,包括中小学作文病句类型识别、中小学作文病句改写、以及中小学作文流畅性评级。本队伍参加了评测任务7的三项子任务,分别获得了45.19、43.90和45.84的得分。本报告详细介绍本队伍在三个子任务上采用的技术方法,并对评测结果进行分析。”
2020
Knowledge Graph Alignment with Entity-Pair Embedding
Zhichun Wang | Jinjian Yang | Xiaoju Ye
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Zhichun Wang | Jinjian Yang | Xiaoju Ye
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Knowledge Graph (KG) alignment is to match entities in different KGs, which is important to knowledge fusion and integration. Recently, a number of embedding-based approaches for KG alignment have been proposed and achieved promising results. These approaches first embed entities in low-dimensional vector spaces, and then obtain entity alignments by computations on their vector representations. Although continuous improvements have been achieved by recent work, the performances of existing approaches are still not satisfactory. In this work, we present a new approach that directly learns embeddings of entity-pairs for KG alignment. Our approach first generates a pair-wise connectivity graph (PCG) of two KGs, whose nodes are entity-pairs and edges correspond to relation-pairs; it then learns node (entity-pair) embeddings of the PCG, which are used to predict equivalent relations of entities. To get desirable embeddings, a convolutional neural network is used to generate similarity features of entity-pairs from their attributes; and a graph neural network is employed to propagate the similarity features and get the final embeddings of entity-pairs. Experiments on five real-world datasets show that our approach can achieve the state-of-the-art KG alignment results.
2019
MAssistant: A Personal Knowledge Assistant for MOOC Learners
Lan Jiang | Shuhan Hu | Mingyu Huang | Zhichun Wang | Jinjian Yang | Xiaoju Ye | Wei Zheng
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations
Lan Jiang | Shuhan Hu | Mingyu Huang | Zhichun Wang | Jinjian Yang | Xiaoju Ye | Wei Zheng
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations
Massive Open Online Courses (MOOCs) have developed rapidly and attracted large number of learners. In this work, we present MAssistant system, a personal knowledge assistant for MOOC learners. MAssistant helps users to trace the concepts they have learned in MOOCs, and to build their own concept graphs. There are three key components in MAssistant: (i) a large-scale concept graph built from open data sources, which contains concepts in various domains and relations among them; (ii) a browser extension which interacts with learners when they are watching video lectures, and presents important concepts to them; (iii) a web application allowing users to explore their personal concept graphs, which are built based on their learning activities on MOOCs. MAssistant will facilitate the knowledge management task for MOOC learners, and make the learning on MOOCs easier.
2018
Cross-lingual Knowledge Graph Alignment via Graph Convolutional Networks
Zhichun Wang | Qingsong Lv | Xiaohan Lan | Yu Zhang
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Zhichun Wang | Qingsong Lv | Xiaohan Lan | Yu Zhang
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Multilingual knowledge graphs (KGs) such as DBpedia and YAGO contain structured knowledge of entities in several distinct languages, and they are useful resources for cross-lingual AI and NLP applications. Cross-lingual KG alignment is the task of matching entities with their counterparts in different languages, which is an important way to enrich the cross-lingual links in multilingual KGs. In this paper, we propose a novel approach for cross-lingual KG alignment via graph convolutional networks (GCNs). Given a set of pre-aligned entities, our approach trains GCNs to embed entities of each language into a unified vector space. Entity alignments are discovered based on the distances between entities in the embedding space. Embeddings can be learned from both the structural and attribute information of entities, and the results of structure embedding and attribute embedding are combined to get accurate alignments. In the experiments on aligning real multilingual KGs, our approach gets the best performance compared with other embedding-based KG alignment approaches.
Knowledge Graph Embedding with Numeric Attributes of Entities
Yanrong Wu | Zhichun Wang
Proceedings of the Third Workshop on Representation Learning for NLP
Yanrong Wu | Zhichun Wang
Proceedings of the Third Workshop on Representation Learning for NLP
Knowledge Graph (KG) embedding projects entities and relations into low dimensional vector space, which has been successfully applied in KG completion task. The previous embedding approaches only model entities and their relations, ignoring a large number of entities’ numeric attributes in KGs. In this paper, we propose a new KG embedding model which jointly model entity relations and numeric attributes. Our approach combines an attribute embedding model with a translation-based structure embedding model, which learns the embeddings of entities, relations, and attributes simultaneously. Experiments of link prediction on YAGO and Freebase show that the performance is effectively improved by adding entities’ numeric attributes in the embedding model.
Search
Fix author
Co-authors
- Mingrui Li (李明锐) 2
- Ruobing Wang 2
- Dingling Xu 2
- Jinjian Yang 2
- Xiaoju Ye 2
- Daren Zha 2
- Qingfei Zhao 2
- Zhiyong Bai 1
- Ma Bowen 1
- Feiyu Chen 1
- Yanzipeng Gao 1
- Yiming Guan 1
- Xu Han 1
- Zhiyu He 1
- Shuhan Hu 1
- Mingyu Huang 1
- Qi Jia 1
- Shijie Jia 1
- Lan Jiang 1
- Xiaohan Lan 1
- Xiang Li 1
- Xiaoqing Li 1
- Zhenghao Liu (刘正皓) 1
- Yuhan Liu 1
- Lu Liu 1
- Tianyun Liu 1
- Limin Liu 1
- Tong Lu 1
- Qingsong Lv 1
- Fanke Min 1
- Qianwen Peng 1
- Yuanhao Sun 1
- Maosong Sun (孙茂松) 1
- Shuo Wang 1
- Yihua Wang 1
- Xin Wang 1
- Zhensheng Wang 1
- Yanrong Wu 1
- Jianfei Wu 1
- Cong Xu 1
- Yukun Yan (闫宇坤) 1
- Shi Yu (于是) 1
- Yu Zhang 1
- Yaqian Zhao 1
- Wei Zheng 1
- Yaoyu Zhou 1