Yuwei Wang
2026
Diagnosing and Remedying Representation Deficiencies for Deterministic Reasoning in KGQA
Gewen Liang | Mufan Xu | Kehai Chen | Wei Wang | Yuwei Wang | Muyun Yang | Tiejun Zhao | Min Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Gewen Liang | Mufan Xu | Kehai Chen | Wei Wang | Yuwei Wang | Muyun Yang | Tiejun Zhao | Min Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Large language models (LLMs) have demonstrated increasingly strong reasoning capabilities, achieving remarkable progress in knowledge graph question answering (KGQA). However, a key challenge in such systems is non-deterministic reasoning, where the model indecisively activates multiple semantically related knowledge graph edges for a given query, frequently leading to incorrect answers. To address this issue, we propose Diagnosing and Remedying Representation Deficiencies for Deterministic Reasoning in KGQA (DR2). DR2 identifies and localizes non-deterministic reasoning behaviors, uncovering the underlying semantic representation deficiencies in LLMs. Building on this diagnosis, we design abductive reasoning-based preference learning, which promotes fine-grained semantic discrimination and mitigates non-deterministic reasoning errors. Experimental results demonstrate that the proposed DR2 significantly outperforms several strong baselines, achieving state-of-the-art performance on the widely used WebQSP and CWQ benchmarks.
Beyond Unimodal Shortcuts: MLLMs as Cross-Modal Reasoners for Grounded Named Entity Recognition
Jinlong Ma | Yu Zhang | Xuefeng Bai | Kehai Chen | Yuwei Wang | Zeming Liu | Jun Yu | Min Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Jinlong Ma | Yu Zhang | Xuefeng Bai | Kehai Chen | Yuwei Wang | Zeming Liu | Jun Yu | Min Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Grounded Multimodal Named Entity Recognition (GMNER) aims to extract text-based entities, assign them semantic categories, and ground them to corresponding visual regions. In this work, we explore the potential of Multimodal Large Language Models (MLLMs) to perform GMNER in an end-to-end manner, moving beyond their typical role as auxiliary tools within cascaded pipelines.Crucially, our investigation reveals a fundamental challenge: MLLMs exhibit modality bias, including visual bias and textual bias, which stems from their tendency to take unimodal shortcuts rather than rigorous cross-modal verification.To address this, we propose Modality-aware Consistency Reasoning (MCR), which enforces structured cross-modal reasoning through Multi-style Reasoning Schema Injection (MRSI) and Constraint-guided Verifiable Optimization (CVO). MRSI transforms abstract constraints into executable reasoning chains, while CVO empowers the model to dynamically align its reasoning trajectories with Group Relative Policy Optimization (GRPO).Experiments on GMNER and visual grounding tasks demonstrate that MCR effectively mitigates modality bias and achieves superior performance compared to existing baselines.
ZoFia: Zero-Shot Fake News Detection with Entity-Guided Retrieval and Multi-LLM Interaction
Lvhua Wu | Xuefeng Jiang | Sheng Sun | Yan Lei | Tian Wen | Yuwei Wang | Min Liu
Findings of the Association for Computational Linguistics: ACL 2026
Lvhua Wu | Xuefeng Jiang | Sheng Sun | Yan Lei | Tian Wen | Yuwei Wang | Min Liu
Findings of the Association for Computational Linguistics: ACL 2026
The rapid spread of fake news threatens social stability and public trust, highlighting the urgent need for its effective detection.Although large language models (LLMs) show potential in fake news detection, they are limited by knowledge cutoff and easily generate factual hallucinations when handling time-sensitive news.Furthermore, the thinking of a single LLM easily falls into early stance locking and confirmation bias, making it hard to handle both content reasoning and fact checking simultaneously.To address these challenges, we propose ZoFia, a two-stage zero-shot fake news detection framework.In the first retrieval stage, we propose novel Hierarchical Salience and Salience-Calibrated Minimum Marginal Relevance (SC-MMR) algorithm to extract core entities accurately, which drive dual-source retrieval to overcome knowledge and evidence gaps.In the subsequent stage, a multi-agent system conducts multi-perspective reasoning and verification in parallel and achieves an explainable and robust result via adversarial debate.Comprehensive experiments on two public datasets show that ZoFia outperforms existing zero-shot baselines and even most few-shot methods.Our code has been open-sourced to facilitate the research community at https://github.com/SakiRinn/ZoFia.
2025
MultiCAT: Multimodal Communication Annotations for Teams
Adarsh Pyarelal | John M Culnan | Ayesha Qamar | Meghavarshini Krishnaswamy | Yuwei Wang | Cheonkam Jeong | Chen Chen | Md Messal Monem Miah | Shahriar Hormozi | Jonathan Tong | Ruihong Huang
Findings of the Association for Computational Linguistics: NAACL 2025
Adarsh Pyarelal | John M Culnan | Ayesha Qamar | Meghavarshini Krishnaswamy | Yuwei Wang | Cheonkam Jeong | Chen Chen | Md Messal Monem Miah | Shahriar Hormozi | Jonathan Tong | Ruihong Huang
Findings of the Association for Computational Linguistics: NAACL 2025
Successful teamwork requires team members to understand each other and communicate effectively, managing multiple linguistic and paralinguistic tasks at once. Because of the potential for interrelatedness of these tasks, it is important to have the ability to make multiple types of predictions on the same dataset. Here, we introduce Multimodal Communication Annotations for Teams (MultiCAT), a speech- and text-based dataset consisting of audio recordings, automated and hand-corrected transcriptions. MultiCAT builds upon data from teams working collaboratively to save victims in a simulated search and rescue mission, and consists of annotations and benchmark results for the following tasks: (1) dialog act classification, (2) adjacency pair detection, (3) sentiment and emotion recognition, (4) closed-loop communication detection, and (5) vocal (phonetic) entrainment detection. We also present exploratory analyses on the relationship between our annotations and team outcomes. We posit that additional work on these tasks and their intersection will further improve understanding of team communication and its relation to team performance. Code & data: https://doi.org/10.5281/zenodo.14834835
2022
Rule Based Event Extraction for Artificial Social Intelligence
Remo Nitschke | Yuwei Wang | Chen Chen | Adarsh Pyarelal | Rebecca Sharp
Proceedings of the First Workshop on Pattern-based Approaches to NLP in the Age of Deep Learning
Remo Nitschke | Yuwei Wang | Chen Chen | Adarsh Pyarelal | Rebecca Sharp
Proceedings of the First Workshop on Pattern-based Approaches to NLP in the Age of Deep Learning
Natural language (as opposed to structured communication modes such as Morse code) is by far the most common mode of communication between humans, and can thus provide significant insight into both individual mental states and interpersonal dynamics. As part of DARPA’s Artificial Social Intelligence for Successful Teams (ASIST) program, we are developing an AI agent team member that constructs and maintains models of their human teammates and provides appropriate task-relevant advice to improve team processes and mission performance. One of the key components of this agent is a module that uses a rule-based approach to extract task-relevant events from natural language utterances in real time, and publish them for consumption by downstream components. In this case study, we evaluate the performance of our rule-based event extraction system on a recently conducted ASIST experiment consisting of a simulated urban search and rescue mission in Minecraft. We compare the performance of our approach with that of a zero-shot neural classifier, and find that our approach outperforms the classifier for all event types, even when the classifier is used in an oracle setting where it knows how many events should be extracted from each utterance.
Search
Fix author
Co-authors
- Kehai Chen (陈科海) 2
- Adarsh Pyarelal 2
- Min Zhang 2
- Xuefeng Bai (白雪峰) 1
- Chen Chen 1
- Chen Chen 1
- John M Culnan 1
- Shahriar Hormozi 1
- Ruihong Huang 1
- Cheonkam Jeong 1
- Xuefeng Jiang 1
- Meghavarshini Krishnaswamy 1
- Yan Lei 1
- Gewen Liang 1
- Zeming Liu 1
- Min Liu 1
- Jinlong Ma 1
- Md Messal Monem Miah 1
- Remo Nitschke 1
- Ayesha Qamar 1
- Rebecca Sharp 1
- Sheng Sun 1
- Jonathan Tong 1
- Wei Wang 1
- Tian Wen 1
- Lvhua Wu 1
- Mufan Xu 1
- Muyun Yang (杨沐昀) 1
- Jun Yu 1
- Yu Zhang 1
- Tiejun Zhao (赵铁军) 1