Xinyi Wang
Other people with similar names: Xinyi Wang, Xinyi Wang
Unverified author pages with similar names: Xinyi Wang
2026
MTP-RL: Acceleration of Reinforcement Learning Rollouts with Policy-Aligned Multi-Token Prediction
Ke Wang | Aohan Zeng | Zhengxiao Du | Yuxuan Hu | Bohan Zhang | Xinyi Wang | Jie Tang | Jing Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Ke Wang | Aohan Zeng | Zhengxiao Du | Yuxuan Hu | Bohan Zhang | Xinyi Wang | Jie Tang | Jing Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Reinforcement learning (RL) is widely applied to boost the performance of pretrained models, yet its training efficiency is severely constrained by rollout generation. While speculative decoding based on multi-token prediction (MTP) offers a potential acceleration pathway, its widespread adoption is hindered by the absence of MTP in vanilla pretrained models and the rapid degradation of the MTP acceptance length in RL training. To address these issues, this paper proposes MTP-RL, a two-stage framework that pioneers effective training of MTPs in RL and accelerates the rollout phase for diverse models. It involves a pipeline to equip the multi-layer parameter-sharing MTP for all models and an innovative advantage-aware MTP optimization strategy to facilitate policy-aligned training of MTPs. Experiments demonstrate that our method not only achieves stable growth of acceptance length during RL training, but also accelerates RL rollouts, achieving an average 23.1%–55.3% reduction in rollout time compared to baselines.
GUI0: Self-Evolving Foundational GUI Agents in Super App Ecosystems
Xinyi Wang | Wei Dai | Kyle Qiao | Ke Wang | Peng Chen | Gang Cao | Kangqin | Zhongpu Wang | Xiaode Zhang | Yanming Liu | Jihao Gu | Jingtao Xu | Gong Zhi
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Xinyi Wang | Wei Dai | Kyle Qiao | Ke Wang | Peng Chen | Gang Cao | Kangqin | Zhongpu Wang | Xiaode Zhang | Yanming Liu | Jihao Gu | Jingtao Xu | Gong Zhi
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Automated interaction with graphical user interfaces (GUIs) is central to General Artificial Intelligence yet remains challenging within Super App ecosystems, characterized by non-standard rendering and absent accessibility metadata. While GUI agents often rely on explicit accessibility trees or static imitation, they are less explored for dynamic environments marked by sparse feedback and implicit visual cues. We present GUI0, a framework synergizing autonomous data synthesis with dual-agent co-evolution. GUI0 establishes a domain-aware foundation model via synthesized corpora and employs curriculum-driven reinforcement learning, where a curriculum agent generates boundary tasks to optimize an actor agent.Empirical results demonstrate three key advantages: (1) State-of-the-art performance on the SuperAPP benchmark, outperforming Gemini-2.5-Pro and Claude-4-Sonnet; (2) universal efficacy across diverse base models, consistently yielding substantial improvements on both Qwen2.5-VL and GUI-Owl variants; and (3) robust zero-shot generalization to standard GUIs (e.g., +62.7% on ScreenSpot Pro).
ToolGate: Contract-Grounded and Verified Tool Execution for LLMs
Yanming Liu | Xinyue Peng | Jiannan Cao | Xinyi Wang | Songhang Deng | Jintao Chen | Jianwei Yin | Xuhong Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Yanming Liu | Xinyue Peng | Jiannan Cao | Xinyi Wang | Songhang Deng | Jintao Chen | Jianwei Yin | Xuhong Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Large Language Models (LLMs) augmented with external tools have demonstrated remarkable capabilities in complex reasoning tasks. However, existing frameworks rely heavily on natural language reasoning to determine when tools can be invoked and whether their results should be committed, lacking formal guarantees for logical safety and verifiability. We present ToolGate, a forward execution framework that provides logical safety guarantees and verifiable state evolution for LLM tool calling. ToolGate maintains an explicit symbolic state space as a typed key-value mapping representing trusted world information throughout the reasoning process. Each tool is formalized as a Hoare-style contract consisting of a precondition and a postcondition, where the precondition gates tool invocation by checking whether the current state satisfies the required conditions, and the postcondition determines whether the tool’s result can be committed to update the state through runtime verification. Our approach guarantees that the symbolic state evolves only through verified tool executions, preventing invalid or hallucinated results from corrupting the world representation. Experimental validation demonstrates that ToolGate significantly improves the reliability and verifiability of tool-augmented LLM systems while maintaining competitive performance on complex multi-step reasoning tasks. This work establishes a foundation for building more trustworthy and debuggable AI systems that integrate language models with external tools.
2025
RACQC: Advanced Retrieval-Augmented Generation for Chinese Query Correction
Jinbo Su | Lingzhe Gao | Wei Li | Shihao Liu | Haojie Lei | Xinyi Wang | Yuanzhao Guo | Ke Wang | Daiting Shi | Dawei Yin
Findings of the Association for Computational Linguistics: EMNLP 2025
Jinbo Su | Lingzhe Gao | Wei Li | Shihao Liu | Haojie Lei | Xinyi Wang | Yuanzhao Guo | Ke Wang | Daiting Shi | Dawei Yin
Findings of the Association for Computational Linguistics: EMNLP 2025
In web search scenarios, erroneous queries frequently degrade users’ experience through irrelevant results, underscoring the pivotal role of Chinese Spelling Check (CSC) systems. Although large language models (LLMs) exhibit remarkable capabilities across many tasks, they face critical challenges in the CSC scenario: (1) poor generalization to rare entities in open-domain searches, and (2) failure to adapt to temporal entity variations due to static parameters, resulting in serious over-correction issues. To tackle this, we present RACQC, a Chinese Query Correction system with Retrieval-Augmented Generation (RAG) and multi-task learning. Specifically, our approach (1) integrates dynamic knowledge retrieval through entity-centric RAG to address rare entities and innovatively proposes an entity-title collaborative corpus, and (2) employs contrastive correction tasks to mitigate LLM over-correction tendencies. Furthermore, we propose MDCQC, a Multi-Domain Chinese Query Correction benchmark to test the model’s entity correction capabilities. Extensive experiments on several datasets show that RACQC significantly outperforms existing baselines in CSC tasks. Specifically, RACQC achieves a maximum improvement of +9.92% on the search scenario benchmark and +3.2% on the general-domain dataset under the F1 metric.
Search
Fix author
Co-authors
- Ke Wang 3
- Yanming Liu 2
- Gang Cao 1
- Jiannan Cao 1
- Peng Chen 1
- Jintao Chen 1
- Wei Dai 1
- Songhang Deng 1
- Zhengxiao Du 1
- Lingzhe Gao 1
- Jihao Gu 1
- Yuanzhao Guo 1
- Yuxuan Hu 1
- Kangqin 1
- Haojie Lei 1
- Wei Li 1
- Shihao Liu 1
- Xinyue Peng 1
- Kyle Qiao 1
- Daiting Shi 1
- Jinbo Su 1
- Jie Tang 1
- Zhongpu Wang 1
- Jingtao Xu 1
- Dawei Yin 1
- Jianwei Yin 1
- Aohan Zeng 1
- Bohan Zhang 1
- Jing Zhang 1
- Xiaode Zhang 1
- Xuhong Zhang 1
- Gong Zhi 1