Ying Huang
2026
ProUIE: A Macro-to-Micro Progressive Learning Method for LLM-based Universal Information Extraction
Wenda Liu | Song Zhigang | Shuai Nie | Guangyao Liu | Lisung Chen | Binyu Yang | Yaran Chen | Peng Zhou | Hongzhen Wang | Yuchen Liu | Wenyue Hu | Jiaming Xu | Runyu Shi | Ying Huang
Findings of the Association for Computational Linguistics: ACL 2026
Wenda Liu | Song Zhigang | Shuai Nie | Guangyao Liu | Lisung Chen | Binyu Yang | Yaran Chen | Peng Zhou | Hongzhen Wang | Yuchen Liu | Wenyue Hu | Jiaming Xu | Runyu Shi | Ying Huang
Findings of the Association for Computational Linguistics: ACL 2026
LLM-based universal information extraction (UIE) methods often rely on additional information beyond the original training data, which increases training complexity yet often yields limited gains. To address this, we propose ProUIE, a Macro-to-Micro progressive learning approach that improves UIE without introducing any external information. ProUIE consists of three stages: (i) macro-level Complete Modeling (CM), which learns NER, RE, and EE along their intrinsic difficulty order on the full training data to build a unified extraction foundation, (ii) meso-level Streamlined Alignment (SA), which operates on sampled data with simplified target formats, streamlining and regularizing structured outputs to make them more concise and controllable, and (iii) micro-level Deep Exploration (DE), which applies GRPO with stepwise fine-grained rewards (SFR) over structural units to guide exploration and improve performance. Experiments on 36 public datasets show that ProUIE consistently improves unified extraction, outperforming strong instruction-tuned baselines on average for NER and RE while using a smaller backbone, and it further demonstrates clear gains in production-oriented information extraction.
From Experience to Skill: Multi-Agent Generative Engine Optimization via Reusable Strategy Learning
Beining Wu | Fuyou Mao | Jiong Lin | Cheng Yang | Jiaxuan Lu | Yifu Guo | Siyu Zhang | Yifan Wu | Ying Huang | Fu Li
Findings of the Association for Computational Linguistics: ACL 2026
Beining Wu | Fuyou Mao | Jiong Lin | Cheng Yang | Jiaxuan Lu | Yifu Guo | Siyu Zhang | Yifan Wu | Ying Huang | Fu Li
Findings of the Association for Computational Linguistics: ACL 2026
Generative engines (GEs) are reshaping information access by replacing ranked links with citation-grounded answers, yet current Generative Engine Optimization (GEO) methods optimize each instance in isolation, unable to accumulate or transfer effective strategies across tasks and engines. We reframe GEO as a strategy learning problem and propose MAGEO, a multi-agent framework in which coordinated planning, editing, and fidelity-aware evaluation serve as the execution layer, while validated editing patterns are progressively distilled into reusable, engine-specific optimization skills. To enable controlled assessment, we introduce a Twin Branch Evaluation Protocol for causal attribution of content edits and DSV-CF, a dual-axis metric that unifies semantic visibility with attribution accuracy. We further release MSME-GEO-Bench, a multi-scenario, multi-engine benchmark grounded in real-world queries. Experiments on three mainstream engines show that MAGEO substantially outperforms heuristic baselines in both visibility and citation fidelity, with ablations confirming that engine-specific preference modeling and strategy reuse are central to these gains, suggesting a scalable learning-driven paradigm for trustworthy GEO. Code is available at https://github.com/Wu-beining/MAGEO.