Anson Bastos
2026
Learning Optimal Message Representations for Agentic Communication
Shashwat Gupta | Anson Bastos | Mayukh Das | Supriyo Ghosh | Nagarajan Natarajan | Chetan Bansal | Saravan Rajmohan
Findings of the Association for Computational Linguistics: ACL 2026
Shashwat Gupta | Anson Bastos | Mayukh Das | Supriyo Ghosh | Nagarajan Natarajan | Chetan Bansal | Saravan Rajmohan
Findings of the Association for Computational Linguistics: ACL 2026
Large Language Models (LLMs) have demonstrated remarkable capabilities in agentic collaborative problem-solving, albeit a gap exists. Existing frameworks predominantly rely on natural language as a primary representation (format) for agentic communication. However natural language could be ambiguous and verbose. Furthermore, recent works have shown that alternative representations can enhance performance in LLMs on certain tasks. But current approaches lack the intelligence necessary to understand, learn or apply optimal communication representations adaptively. In this paper, we propose to dynamically learn the optimal message representations to enhance agentic performance. We model the optimization problem as an Expanding Markov Decision Process (EMDP) and propose our method named OPTiMACS. We evaluate our system across benchmark datasets of collaborative problem-solving. The results show significant performance improvements while maintaining efficiency. Our work bridges the gap between rigid communication protocols and open-ended natural language by providing an adaptive framework that learns task-aware structural representations.
SynthAgent: Adapting Web Agents with Synthetic Supervision
Zhaoyang Wang | Yiming Liang | Xuchao Zhang | Qianhui Wu | Siwei Han | Anson Bastos | Rujia Wang | Chetan Bansal | Baolin Peng | Jianfeng Gao | Saravan Rajmohan | Huaxiu Yao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Zhaoyang Wang | Yiming Liang | Xuchao Zhang | Qianhui Wu | Siwei Han | Anson Bastos | Rujia Wang | Chetan Bansal | Baolin Peng | Jianfeng Gao | Saravan Rajmohan | Huaxiu Yao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Web agents struggle to adapt to new websites due to the scarcity of environment specific tasks and demonstrations. Recent works have explored synthetic data generation to address this challenge, however, they suffer from data quality issues where synthesized tasks contain hallucinations that cannot be executed, and collected trajectories are noisy with redundant or misaligned actions. In this paper, we propose SynthAgent, a fully synthetic supervision framework that aims at improving synthetic data quality via dual refinement of both tasks and trajectories. Our approach begins by synthesizing diverse tasks through categorized exploration of web elements, ensuring efficient coverage of the target environment. During trajectory collection, tasks are refined only when conflicts with observations are detected, which mitigates hallucinations while preserving task consistency. After collection, we conduct trajectory refinement with global context to mitigate potential noise or misalignments. Finally, we fine-tune open-source web agents on the refined synthetic data to adapt them to the target environment. Experimental results demonstrate that SynthAgent outperforms existing synthetic data methods, validating the importance of high-quality synthetic supervision. The code is publicly available at https://github.com/aiming-lab/SynthAgent.