Biao Yi
Other people with similar names: Biao Yi
Unverified author pages with similar names: Biao Yi
2026
What’s Missing in Screen-to-Action? Towards a UI-in-the-Loop Paradigm for Multimodal GUI Reasoning
Songze Li | Xiaoke Guo | Tianqi Liu | Biao Yi | Zhaoyan Gong | Zhiqiang Liu | Huajun Chen | Wen Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Songze Li | Xiaoke Guo | Tianqi Liu | Biao Yi | Zhaoyan Gong | Zhiqiang Liu | Huajun Chen | Wen Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Existing Graphical User Interface (GUI) reasoning tasks remain challenging, particularly in UI understanding. Current methods typically rely on direct screen-based decision-making, which lacks interpretability and overlooks a comprehensive understanding of UI elements, ultimately leading to task failure. To enhance the understanding and interaction with UIs, we propose an innovative GUI reasoning paradigm called ***UI-in-the-Loop*** (UILoop). Our approach treats the GUI reasoning task as a cyclic ***Screen-UI elements-Action*** process. By enabling Multimodal Large Language Models (MLLMs) to explicitly learn the localization, semantic functions, and practical usage of key UI elements, UILoop achieves precise element discovery and performs interpretable reasoning. Furthermore, we introduce a more challenging ***UI Comprehension*** task centered on UI elements with three evaluation metrics. Correspondingly, we contribute a benchmark of 26K samples (UI Comprehension-Bench) to comprehensively evaluate existing methods’ mastery of UI elements. Extensive experiments demonstrate that UILoop achieves state-of-the-art UI understanding performance while yielding superior results in GUI reasoning tasks.
2025
OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use
Xueyu Hu | Tao Xiong | Biao Yi | Zishu Wei | Ruixuan Xiao | Yurun Chen | Jiasheng Ye | Meiling Tao | Xiangxin Zhou | Ziyu Zhao | Yuhuai Li | Shengze Xu | Shenzhi Wang | Xinchen Xu | Shuofei Qiao | Zhaokai Wang | Kun Kuang | Tieyong Zeng | Liang Wang | Jiwei Li | Yuchen Eleanor Jiang | Wangchunshu Zhou | Guoyin Wang | Keting Yin | Zhou Zhao | Hongxia Yang | Fan Wu | Shengyu Zhang | Fei Wu
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Xueyu Hu | Tao Xiong | Biao Yi | Zishu Wei | Ruixuan Xiao | Yurun Chen | Jiasheng Ye | Meiling Tao | Xiangxin Zhou | Ziyu Zhao | Yuhuai Li | Shengze Xu | Shenzhi Wang | Xinchen Xu | Shuofei Qiao | Zhaokai Wang | Kun Kuang | Tieyong Zeng | Liang Wang | Jiwei Li | Yuchen Eleanor Jiang | Wangchunshu Zhou | Guoyin Wang | Keting Yin | Zhou Zhao | Hongxia Yang | Fan Wu | Shengyu Zhang | Fei Wu
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
The dream to create AI assistants as capable and versatile as the fictional J.A.R.V.I.S from Iron Man has long captivated imaginations. With the evolution of multi-modal large language models ((M)LLMs), this dream is closer to reality, as (M)LLM-based Agents using computers, mobile phones and web browsers by operating within the environments and interfaces (e.g., Graphical User Interface (GUI) and Command Line Interface (CLI)) provided by operating systems (OS) to automate tasks have significantly advanced. This paper presents a comprehensive survey on these advanced agents, designated as OS Agents. We begin by elucidating the fundamentals of OS Agents, exploring their key components and capabilities. We then examine methodologies for constructing OS Agents, focusing on domain-specific foundation models and agent frameworks. A detailed review of evaluation metrics and benchmarks highlights how OS Agents are assessed across diverse platforms and tasks. Finally, we discuss current challenges and identify promising directions for future research. An open-source GitHub repository is maintained as a dynamic resource to foster further innovation in this field.
Search
Fix author
Co-authors
- Yurun Chen 1
- Huajun Chen 1
- Zhaoyan Gong 1
- Xiaoke Guo 1
- Xueyu Hu 1
- Yuchen Eleanor Jiang 1
- Kun Kuang 1
- Yuhuai Li 1
- Jiwei Li 1
- Songze Li 1
- Tianqi Liu 1
- Zhiqiang Liu (刘志强) 1
- Shuofei Qiao 1
- Meiling Tao 1
- Shenzhi Wang 1
- Zhaokai Wang 1
- Liang Wang 1
- Guoyin Wang 1
- Zishu Wei 1
- Fan Wu (吴凡, 吴钒) 1
- Fei Wu 1
- Ruixuan Xiao 1
- Tao Xiong 1
- Shengze Xu 1
- Xinchen Xu 1
- Hongxia Yang 1
- Jiasheng Ye 1
- Keting Yin 1
- Tieyong Zeng 1
- Shengyu Zhang 1
- Wen Zhang 1
- Ziyu Zhao 1
- Zhou Zhao 1
- Xiangxin Zhou 1
- Wangchunshu Zhou 1