UICOMPASS: UI Map Guided Mobile Task Automation via Adaptive Action Generation

Yuanzhang Lin, Zhe Zhang, He Rui, Qingao Dong, Mingyi Zhou, Jing Zhang, Xiang Gao, Hailong Sun


Abstract
Mobile task automation is an emerging technology that leverages AI to automatically execute routine tasks by users’ commands on mobile devices like Android, thus enhancing efficiency and productivity. While large language models (LLMs) excel at general mobile tasks through training on massive datasets, they struggle with app-specific workflows. To solve this problem, we designed UI Map, a structured representation of target app’s UI information. We further propose a UI Map-guided LLM-based approach UICompass to automate mobile tasks. Specifically, UICompass first leverages static analysis and LLMs to automatically build UI Map from either source codes of apps or byte codes (i.e., APK packages). During task execution, UICompass mines the task-relevant information from UI Map to feed into the LLMs, generate a planned paths, and adaptively adjust the path based on the actual app state and action history. Experimental results demonstrate that UICompass achieves a 15.87% higher task executing success rate than SOTA approaches. Even when only APK is available, UICompass maintains superior performance, demonstrating its applicability to closed-source apps.
Anthology ID:
2025.emnlp-main.1346
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
26497–26517
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1346/
DOI:
Bibkey:
Cite (ACL):
Yuanzhang Lin, Zhe Zhang, He Rui, Qingao Dong, Mingyi Zhou, Jing Zhang, Xiang Gao, and Hailong Sun. 2025. UICOMPASS: UI Map Guided Mobile Task Automation via Adaptive Action Generation. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 26497–26517, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
UICOMPASS: UI Map Guided Mobile Task Automation via Adaptive Action Generation (Lin et al., EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1346.pdf
Checklist:
 2025.emnlp-main.1346.checklist.pdf