Dingyu Zhang


2026

This paper presents the OZemi team’s submission to SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization.We propose a unified multilingual approach that addresses multiple languages and subtasks efficiently. Our system combines multilingual models with data-level techniques and a class-weighted cross-entropy loss to mitigate data imbalance across languages, subtasks, and categories. Results show consistent performance across languages, achieving macro F1 scores above 70% in most languages for Subtask 1 achieving our highest rank in subtask 1 for Persian (1 out of 44). These results suggest that the proposed framework provides a flexible foundation for multilingual and multi-task polarization analysis.

2025

AI agents have drawn increasing attention mostly on their ability to perceive environments, understand tasks, and autonomously achieve goals. To advance research on AI agents in mobile scenarios, we introduce the Android Multi-annotation EXpo (AMEX), a comprehensive, large-scale dataset designed for generalist mobile GUI-control agents which are capable of completing tasks by directly interacting with the graphical user interface (GUI) on mobile devices. AMEX comprises over 104K high-resolution screenshots from popular mobile applications, which are annotated at multiple levels. Unlike existing GUI-related datasets, e.g., Rico, AitW, etc., AMEX includes three levels of annotations: GUI interactive element grounding, GUI screen and element functionality descriptions, and complex natural language instructions with stepwise GUI-action chains. We develop this dataset from a more instructive and detailed perspective, complementing the general settings of existing datasets. Additionally, we finetune a baseline model SPHINX Agent and illustrate the effectiveness of AMEX.