Dingyu Zhang

2026

OZemi at SemEval-2026 Task 9: A Cross-Lingual Approach to Online Text Polarization Classification Using Multilingual Models and Adaptive Loss Formulation
Hidetsune Takahashi | Eleale Nusi Tee | Aika Yu | Ruri Furukawa | Sooeun Kim | Shuta Niinomi | Dingyu Zhang | Emily Ohman
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)

This paper presents the OZemi team’s submission to SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization.We propose a unified multilingual approach that addresses multiple languages and subtasks efficiently. Our system combines multilingual models with data-level techniques and a class-weighted cross-entropy loss to mitigate data imbalance across languages, subtasks, and categories. Results show consistent performance across languages, achieving macro F1 scores above 70% in most languages for Subtask 1 achieving our highest rank in subtask 1 for Persian (1 out of 44). These results suggest that the proposed framework provides a flexible foundation for multilingual and multi-task polarization analysis.

2025

pdf bib abs

AI agents have drawn increasing attention mostly on their ability to perceive environments, understand tasks, and autonomously achieve goals. To advance research on AI agents in mobile scenarios, we introduce the Android Multi-annotation EXpo (AMEX), a comprehensive, large-scale dataset designed for generalist mobile GUI-control agents which are capable of completing tasks by directly interacting with the graphical user interface (GUI) on mobile devices. AMEX comprises over 104K high-resolution screenshots from popular mobile applications, which are annotated at multiple levels. Unlike existing GUI-related datasets, e.g., Rico, AitW, etc., AMEX includes three levels of annotations: GUI interactive element grounding, GUI screen and element functionality descriptions, and complex natural language instructions with stepwise GUI-action chains. We develop this dataset from a more instructive and detailed perspective, complementing the general settings of existing datasets. Additionally, we finetune a baseline model SPHINX Agent and illustrate the effectiveness of AMEX.

Co-authors

Hidetsune Takahashi 1

Guozhi Wang 1

Han Xiao 1

Aika Yu 1

Venues

Fix author