Hangchao Ma
2026
PuerAI at SemEval-2026 Task 5: Homograph Appropriateness Assessment via DeBERTa Contrastive Regression and Contextual Grouping
Jiaxu Dao | Zhuoying Li | Hangchao Ma | Jinli Tong | Xiaoli Lan | Yifan Lu | Zhanji Yang
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Jiaxu Dao | Zhuoying Li | Hangchao Ma | Jinli Tong | Xiaoli Lan | Yifan Lu | Zhanji Yang
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
To assess homograph appropriateness in narrative contexts for SemEval-2026 Task 5, we propose a contrastive regression framework. This approach combines candidate sense definitions with full narrative texts to establish an MSE regression baseline, further enhanced by a contextual grouping ranking loss that models relative rationality among senses. Evaluated on the official AmbiStory dataset, our method consistently outperforms the baseline in accuracy and Spearman correlation. These results validate the efficacy of relative order modeling for capturing fine-grained semantic nuances in complex narratives. The code is available at: https://github.com/daojiaxu/Semeval2026task5.
PEU Lab at SemEval-2026 Task 4: Pairwise Text Comparison using RoBERTa and Ranking Loss
Hangchao Ma | Jiaxu Dao | Jinli Tong | Zhuoying Li | Qingsong Zhou | Xiuzhong Tang
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Hangchao Ma | Jiaxu Dao | Jinli Tong | Zhuoying Li | Qingsong Zhou | Xiuzhong Tang
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
This paper describes the system developed by the PEU Lab for SemEval-2026 Task 4, specifically focusing on Track A: Comparative Narrative Similarity. To address the pairwise nature of the task, a lightweight contrastive ranking approach is proposed. Specifically, the pretrained RoBERTa-Large model is utilized to encode the anchor and candidate stories. Rather than employing standard cross-entropy, a margin ranking loss is introduced, which allows the relative narrative proximity between different candidate stories to be explicitly modeled. Furthermore, a 5-fold cross-validation ensemble strategy is integrated to stabilize predictions on unseen data. Evaluated on the official dataset, the optimal configuration achieved an overall accuracy of 64.50%, demonstrating the effectiveness of relative order modeling. The code for this system is available at: https://github.com/mhchhh/SemEval2026-Task-4.