Ådne Jøssing

2026

Multi-Label Polarization Classification with twHIN-BERT and SCUT Threshold Optimization
Ilinca Vandici | Ådne Jøssing | Lukas Viestädt
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)

Tackling task 2, we fine tune a BERT-style encoder with classification heads added on top. We first try out different pre-trained encoder models, before settling on the Twhin-bert multilingual model, since its pretraining corpus (mainly tweets) provides a suitable starting point for our task. To resolve the issue of diverging label annotation styles, we apply the S-Cut algorithm, in order to calibrate thresholds for label selection, and examine its impact. We take a look at the resulting hidden representations in a reduced dimensional space, and examine the linguistic information encoded by our model after fine-tuning using linguistic probing.

Co-authors

Ilinca Vandici 1
Lukas Viestädt 1

Venues

SemEval1
WS1

Fix author