Maria Alexandra Roussopoulou

2026

From Form to Meaning: Interlingua Sense-Alignment of Offensive Language with LLMs
Maria Alexandra Roussopoulou | Stella Markantonatou
Proceedings of the Sixth Workshop on Language Technology for Equality, Diversity, Inclusion

This paper presents a methodology that uses LLMs to align multilingual offensive lexicons at the sense level. Lexicons of different structures and origins in Arabic, Bulgarian, Modern Greek, French, and Italian have been aligned directly without pivoting through English. The Modern Greek lexicon is LLM-generated, and the other four lexicons are WordNet-compatible. For inter-language alignment of senses, an LLM-as-a-judge rubric was used over lemma–definition–example triples. The LLM makes 2.87M pairwise comparisons and yields 31 strict global-sense categories. The paper discusses the challenges involved in sense alignment tasks. The resource is available to support downstream applications such as Machine Translation and cross-lingual hate-speech detection.

Co-authors

Stella Markantonatou 1

Venues

LTEDI1
WS1

Fix author