Alexandros Poulis


2025

pdf bib
BullyBench: Youth & Experts-in-the-loop Framework for Intrinsic and Extrinsic Cyberbullying NLP Benchmarking
Kanishk Verma | Sri Balaaji | Joachim Wagner | Arefeh Kazemi | Darragh Mccashin | Isobel Walsh@dcu | Sayani Basak | Sinan Asci | Yelena Cherkasova | Alexandros Poulis | James Ohiggins Norman | Rebecca Umbach Umbach | Tijana Milosevic | Brian Davis
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track

Cyberbullying (CB) involves complex relational dynamics that are often oversimplified as a binary classification task. Existing youth-focused CB datasets rely on scripted role-play, lacking conversational realism and ethical youth involvement, with little or no evaluation of their social plausibility. To address this, we introduce a youth-in-the-loop dataset “BullyBench” developed by adolescents (ages 15–16) through an ethical co-research framework. We introduce a structured intrinsic quality evaluation with experts-in-the-loop (social scientists, psychologists, and content moderators) for assessing realism, relevance, and coherence in youth CB data. Additionally, we perform extrinsic baseline evaluation of this dataset by benchmarking encoder- and decoder-only language models for multi-class CB role classification for future research. A three-stage annotation process by young adults refines the dataset into a gold-standard test benchmark, a high-quality resource grounded in minors’ lived experiences of CB detection. Code and data are available for review

2017

pdf bib
A Case Study of Machine Translation in Financial Sentiment Analysis
Chong Zhang | Matteo Capelletti | Alexandros Poulis | Thorben Stemann | Jane Nemcova
Proceedings of Machine Translation Summit XVI: Commercial MT Users and Translators Track

2012

pdf bib
To post-edit or not to post-edit? Estimating the benefits of MT post-editing for a European organization
Alexandros Poulis | David Kolovratnik
Workshop on Post-Editing Technology and Practice

In the last few years the European Parliament has witnessed a significant increase in translation demand. Although Translation Memory (TM) tools, terminology databases and bilingual concordancers have provided significant leverage in terms of quality and productivity the European Parliament is in need for advanced language technology to keep facing successfully the challenge of multilingualism. This paper describes an ongoing large-scale machine translation post-editing evaluation campaign the purpose of which is to estimate the business benefits from the use of machine translation for the European Parliament. This paper focuses mainly on the design, the methodology and the tools used by the evaluators but it also presents some preliminary results for the following language pairs: Polish-English, Danish-English, Lithuanian-English, English-German and English-French.

bib
Experiments on domain-specific statistical machine translation at the European Parliament
Konstantinos Chatzitheodorou | Alexandros Poulis
Proceedings of Translating and the Computer 34

2010

pdf bib
Exodus - Exploring SMT for EU Institutions
Michael Jellinghaus | Alexandros Poulis | David Kolovratník
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR