Robert Geislinger
2026
SemEval-2026 Task 9: Detecting Multilingual, Multicultural and Multievent Online Polarization
Usman Naseem | Robert Geislinger | Ada Ren | Sarah Kohail | Rudy Garrido Veliz | P Sam Sahil | Yiran Zhang | Marco Antonio Stranisci | Idris Abdulmumin | Özge Alacam | Cengiz Acarturk | Aisha Jabr | Saba Anwar | Abinew Ali Ayele | Elena Tutubalina | Aung Kyaw Htet | Xintong Wang | Surendrabikram Thapa | Tanmoy Chakraborty | Dheeraj Kodati | Sahar Moradizeyveh | Firoj Alam | Ye Kyaw Thu | Shantipriya Parida | Ihsan Ayyub Qazi | Lilian Diana Awuor Wanzare | Nelson Odhiambo | Clemencia Siro | Ibrahim Said Ahmad | Adem Chanie Ali | Martin Semmann | Chris Biemann | Shamsuddeen Hassan Muhammad | Seid Muhie Yimam
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Usman Naseem | Robert Geislinger | Ada Ren | Sarah Kohail | Rudy Garrido Veliz | P Sam Sahil | Yiran Zhang | Marco Antonio Stranisci | Idris Abdulmumin | Özge Alacam | Cengiz Acarturk | Aisha Jabr | Saba Anwar | Abinew Ali Ayele | Elena Tutubalina | Aung Kyaw Htet | Xintong Wang | Surendrabikram Thapa | Tanmoy Chakraborty | Dheeraj Kodati | Sahar Moradizeyveh | Firoj Alam | Ye Kyaw Thu | Shantipriya Parida | Ihsan Ayyub Qazi | Lilian Diana Awuor Wanzare | Nelson Odhiambo | Clemencia Siro | Ibrahim Said Ahmad | Adem Chanie Ali | Martin Semmann | Chris Biemann | Shamsuddeen Hassan Muhammad | Seid Muhie Yimam
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
We present SemEval-2026 Task 9, a shared task on online polarization detection, covering 22 languages and comprising over 110K annotated instances. Each data instance is multi-labeled with the presence of polarization, polarization type, and polarization manifestation. Participants were asked to predict labels in three subtasks: (1) detecting the presence of polarization, (2) identifying the type of polarization, and (3) recognizing the polarization manifestation. The three tasks attracted over 1,000 participants worldwide and more than 10k submissions on Codabench. We received final submissions from 67 teams and 69 system description papers. We report the baseline results and analyze the performance of the best-performing systems, highlighting the most common approaches and the most effective methods across different subtasks and languages. The dataset and other resources for this task are publicly available.
POLAR: A Benchmark for Multilingual, Multicultural, and Multi-Event Online Polarization
Usman Naseem | Robert Geislinger | Juan Ren | Sarah Kohail | Rudy Alexandro Garrido Veliz | P Sam Sahil | Yiran Zhang | Idris Abdulmumin | Marco Antonio Stranisci | Özge Alacam | Cengiz Acarturk | Aisha Jabr | Saba Anwar | Abinew Ali Ayele | Simona Frenda | Alessandra Teresa Cignarella | Elena Tutubalina | Oleg Rogov | Aung Kyaw Htet | Xintong Wang | Surendrabikram Thapa | Kritesh Rauniyar | Tanmoy Chakraborty | MD Arfeen Zeeshan | Dheeraj Kodati | Satya Keerthi | Sahar Moradizeyveh | Firoj Alam | Md Arid Hasan | Syed Ishtiaque Ahmed | Ye Kyaw Thu | Shantipriya Parida | Ihsan Ayyub Qazi | Lilian Diana Awuor Wanzare | Nelson Odhiambo Onyango | Clemencia Siro | Jane Wanjiru Kimani | Ibrahim Said Ahmad | Adem Chanie Ali | Martin Semmann | Chris Biemann | Shamsuddeen Hassan Muhammad | Seid Muhie Yimam
Findings of the Association for Computational Linguistics: ACL 2026
Usman Naseem | Robert Geislinger | Juan Ren | Sarah Kohail | Rudy Alexandro Garrido Veliz | P Sam Sahil | Yiran Zhang | Idris Abdulmumin | Marco Antonio Stranisci | Özge Alacam | Cengiz Acarturk | Aisha Jabr | Saba Anwar | Abinew Ali Ayele | Simona Frenda | Alessandra Teresa Cignarella | Elena Tutubalina | Oleg Rogov | Aung Kyaw Htet | Xintong Wang | Surendrabikram Thapa | Kritesh Rauniyar | Tanmoy Chakraborty | MD Arfeen Zeeshan | Dheeraj Kodati | Satya Keerthi | Sahar Moradizeyveh | Firoj Alam | Md Arid Hasan | Syed Ishtiaque Ahmed | Ye Kyaw Thu | Shantipriya Parida | Ihsan Ayyub Qazi | Lilian Diana Awuor Wanzare | Nelson Odhiambo Onyango | Clemencia Siro | Jane Wanjiru Kimani | Ibrahim Said Ahmad | Adem Chanie Ali | Martin Semmann | Chris Biemann | Shamsuddeen Hassan Muhammad | Seid Muhie Yimam
Findings of the Association for Computational Linguistics: ACL 2026
Online polarization poses a growing challenge for democratic discourse, yet most computational social science research remains monolingual, culturally narrow, or event-specific. We introduce POLAR, a multilingual, multicultural, and multi-event dataset with over 110K instances in 22 languages drawn from diverse online platforms and real-world events. Polarization is annotated along three axes, namely detection, type, and manifestation, using a variety of annotation platforms adapted to each cultural context. We conduct two main experiments: (1) fine-tuning six pretrained small language models; and (2) evaluating a range of open and closed large language models in few-shot and zero-shot settings. Results show that while most models perform well on binary polarization detection, they achieve substantially lower performance when predicting polarization types and manifestations. These findings highlight the complex, highly contextual nature of polarization and underscore the need for robust, adaptable approaches in NLP and computational social science. All resources will be released to support further research and effective mitigation of digital polarization globally.
2025
HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation
Naquee Rizwan | Seid Muhie Yimam | Daryna Dementieva | Dr. Florian Skupin | Tim Fischer | Daniil Moskovskiy | Aarushi Ajay Borkar | Robert Geislinger | Punyajoy Saha | Sarthak Roy | Martin Semmann | Alexander Panchenko | Chris Biemann | Animesh Mukherjee
Findings of the Association for Computational Linguistics: ACL 2025
Naquee Rizwan | Seid Muhie Yimam | Daryna Dementieva | Dr. Florian Skupin | Tim Fischer | Daniil Moskovskiy | Aarushi Ajay Borkar | Robert Geislinger | Punyajoy Saha | Sarthak Roy | Martin Semmann | Alexander Panchenko | Chris Biemann | Animesh Mukherjee
Findings of the Association for Computational Linguistics: ACL 2025
Despite regulations imposed by nations and social media platforms, e.g. (Government of India, 2021; European Parliament and Council of the European Union, 2022), inter alia, hateful content persists as a significant challenge. Existing approaches primarily rely on reactive measures such as blocking or suspending offensive messages, with emerging strategies focusing on proactive measurements like detoxification and counterspeech. In our work, which we call HATEPRISM, we conduct a comprehensive examination of hate speech regulations and strategies from three perspectives: country regulations, social platform policies, and NLP research datasets. Our findings reveal significant inconsistencies in hate speech definitions and moderation practices across jurisdictions and platforms, alongside a lack of alignment with research efforts. Based on these insights, we suggest ideas and research direction for further exploration of a unified framework for automated hate speech moderation incorporating diverse strategies.
BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages
Shamsuddeen Hassan Muhammad | Nedjma Ousidhoum | Idris Abdulmumin | Jan Philip Wahle | Terry Ruas | Meriem Beloucif | Christine de Kock | Nirmal Surange | Daniela Teodorescu | Ibrahim Said Ahmad | David Ifeoluwa Adelani | Alham Fikri Aji | Felermino D. M. A. Ali | Ilseyar Alimova | Vladimir Araujo | Nikolay Babakov | Naomi Baes | Ana-Maria Bucur | Andiswa Bukula | Guanqun Cao | Rodrigo Tufiño | Rendi Chevi | Chiamaka Ijeoma Chukwuneke | Alexandra Ciobotaru | Daryna Dementieva | Murja Sani Gadanya | Robert Geislinger | Bela Gipp | Oumaima Hourrane | Oana Ignat | Falalu Ibrahim Lawan | Rooweither Mabuya | Rahmad Mahendra | Vukosi Marivate | Alexander Panchenko | Andrew Piper | Charles Henrique Porto Ferreira | Vitaly Protasov | Samuel Rutunda | Manish Shrivastava | Aura Cristina Udrea | Lilian Diana Awuor Wanzare | Sophie Wu | Florian Valentin Wunderlich | Hanif Muhammad Zhafran | Tianhui Zhang | Yi Zhou | Saif M. Mohammad
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Shamsuddeen Hassan Muhammad | Nedjma Ousidhoum | Idris Abdulmumin | Jan Philip Wahle | Terry Ruas | Meriem Beloucif | Christine de Kock | Nirmal Surange | Daniela Teodorescu | Ibrahim Said Ahmad | David Ifeoluwa Adelani | Alham Fikri Aji | Felermino D. M. A. Ali | Ilseyar Alimova | Vladimir Araujo | Nikolay Babakov | Naomi Baes | Ana-Maria Bucur | Andiswa Bukula | Guanqun Cao | Rodrigo Tufiño | Rendi Chevi | Chiamaka Ijeoma Chukwuneke | Alexandra Ciobotaru | Daryna Dementieva | Murja Sani Gadanya | Robert Geislinger | Bela Gipp | Oumaima Hourrane | Oana Ignat | Falalu Ibrahim Lawan | Rooweither Mabuya | Rahmad Mahendra | Vukosi Marivate | Alexander Panchenko | Andrew Piper | Charles Henrique Porto Ferreira | Vitaly Protasov | Samuel Rutunda | Manish Shrivastava | Aura Cristina Udrea | Lilian Diana Awuor Wanzare | Sophie Wu | Florian Valentin Wunderlich | Hanif Muhammad Zhafran | Tianhui Zhang | Yi Zhou | Saif M. Mohammad
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
People worldwide use language in subtle and complex ways to express emotions. Although emotion recognition–an umbrella term for several NLP tasks–impacts various applications within NLP and beyond, most work in this area has focused on high-resource languages. This has led to significant disparities in research efforts and proposed solutions, particularly for under-resourced languages, which often lack high-quality annotated datasets.In this paper, we present BRIGHTER–a collection of multi-labeled, emotion-annotated datasets in 28 different languages and across several domains. BRIGHTER primarily covers low-resource languages from Africa, Asia, Eastern Europe, and Latin America, with instances labeled by fluent speakers. We highlight the challenges related to the data collection and annotation processes, and then report experimental results for monolingual and crosslingual multi-label emotion identification, as well as emotion intensity recognition. We analyse the variability in performance across languages and text domains, both with and without the use of LLMs, and show that the BRIGHTER datasets represent a meaningful step towards addressing the gap in text-based emotion recognition.
LECTURE4ALL: A Lightweight Approach to Precise Timestamp Detection in Online Lecture Videos
Torben Hannemann | Frank Hammerschmidt | Simon Kazemi | Gregor Stange | Viktoria Wrobel | Robert Geislinger | Seid Muhie Yimam
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Torben Hannemann | Frank Hammerschmidt | Simon Kazemi | Gregor Stange | Viktoria Wrobel | Robert Geislinger | Seid Muhie Yimam
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
This paper presents LECTURE4ALL, a web application developed to improve the search experience of educational video platforms. Lecture2Go provides a vast collection of recorded lectures, but locating specific content within videos can be time-consuming. LECTURE4ALL addresses this issue by leveraging a vector database and a streamlined user interface to enable direct retrieval of precise video timestamps. By enhancing search accuracy and efficiency, LECTURE4ALL significantly improves the accessibility and usability of educational video platforms.
2024
Concept Over Time Analysis: Unveiling Temporal Patterns for Qualitative Data Analysis
Tim Fischer | Florian Schneider | Robert Geislinger | Florian Helfer | Gertraud Koch | Chris Biemann
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: System Demonstrations)
Tim Fischer | Florian Schneider | Robert Geislinger | Florian Helfer | Gertraud Koch | Chris Biemann
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: System Demonstrations)
In this system demonstration paper, we present the Concept Over Time Analysis extension for the Discourse Analysis Tool Suite.The proposed tool empowers users to define, refine, and visualize their concepts of interest within an interactive interface. Adhering to the Human-in-the-loop paradigm, users can give feedback through sentence annotations. Utilizing few-shot sentence classification, the system employs Sentence Transformers to compute representations of sentences and concepts. Through an iterative process involving semantic similarity searches, sentence annotation, and fine-tuning with contrastive data, the model continuously refines, providing users with enhanced analysis outcomes. The final output is a timeline visualization of sentences classified to concepts. Especially suited for the Digital Humanities, Concept Over Time Analysis serves as a valuable tool for qualitative data analysis within extensive datasets. The chronological overview of concepts enables researchers to uncover patterns, trends, and shifts in discourse over time.
2023
Multi-Modal Learning Application – Support Language Learners with NLP Techniques and Eye-Tracking
Robert Geislinger | Ali Ebrahimi Pourasad | Deniz Gül | Daniel Djahangir | Seid Muhie Yimam | Steffen Remus | Chris Biemann
Proceedings of the 1st Workshop on Linguistic Insights from and for Multimodal Language Processing
Robert Geislinger | Ali Ebrahimi Pourasad | Deniz Gül | Daniel Djahangir | Seid Muhie Yimam | Steffen Remus | Chris Biemann
Proceedings of the 1st Workshop on Linguistic Insights from and for Multimodal Language Processing
2022
Search
Fix author
Co-authors
- Chris Biemann 6
- Seid Muhie Yimam 5
- Idris Abdulmumin 3
- Ibrahim Said Ahmad 3
- Shamsuddeen Hassan Muhammad 3
- Martin Semmann 3
- Lilian Diana Awuor Wanzare 3
- Cengiz Acarturk 2
- Özge Alacam 2
- Firoj Alam 2
- Adem Chanie Ali 2
- Saba Anwar 2
- Abinew Ali Ayele 2
- Tanmoy Chakraborty 2
- Daryna Dementieva 2
- Tim Fischer 2
- Aung Kyaw Htet 2
- Aisha Jabr 2
- Dheeraj Kodati 2
- Sarah Kohail 2
- Sahar Moradizeyveh 2
- Usman Naseem 2
- Alexander Panchenko 2
- Shantipriya Parida 2
- Ihsan Ayyub Qazi 2
- P Sam Sahil 2
- Clemencia Siro 2
- Marco Antonio Stranisci 2
- Surendrabikram Thapa 2
- Ye Kyaw Thu 2
- Elena Tutubalina 2
- Xintong Wang 2
- Yiran Zhang 2
- David Ifeoluwa Adelani 1
- Syed Ishtiaque Ahmed 1
- Alham Fikri Aji 1
- Felermino D. M. A. Ali 1
- Ilseyar Alimova 1
- Vladimir Araujo 1
- Nikolay Babakov 1
- Naomi Baes 1
- Meriem Beloucif 1
- Aarushi Ajay Borkar 1
- Ana-Maria Bucur 1
- Andiswa Bukula 1
- Guanqun Cao 1
- Rendi Chevi 1
- Chiamaka Ijeoma Chukwuneke 1
- Alessandra Teresa Cignarella 1
- Alexandra Ciobotaru 1
- Daniel Djahangir 1
- Charles Henrique Porto Ferreira 1
- Simona Frenda 1
- Murja Sani Gadanya 1
- Rudy Garrido Veliz 1
- Bela Gipp 1
- Deniz Gül 1
- Frank Hammerschmidt 1
- Torben Hannemann 1
- Md. Arid Hasan 1
- Florian Helfer 1
- Oumaima Hourrane 1
- Oana Ignat 1
- Simon Kazemi 1
- Satya Keerthi 1
- Jane Wanjiru Kimani 1
- Gertraud Koch 1
- Falalu Ibrahim Lawan 1
- Rooweither Mabuya 1
- Rahmad Mahendra 1
- Vukosi Marivate 1
- Benjamin Milde 1
- Saif Mohammad 1
- Daniil Moskovskiy 1
- Animesh Mukherjee 1
- Nelson Odhiambo 1
- Nelson Odhiambo Onyango 1
- Nedjma Ousidhoum 1
- Andrew Piper 1
- Ali Ebrahimi Pourasad 1
- Vitaly Protasov 1
- Kritesh Rauniyar 1
- Steffen Remus 1
- Ada Ren 1
- Juan Ren 1
- Naquee Rizwan 1
- Oleg Rogov 1
- Sarthak Roy 1
- Terry Ruas 1
- Samuel Rutunda 1
- Punyajoy Saha 1
- Florian Schneider 1
- Manish Shrivastava 1
- Dr. Florian Skupin 1
- Gregor Stange 1
- Nirmal Surange 1
- Daniela Teodorescu 1
- Rodrigo Tufiño 1
- Aura Cristina Udrea 1
- Rudy Alexandro Garrido Veliz 1
- Jan Philip Wahle 1
- Viktoria Wrobel 1
- Sophie Wu 1
- Florian Valentin Wunderlich 1
- MD Arfeen Zeeshan 1
- Hanif Muhammad Zhafran 1
- Tianhui Zhang 1
- Yi Zhou 1
- Christine de Kock 1