Conference on Natural Language Processing (2024)


up

bib (full) Proceedings of the 20th Conference on Natural Language Processing (KONVENS 2024)

pdf bib
Proceedings of the 20th Conference on Natural Language Processing (KONVENS 2024)
Pedro Henrique Luz de Araujo | Andreas Baumann | Dagmar Gromann | Brigitte Krenn | Benjamin Roth | Michael Wiegand

pdf bib
Large Language Models as Evaluators for Scientific Synthesis
Julia Evans | Jennifer D’Souza | Sören Auer

pdf bib
A Crosslingual Approach to Dependency Parsing for Middle High German
Cora Haiber

pdf
Complexity of German Texts Written by Primary School Children
Jammila Laâguidi | Dana Neumann | Ronja Laarmann-Quante | Stefanie Dipper | Mihail Chifligarov

pdf
Exploring Automatic Text Simplification for Lithuanian
Justina Mandravickaitė | Egle Rimkiene | Danguolė Kalinauskaitė | Danguolė Kotryna Kapkan

pdf
Word alignment in Discourse Representation Structure parsing
Christian Obereder | Gabor Recski

pdf
Evaluating and Fine-Tuning Retrieval-Augmented Language Models to Generate Text with Accurate Citations
Vinzent Penzkofer | Timo Baumann

pdf
Discourse Parsing for German with new RST Corpora
Sara Shahmohammadi | Manfred Stede

pdf
Revisiting the Phenomenon of Syntactic Complexity Convergence on German Dialogue Data
Yu Wang | Hendrik Buschmeier

pdf
Estimating Word Concreteness from Contextualized Embeddings
Christian Wartena

pdf
Using GermaNet for the Generation of Crossword Puzzles
Claus Zinn | Marie Hinrichs | Erhard Hinrichs

pdf
Leveraging Cross-Lingual Transfer Learning in Spoken Named Entity Recognition Systems
Moncef Benaicha | David Thulke | Mehmet Ali Tuğtekin Turan

pdf
Exploring Data Acquisition Strategies for the Domain Adaptation of QA Models
Maurice Falk | Adrian Ulges | Dirk Krechel

pdf
CO-Fun: A German Dataset on Company Outsourcing in Fund Prospectuses for Named Entity Recognition and Relation Extraction
Neda Foroutan | Markus Schröder | Andreas Dengel

pdf
GERestaurant: A German Dataset of Annotated Restaurant Reviews for Aspect-Based Sentiment Analysis
Nils Constantin Hellwig | Jakob Fehle | Markus Bink | Christian Wolff

pdf
How to Translate SQuAD to German? A Comparative Study of Answer Span Retrieval Methods for Question Answering Dataset Creation
Jens Kaiser | Agnieszka Falenska

pdf
Few-Shot Prompting for Subject Indexing of German Medical Book Titles
Lisa Kluge | Maximilian Kähler

pdf
Binary indexes for optimising corpus queries
Peter Ljunglöf | Nicholas Smallbone | Mijo Thoresson | Victor Salomonsson

pdf
An Improved Method for Class-specific Keyword Extraction: A Case Study in the German Business Registry
Stephen Meisenbacher | Tim Schopf | Weixin Yan | Patrick Holl | Florian Matthes

pdf
Tabular JSON: A Proposal for a Pragmatic Linguistic Data Format
Adam Roussel

pdf
Semiautomatic Data Generation for Academic Named Entity Recognition in German Text Corpora
Pia Schwarz

pdf
Redundancy Aware Multiple Reference Based Gainwise Evaluation of Extractive Summarization
Mousumi Akter | Santu Karmaker

pdf
Fine-grained quotation detection and attribution in German news articles
Fynn Petersen-Frey | Chris Biemann

pdf
Decoding 16th-Century Letters: From Topic Models to GPT-Based Keyword Mapping
Phillip Benjamin Ströbel | Stefan Aderhold | Ramona Roller

pdf
Analysing Effects of Inducing Gender Bias in Language Models
Stephanie Gross | Brigitte Krenn | Craig Lincoln | Lena Holzwarth

pdf
OMoS-QA: A Dataset for Cross-Lingual Extractive Question Answering in a German Migration Context
Steffen Kleinle | Jakob Prange | Annemarie Friedrich

pdf
Role-Playing LLMs in Professional Communication Training: The Case of Investigative Interviews with Children
Don Tuggener | Teresa Schneider | Ariana Huwiler | Tobias Kreienbühl | Simon Hischier | Pius von Däniken | Susanna Niehaus

pdf
Features and Detectability of German Texts Generated with Large Language Models
Verena Irrgang | Veronika Solopova | Steffen Zeiler | Robert M. Nickel | Dorothea Kolossa

pdf
Lex2Sent: A bagging approach to unsupervised sentiment analysis
Kai-Robin Lange | Jonas Rieger | Carsten Jentsch

pdf
Discourse-Level Features in Spoken and Written Communication
Hannah J. Seemann | Sara Shahmohammadi | Manfred Stede | Tatjana Scheffler

pdf
Version Control for Speech Corpora
Vlad Dumitru | Matthias Boehm | Martin Hagmüller | Barbara Schuppler

pdf
Querying Repetitions in Spoken Language Corpora
Elena Frick | Henrike Helmer | Dolores Lemmenmeier-Batinić

pdf
Exploring Phonetic Features in Language Embeddings for Unseen Language Varieties of Austrian German
Lorenz Gutscher | Michael Pucher

pdf
A Multilingual Dataset of Adversarial Attacks to Automatic Content Scoring Systems
Ronja Laarmann-Quante | Christopher Chandler | Noemi Incirkus | Vitaliia Ruban | Alona Solopov | Luca Steen

pdf
Towards Improving ASR Outputs of Spontaneous Speech with LLMs
Karner Manuel | Julian Linke | Mark Kröll | Barbara Schuppler | Bernhard C. Geiger

pdf
OneLove beyond the field - A few-shot pipeline for topic and sentiment analysis during the FIFA World Cup in Qatar
Christoph Rauchegger | Sonja Mei Wang | Pieter Delobelle

pdf
Linguistic and extralinguistic factors in automatic speech recognition of German atypical speech
Eugenia Rykova | Mathias Walther

pdf
LLM-based Translation Across 500 Years. The Case for Early New High German
Martin Volk | Dominic P. Fischer | Patricia Scheurer | Raphael Schwitter | Phillip B. Ströbel


up

pdf (full)
bib (full)
Proceedings of the 4th Workshop on Computational Linguistics for the Political and Social Sciences: Long and short papers

pdf bib
Proceedings of the 4th Workshop on Computational Linguistics for the Political and Social Sciences: Long and short papers
Christopher Klamm | Gabriella Lapesa | Simone Paolo Ponzetto | Ines Rehbein | Indira Sen

pdf bib
Detecting Calls to Action in Multimodal Content: Analysis of the 2021 German Federal Election Campaign on Instagram
Michael Achmann-Denkler | Jakob Fehle | Mario Haim | Christian Wolff

This study investigates the automated classification of Calls to Action (CTAs) within the 2021 German Instagram election campaign to advance the understanding of mobilization in social media contexts. We analyzed over 2,208 Instagram stories and 712 posts using fine-tuned BERT models and OpenAI’s GPT-4 models. The fine-tuned BERT model incorporating synthetic training data achieved a macro F1 score of 0.93, demonstrating a robust classification performance. Our analysis revealed that 49.58% of Instagram posts and 10.64% of stories contained CTAs, highlighting significant differences in mobilization strategies between these content types. Additionally, we found that FDP and the Greens had the highest prevalence of CTAs in posts, whereas CDU and CSU led in story CTAs.

pdf bib
Multilingual Bot Accusations: How Different Linguistic Contexts Shape Perceptions of Social Bots
Leon Fröhling | Xiaofei Li | Dennis Assenmacher

Recent research indicates that the online use of the term ”bot” has evolved over time. In the past, people used the term to accuse others of displaying automated behavior. However, it has gradually transformed into a linguistic tool to dehumanize the conversation partner, particularly on polarizing topics. Although this trend has been observed in English-speaking contexts, it is still unclear whether it holds true in other socio-linguistic environments. In this work we extend existing work on bot accusations and explore the phenomenon in a multilingual setting. We identify three distinct accusation patterns that characterize the different languages.

pdf
Operationalising the Hermeneutic Grouping Process in Corpus-assisted Discourse Studies
Philipp Heinrich | Stephanie Evert

We propose a framework for quantitative-qualitative research in corpus-assisted discourse studies (CADS), which operationalises the central process of manually forming groups of related words and phrases in terms of “discoursemes” and their constellations. We introduce an open-source implementation of this framework in the form of a REST API based on Corpus Workbench. Going through the workflow of a collocation analysis for fleeing and related terms in the German Federal Parliament, the paper gives details about the underlying algorithms, with available parameters and further possible choices. We also address multi-word units (which are often disregarded by CADS tools), a semantic map visualisation of collocations, and how to compute assocations between discoursemes.

pdf
A Few Hypocrites: Few-Shot Learning and Subtype Definitions for Detecting Hypocrisy Accusations in Online Climate Change Debates
Paulina Garcia Corral | Avishai Green | Hendrik Meyer | Anke Stoll | Xiaoyue Yan | Myrthe Reuver

The climate crisis is a salient issue in online discussions, and hypocrisy accusations are a central rhetorical element in these debates. However, for large-scale text analysis, hypocrisy accusation detection is an understudied tool, most often defined as a smaller subtask of fallacious argument detection. In this paper, we define hypocrisy accusation detection as an independent task in NLP, and identify different relevant subtypes of hypocrisy accusations. Our Climate Hypocrisy Accusation Corpus (CHAC) consists of 420 Reddit climate debate comments, expert-annotated into two different types of hypocrisy accusations: personal versus political hypocrisy. We evaluate few-shot in-context learning with 6 shots and 3 instruction-tuned Large Language Models (LLMs) for detecting hypocrisy accusations in this dataset. Results indicate that the GPT-4o and Llama-3 models in particular show promise in detecting hypocrisy accusations (F1 reaching 0.68, while previous work shows F1 of 0.44). However, context matters for a complex semantic concept such as hypocrisy accusations, and we find models struggle especially at identifying political hypocrisy accusations compared to personal moral hypocrisy. Our study contributes new insights in hypocrisy detection and climate change discourse, and is a stepping stone for large-scale analysis of hypocrisy accusation in online climate debates.

pdf
Language Complexity in Populist Rhetoric
Sergio E. Zanotto | Diego Frassinelli | Miriam Butt

Research suggests that politicians labeled as populists tend to use simpler language than their mainstream opponents. Yet, the metrics traditionally employed to assess the complexity of their language do not show consistent and generalizable results across different datasets and languages. This inconsistencies raise questions about the claimed simplicity of populist discourse, suggesting that the issue may be more nuanced than it initially seemed. To address this topic, we analyze the linguistic profile of IMPAQTS, a dataset of transcribed Italian political speeches, to identify linguistic features differentiating populist and non-populist parties. Our methodology ensures comparability of political texts and combines various statistical analyses to reliably identify key linguistic characteristics to test our case study. Results show that the “simplistic” language features previously described in the literature are not robust predictors of populism. This suggests that the characteristics defining populist statements are highly dependent on the specific dataset and the language being analysed, thus limiting the conclusions drawn in previous research. In our study, various linguistic features statistically differentiate between populist and mainstream parties, indicating that populists tend to employ specific well-known rhetorical strategies more frequently; however, none of them strongly indicate that populist parties use simpler language.

pdf
ChatGPT as Your n-th Annotator: Experiments in Leveraging Large Language Models for Social Science Text Annotation in Slovak Language
Endre Hamerlik | Marek Šuppa | Miroslav Blšták | Jozef Kubík | Martin Takáč | Marián Šimko | Andrej Findor

Large Language Models (LLMs) are increasingly influential in Computational Social Science, offering new methods for processing and analyzing data, particularly in lower-resource language contexts. This study explores the use of OpenAI’s GPT-3.5 Turbo and GPT-4 for automating annotations for a unique news media dataset in a lower resourced language, focusing on stance classification tasks. Our results reveal that prompting in the native language, explanation generation, and advanced prompting strategies like Retrieval Augmented Generation and Chain of Thought prompting enhance LLM performance, particularly noting GPT-4’s superiority in predicting stance. Further evaluation indicates that LLMs can serve as a useful tool for social science text annotation in lower resourced languages, notably in identifying inconsistencies in annotation guidelines and annotated datasets.

pdf
Detecting emotional polarity in Finnish parliamentary proceedings
Suvi Lehtosalo | John Nerbonne

Few studies have focused on detecting emotion in parliamentary corpora, and none have done this for the Finnish parliament. In this paper, this gap is addressed by applying the polarity lexicon–based methodology of a study by Rheault et al. (2016) on speeches in the British Parliament to a Finnish corpus. The findings show an increase in positive sentiment over time. Additionally, the findings indicate that politicians’ emotional states may be impacted by the state of the economy and other major events, such as the Covid-19 pandemic and the Russian invasion of Ukraine.

pdf
Topic-specific social science theory in stance detection: a proposal and interdisciplinary pilot study on sustainability initiatives
Myrthe Reuver | Alessandra Polimeno | Antske Fokkens | Ana Isabel Lopes

Topic-specificity is often seen as a limitation of stance detection models and datasets, especially for analyzing political and societal debates. However, stances contain topic-specific aspects that are crucial for an in-depth understanding of these debates. Our interdisciplinary approach identifies social science theories on specific debate topics as an opportunity for further defining stance detection research and analyzing online debate. This paper explores sustainability as debate topic, and connects stance to the sustainability-related Value-Belief-Norm (VBN) theory. VBN theory states that arguments in favor or against sustainability initiatives contain the dimensions of feeling power to change the issue with the initiative, and thinking whether or not the initiative tackles an urgent threat to the environment. In a pilot study with our Reddit European Sustainability Initiatives corpus, we develop an annotation procedure for these complex concepts. We then compare crowd-workers with Natural Language Processing experts’ annotation proficiency. Both crowd-workers and NLP experts find the tasks difficult, but experts reach more agreement on some difficult examples. This pilot study shows that complex theories about debate topics are feasible and worthwhile as annotation tasks for stance detection.

pdf
The Echoes of the ‘I’: Tracing Identity with Demographically Enhanced Word Embeddings
Ivan Smirnov

Identity is one of the most commonly studied constructs in social science. However, despite extensive theoretical work on identity, there remains a need for additional empirical data to validate and refine existing theories. This paper introduces a novel approach to studying identity by enhancing word embeddings with socio-demographic information. As a proof of concept, we demonstrate that our approach successfully reproduces and extends established findings regarding gendered self-views. Our methodology can be applied in a wide variety of settings, allowing researchers to tap into a vast pool of naturally occurring data, such as social media posts. Unlike similar methods already introduced in computer science, our approach allows for the study of differences between social groups. This could be particularly appealing to social scientists and may encourage the faster adoption of computational methods in the field.

pdf
TPPMI - a Temporal Positive Pointwise Mutual Information Embedding of Words
Paul Schmitt | Zsófia Rakovics | Márton Rakovics | Gábor Recski

We present Temporal Positive Pointwise Mutual Information (TPPMI) embeddings as a robust and data-efficient alternative for modeling temporal semantic change. Based on the assumption that the semantics of the most frequent words in a corpus are relatively stable over time, our model represents words as vectors of their PPMI similarities with a predefined set of such context words. We evaluate our method on the temporal word analogy benchmark of Yao et al. (2018) and compare it to the TWEC model (Di Carlo et al., 2019), demonstrating the competitiveness of the approach. While the performance of TPPMI stays below that of the state-of-the-art TWEC model, it offers a higher degree of interpretability and is applicable in scenarios where only a limited amount of data is available.

pdf
Augmented Political Leaning Detection: Leveraging Parliamentary Speeches for Classifying News Articles
Charlott Jakob | Pia Wenzel | Salar Mohtaj | Vera Schmitt

In an era where political discourse infiltrates online platforms and news media, identifying opinion is increasingly critical, especially in news articles, where objectivity is expected. Readers frequently encounter authors’ inherent political viewpoints, challenging them to discern facts from opinions. Classifying text on a spectrum from left to right is a key task for uncovering these viewpoints. Previous approaches rely on outdated datasets to classify current articles, neglecting that political opinions on certain subjects change over time. This paper explores a novel methodology for detecting political leaning in news articles by augmenting them with political speeches specific to the topic and publication time. We evaluated the impact of the augmentation using BERT and Mistral models. The results show that the BERT model’s F1 score improved from a baseline of 0.82 to 0.85, while the Mistral model’s F1 score increased from 0.30 to 0.31.