Jakob Fehle


2025

pdf bib
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction
Nils Hellwig | Jakob Fehle | Udo Kruschwitz | Christian Wolff
Proceedings of the 1st Joint Workshop on Large Language Models and Structure Modeling (XLLM 2025)

Aspect sentiment quad prediction (ASQP) facilitates a detailed understanding of opinions expressed in a text by identifying the opinion term, aspect term, aspect category and sentiment polarity for each opinion. However, annotating a full set of training examples to fine-tune models for ASQP is a resource-intensive process. In this study, we explore the capabilities of large language models (LLMs) for zero- and few-shot learning on the ASQP task across five diverse datasets. We report F1 scores almost up to par with those obtained with state-of-the-art fine-tuned models and exceeding previously reported zero- and few-shot performance. In the 20-shot setting on the Rest16 restaurant domain dataset, LLMs achieved an F1 score of 51.54, compared to 60.39 by the best-performing fine-tuned method MVP. Additionally, we report the performance of LLMs in target aspect sentiment detection (TASD), where the F1 scores were close to fine-tuned models, achieving 68.93 on Rest16 in the 30-shot setting, compared to 72.76 with MVP. While human annotators remain essential for achieving optimal performance, LLMs can reduce the need for extensive manual annotation in ASQP tasks.

2024

pdf bib
Detecting Calls to Action in Multimodal Content: Analysis of the 2021 German Federal Election Campaign on Instagram
Michael Achmann-Denkler | Jakob Fehle | Mario Haim | Christian Wolff
Proceedings of the 4th Workshop on Computational Linguistics for the Political and Social Sciences: Long and short papers

This study investigates the automated classification of Calls to Action (CTAs) within the 2021 German Instagram election campaign to advance the understanding of mobilization in social media contexts. We analyzed over 2,208 Instagram stories and 712 posts using fine-tuned BERT models and OpenAI’s GPT-4 models. The fine-tuned BERT model incorporating synthetic training data achieved a macro F1 score of 0.93, demonstrating a robust classification performance. Our analysis revealed that 49.58% of Instagram posts and 10.64% of stories contained CTAs, highlighting significant differences in mobilization strategies between these content types. Additionally, we found that FDP and the Greens had the highest prevalence of CTAs in posts, whereas CDU and CSU led in story CTAs.

pdf bib
GERestaurant: A German Dataset of Annotated Restaurant Reviews for Aspect-Based Sentiment Analysis
Nils Constantin Hellwig | Jakob Fehle | Markus Bink | Christian Wolff
Proceedings of the 20th Conference on Natural Language Processing (KONVENS 2024)

2023

pdf bib
Transformer-Based Analysis of Sentiment Towards German Political Parties on Twitter During the 2021 Election Year
Nils Constantin Hellwig | Markus Bink | Thomas Schmidt | Jakob Fehle | Christian Wolff
Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023)

pdf bib
Aspect-Based Sentiment Analysis as a Multi-Label Classification Task on the Domain of German Hotel Reviews
Jakob Fehle | Leonie Münster | Thomas Schmidt | Christian Wolff
Proceedings of the 19th Conference on Natural Language Processing (KONVENS 2023)

2022

pdf bib
Sentiment Analysis on Twitter for the Major German Parties during the 2021 German Federal Election
Thomas Schmidt | Jakob Fehle | Maximilian Weissenbacher | Jonathan Richter | Philipp Gottschalk | Christian Wolff
Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022)

2021

pdf bib
Lexicon-based Sentiment Analysis in German: Systematic Evaluation of Resources and Preprocessing Techniques
Jakob Fehle | Thomas Schmidt | Christian Wolff
Proceedings of the 17th Conference on Natural Language Processing (KONVENS 2021)