GerAV: Towards New Heights in German Authorship Verification using Fine-Tuned LLMs on a New Benchmark

Lotta Kiefer, Christoph Leiter, Sotaro Takeshita, Elena Schmidt, Steffen Eger


Abstract
Authorship verification (AV) is the task of determining whether two texts were written by the same author and has been studied extensively, predominantly for English data. In contrast, large-scale benchmarks and systematic evaluations for other languages remain scarce. We address this gap by introducing GerAV, a comprehensive benchmark for German AV comprising over 400k labeled text pairs. GerAV is built from Twitter and Reddit data, with the Reddit part further divided into in-domain and cross-domain message-based subsets, as well as a profile-based subset. This design enables controlled analysis of the effects of data source, topical domain, and text length. Using the provided training splits, we conduct a systematic evaluation of strong baselines and state-of-the-art models and find that our best approach, a fine-tuned large language model, outperforms recent baselines by up to 0.09 absolute F1 score and surpasses GPT-5 in a zero-shot setting by 0.08. We further observe a trade-off between specialization and generalization: models trained on specific data types perform best under matching conditions but generalize less well across data regimes, a limitation that can be mitigated by combining training sources. Overall, GerAV provides a challenging and versatile benchmark for advancing research on German and cross-domain AV.
Anthology ID:
2026.findings-acl.1991
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
40050–40069
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1991/
DOI:
Bibkey:
Cite (ACL):
Lotta Kiefer, Christoph Leiter, Sotaro Takeshita, Elena Schmidt, and Steffen Eger. 2026. GerAV: Towards New Heights in German Authorship Verification using Fine-Tuned LLMs on a New Benchmark. In Findings of the Association for Computational Linguistics: ACL 2026, pages 40050–40069, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
GerAV: Towards New Heights in German Authorship Verification using Fine-Tuned LLMs on a New Benchmark (Kiefer et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1991.pdf
Checklist:
 2026.findings-acl.1991.checklist.pdf