Jeremias Bohn
2026
Challenging Quadratic Attention - A Holistic View On the Rise of Alternative Language Model Architectures
Alexander M. Fichtl | Jeremias Bohn | Josefin Kelber | Edoardo Mosca | Georg Groh
Proceedings of The Big Picture v2: Crafting a Research Narrative
Alexander M. Fichtl | Jeremias Bohn | Josefin Kelber | Edoardo Mosca | Georg Groh
Proceedings of The Big Picture v2: Crafting a Research Narrative
Transformers have dominated sequence processing tasks for the past seven years—most notably language modeling. However, the inherent quadratic complexity of their attention mechanism remains a significant bottleneck as context length increases. We review and distill the recent efforts to overcome this bottleneck, including advances in (sub-quadratic) attention variants, recurrent neural networks, state space models, and hybrid architectures. We critically analyze approaches regarding compute and memory complexity, benchmark results, and fundamental limitations to assess whether the dominance of pure-attention transformers may soon be challenged, which we consider possible, particularly in domain-specific and edge-device applications.
2025
Adaptive Parameter Compression for Language Models
Jeremias Bohn | Frederic Mrozinski | Georg Groh
Findings of the Association for Computational Linguistics: NAACL 2025
Jeremias Bohn | Frederic Mrozinski | Georg Groh
Findings of the Association for Computational Linguistics: NAACL 2025
2021
Semi-Automated Labeling of Requirement Datasets for Relation Extraction
Jeremias Bohn | Jannik Fischbach | Martin Schmitt | Hinrich Schütze | Andreas Vogelsang
Proceedings of the 14th Workshop on Building and Using Comparable Corpora (BUCC 2021)
Jeremias Bohn | Jannik Fischbach | Martin Schmitt | Hinrich Schütze | Andreas Vogelsang
Proceedings of the 14th Workshop on Building and Using Comparable Corpora (BUCC 2021)
Creating datasets manually by human annotators is a laborious task that can lead to biased and inhomogeneous labels. We propose a flexible, semi-automatic framework for labeling data for relation extraction. Furthermore, we provide a dataset of preprocessed sentences from the requirements engineering domain, including a set of automatically created as well as hand-crafted labels. In our case study, we compare the human and automatic labels and show that there is a substantial overlap between both annotations.