Kai Krüger
2025
Improving Online Job Advertisement Analysis via Compositional Entity Extraction
Kai Krüger
|
Johanna Binnewitt
|
Kathrin Ehmann
|
Stefan Winnige
|
Alan Akbik
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
We propose a compositional entity modeling framework for requirement extraction from online job advertisements (OJAs), representing complex, tree-like structures that connect atomic entities via typed relations. Based on this schema, we introduce GOJA, a manually annotated dataset of 500 German job ads that captures roles, tools, experience levels, attitudes, and their functional context. We report strong inter-annotator agreement and benchmark transformer models, demonstrating the feasibility of learning this structure. A focused case study on AI-related requirements illustrates the analytical value of our approach for labor market research.