Improving Online Job Advertisement Analysis via Compositional Entity Extraction
Kai Krüger, Johanna Binnewitt, Kathrin Ehmann, Stefan Winnige, Alan Akbik
Abstract
We propose a compositional entity modeling framework for requirement extraction from online job advertisements (OJAs), representing complex, tree-like structures that connect atomic entities via typed relations. Based on this schema, we introduce GOJA, a manually annotated dataset of 500 German job ads that captures roles, tools, experience levels, attitudes, and their functional context. We report strong inter-annotator agreement and benchmark transformer models, demonstrating the feasibility of learning this structure. A focused case study on AI-related requirements illustrates the analytical value of our approach for labor market research.- Anthology ID:
- 2025.emnlp-main.1375
- Volume:
- Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
- Month:
- November
- Year:
- 2025
- Address:
- Suzhou, China
- Editors:
- Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
- Venue:
- EMNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 27035–27053
- Language:
- URL:
- https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1375/
- DOI:
- Cite (ACL):
- Kai Krüger, Johanna Binnewitt, Kathrin Ehmann, Stefan Winnige, and Alan Akbik. 2025. Improving Online Job Advertisement Analysis via Compositional Entity Extraction. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 27035–27053, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal):
- Improving Online Job Advertisement Analysis via Compositional Entity Extraction (Krüger et al., EMNLP 2025)
- PDF:
- https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1375.pdf