Building a Custom Taxonomy of AI Skills and Tasks from the Ground Up with Job Postings

Stephen Meisenbacher, Peter Norlander


Abstract
Utilizing LLMs for automated taxonomy construction presents a clear opportunity for the comprehensive, yet efficient mapping of potentially complex domains. When contending with high volumes of rapidly growing corpora, however, it becomes unclear how to best leverage such data for optimal taxonomy construction. Taking the case of systematizing *AI skills in the workplace*, we use two large-scale job postings corpora to investigate key design decisions for the inclusion (or exclusion) of data points for taxonomy construction. We propose **TaxonomyBuilder** as a blueprint for our systematic study, with which we evaluate various configurations of custom, data-informed, and hierarchical taxonomies. We demonstrate that *less* data can provide more clarity: filtering inputs to **TaxonomyBuilder** provides better domain-specific coverage than offering unfiltered inputs to clustering and LLM-enhanced hierarchical taxonomy labeling tools.
Anthology ID:
2026.customnlp4u-1.11
Volume:
Proceedings of the Second Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Sheshera Mysore, Sachin Kumar, Vidhisha Balachandran, Shirley Anugrah Hayati, Faeze Brahman, Hanane Nour Moussa, Alireza Salemi
Venues:
CustomNLP4U | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
117–130
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.customnlp4u-1.11/
DOI:
Bibkey:
Cite (ACL):
Stephen Meisenbacher and Peter Norlander. 2026. Building a Custom Taxonomy of AI Skills and Tasks from the Ground Up with Job Postings. In Proceedings of the Second Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U), pages 117–130, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
Building a Custom Taxonomy of AI Skills and Tasks from the Ground Up with Job Postings (Meisenbacher & Norlander, CustomNLP4U 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.customnlp4u-1.11.pdf