Donnacha Daly
2026
Reference-Free Evaluation of Taxonomies
Pascal Wullschleger | Majid Zarharan | Donnacha Daly | Marc Pouly | Jennifer Foster
Findings of the Association for Computational Linguistics: ACL 2026
Pascal Wullschleger | Majid Zarharan | Donnacha Daly | Marc Pouly | Jennifer Foster
Findings of the Association for Computational Linguistics: ACL 2026
We introduce two reference-free metrics for quality evaluation of taxonomies in the absence of labels. The first metric evaluates robustness by calculating the correlation between semantic and taxonomic similarity, addressing error types not considered by existing metrics. The second uses Natural Language Inference to assess logical adequacy. Both metrics are tested on five taxonomies and are shown to correlate well with F1 against ground truth taxonomies. We further demonstrate that our metrics can predict downstream performance in hierarchical classification when used with label hierarchies.
2025
FoodTaxo: Generating Food Taxonomies with Large Language Models
Pascal Wullschleger | Majid Zarharan | Donnacha Daly | Marc Pouly | Jennifer Foster
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track)
Pascal Wullschleger | Majid Zarharan | Donnacha Daly | Marc Pouly | Jennifer Foster
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track)
We investigate the utility of Large Language Models for automated taxonomy generation and completion specifically applied to taxonomies from the food technology industry. We explore the extent to which taxonomies can be completed from a seed taxonomy or generated without a seed from a set of known concepts, in an iterative fashion using recent prompting techniques.Experiments on five taxonomies using an open-source LLM (Llama-3), while promising, point to the difficulty of correctly placing inner nodes.