Jim Hahn


2025

pdf bib
Jim at SemEval-2025 Task 5: Multilingual BERT Ensemble
Jim Hahn
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)

The SemEval-2025 Task 5 calls for the utilization of LLM capabilities to apply controlled subject labels to record descriptions in the multilingual library collection of the German National Library of Science and Technology. The multilingual BERT ensemble system described herein produces subject labels for various record types, including articles, books, conference papers, reports, and theses. Results indicate that for English language article records, bidirectional encoder-only LLMs can achieve high recall in automated subject assignment.
Search
Co-authors
    Venues
    Fix author