Extracting Multi-Word Expressions Representing Technical Terms and Proper Nouns in Log Messages

Kilian Dangendorf, Sven-Ove Hänsel, Jannik Rosendahl, Felix Heine, Carsten Kleiner, Christian Wartena


Abstract
IT-systems generate log messages containing important information about the system’s health. To gather information about system entities, we extract technical terms and proper nouns as multi-word expressions (MWEs) from a wide range of log messages from 16 different real systems. We apply Gries’ information-theoretic approach which iteratively calculates the best MWE candidates using an eight-dimensional ranking method. These candidates are evaluated in an annotation study, achieving a precision of 66 %. This value is significantly higher than evaluations on general-purpose texts, demonstrating the higher occurrence of compound technical terms and proper nouns in log messages. The MWEs found can be used to reduce the number of nodes in a system behavior graph while increasing the information density of the nodes.
Anthology ID:
2026.mwe-1.7
Volume:
Proceedings of the 22nd Workshop on Multiword Expressions (MWE 2026)
Month:
March
Year:
2026
Address:
Rabat, Marocco
Editors:
Atul Kr. Ojha, Verginica Barbu Mititelu, Mathieu Constant, Ivelina Stoyanova, A. Seza Doğruöz, Alexandre Rademaker
Venues:
MWE | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
61–65
Language:
URL:
https://preview.aclanthology.org/ingest-eacl/2026.mwe-1.7/
DOI:
Bibkey:
Cite (ACL):
Kilian Dangendorf, Sven-Ove Hänsel, Jannik Rosendahl, Felix Heine, Carsten Kleiner, and Christian Wartena. 2026. Extracting Multi-Word Expressions Representing Technical Terms and Proper Nouns in Log Messages. In Proceedings of the 22nd Workshop on Multiword Expressions (MWE 2026), pages 61–65, Rabat, Marocco. Association for Computational Linguistics.
Cite (Informal):
Extracting Multi-Word Expressions Representing Technical Terms and Proper Nouns in Log Messages (Dangendorf et al., MWE 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.mwe-1.7.pdf