Probing Internal Representations of Multi-Word Verbs in Large Language Models

Hassane Kissane, Achim Schilling, Patrick Krauss


Abstract
This study investigates the internal representations of verb-particle combinations, called multi-word verbs, within transformer-based large language models (LLMs), specifically examining how these models capture lexical and syntactic properties at different neural network layers. Using the BERT architecture, we analyze the representations of its layers for two different verb-particle constructions: phrasal verbs like “give up” and prepositional verbs like “look at”. Our methodology includes training probing classifiers on the model output to classify these categories at both word and sentence levels. The results indicate that the model’s middle layers achieve the highest classification accuracies. To further analyze the nature of these distinctions, we conduct a data separability test using the Generalized Discrimination Value (GDV). While GDV results show weak linear separability between the two verb types, probing classifiers still achieve high accuracy, suggesting that representations of these linguistic categories may be “non-linearly separable”. This aligns with previous research indicating that linguistic distinctions in neural networks are not always encoded in a linearly separable manner. These findings computationally support usage-based claims on the representation of verb-particle constructions and highlight the complex interaction between neural network architectures and linguistic structures.
Anthology ID:
2025.mwe-1.2
Volume:
Proceedings of the 21st Workshop on Multiword Expressions (MWE 2025)
Month:
May
Year:
2025
Address:
Albuquerque, New Mexico, U.S.A.
Editors:
Atul Kr. Ojha, Voula Giouli, Verginica Barbu Mititelu, Mathieu Constant, Gražina Korvel, A. Seza Doğruöz, Alexandre Rademaker
Venues:
MWE | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7–13
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.mwe-1.2/
DOI:
Bibkey:
Cite (ACL):
Hassane Kissane, Achim Schilling, and Patrick Krauss. 2025. Probing Internal Representations of Multi-Word Verbs in Large Language Models. In Proceedings of the 21st Workshop on Multiword Expressions (MWE 2025), pages 7–13, Albuquerque, New Mexico, U.S.A.. Association for Computational Linguistics.
Cite (Informal):
Probing Internal Representations of Multi-Word Verbs in Large Language Models (Kissane et al., MWE 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.mwe-1.2.pdf