No Context Needed: Contextual Quandary In Idiomatic Reasoning With Pre-Trained Language Models

Kellen Cheng; Suma Bhat

doi:10.18653/v1/2024.naacl-long.272

No Context Needed: Contextual Quandary In Idiomatic Reasoning With Pre-Trained Language Models

Abstract

Reasoning in the presence of idiomatic expressions (IEs) remains a challenging frontier in natural language understanding (NLU). Unlike standard text, the non-compositional nature of an IE makes it difficult for model comprehension, as their figurative or non-literal mean- ing usually cannot be inferred from the constituent words alone. It stands to reason that in these challenging circumstances, pre-trained language models (PTLMs) should make use of the surrounding context to infer additional in- formation about the IE. In this paper, we investigate the utilization of said context for idiomatic reasoning tasks, which is under-explored relative to arithmetic or commonsense reason- ing (Liu et al., 2022; Yu et al., 2023). Preliminary findings point to a surprising observation: general purpose PTLMs are actually negatively affected by the context, as performance almost always increases with its removal. In these scenarios, models may see gains of up to 3.89%. As a result, we argue that only IE-aware models remain suitable for idiomatic reasoning tasks, given the unexpected and unexplainable manner in which general purpose PTLMs reason over IEs. Additionally, we conduct studies to examine how models utilize the context in various situations, as well as an in-depth analysis on dataset formation and quality. Finally, we provide some explanations and insights into the reasoning process itself based on our results.

Anthology ID:: 2024.naacl-long.272
Volume:: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:: June
Year:: 2024
Address:: Mexico City, Mexico
Editors:: Kevin Duh, Helena Gomez, Steven Bethard
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4863–4880
Language:
URL:: https://aclanthology.org/2024.naacl-long.272
DOI:: 10.18653/v1/2024.naacl-long.272
Bibkey:
Cite (ACL):: Kellen Cheng and Suma Bhat. 2024. No Context Needed: Contextual Quandary In Idiomatic Reasoning With Pre-Trained Language Models. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 4863–4880, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):: No Context Needed: Contextual Quandary In Idiomatic Reasoning With Pre-Trained Language Models (Cheng & Bhat, NAACL 2024)
Copy Citation:
PDF:: https://preview.aclanthology.org/nschneid-patch-4/2024.naacl-long.272.pdf

PDF Search