LLM-based Literal Example Generation for Japanese Multiword Expressions

Mio Ohashi, Hajime Kiyama, Zhidong Ling, Mamoru Komachi


Abstract
We investigate whether large language models (LLMs) can generate literal usage examples for Japanese multiword expressions (MWEs), whose literal readings are structurally low-frequency in available corpora.Prior work on MWEs has largely focused on detecting idiomatic usages in context, leaving literal usages underrepresented particularly for Japanese MWEs whose literal readings are rare and structurally diverse.Because literal readings are rarely attested in corpora, we design a lexicon-grounded setup that uses corpus non-literal usages as contrastive cues for controlled prompting. We evaluate the generated sentences using automatic literalness judgments and human literalness judgments, together with manual inspection.Our results show that providing contrastive non-literal information stabilizes literal generation and improves quality compared with prompts that include only literal information or no hints. In addition, we conduct an LLM-based understanding test that compares model predictions of literal and idiomatic plausibility with human judgments.The results indicate that the model aligns more closely with human judgments for idiomatic interpretations than for literal ones, highlighting the relative difficulty of modeling literal readings of MWEs.The study demonstrates that LLMs can complement existing resources by supplying frequency-independent literal examples and offers a controlled framework for examining contextual meaning understanding of Japanese MWEs.
Anthology ID:
2026.acl-srw.25
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Santosh T.Y.S.S., Juan Diego Rodriguez, Ona de Gibert
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
307–325
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-srw.25/
DOI:
Bibkey:
Cite (ACL):
Mio Ohashi, Hajime Kiyama, Zhidong Ling, and Mamoru Komachi. 2026. LLM-based Literal Example Generation for Japanese Multiword Expressions. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 307–325, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
LLM-based Literal Example Generation for Japanese Multiword Expressions (Ohashi et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-srw.25.pdf