Is a Peeled Apple Still Red? Evaluating LLMs’ Ability for Conceptual Combination with Property Type

Seokwon Song, Taehyun Lee, Jaewoo Ahn, Jae Hyuk Sung, Gunhee Kim


Abstract
Conceptual combination is a cognitive process that merges basic concepts, enabling the creation of complex expressions. During this process, the properties of combination (e.g., the whiteness of a peeled apple) can be inherited from basic concepts, newly emerge, or be canceled. However, previous studies have evaluated a limited set of properties and have not examined the generative process.To address this gap, we introduce the Conceptual Combination with Property Type dataset (CCPT), which consists of 12.3K annotated triplets of noun phrases, properties, and property types. Using CCPT, we establish three types of tasks to evaluate LLMs for conceptual combination thoroughly.Our key findings are threefold:(1) Our automatic metric grading property emergence and cancellation closely corresponds with human judgments.(2) LLMs, including OpenAI’s o1, struggle to generate noun phrases which possess given emergent properties.(3) Our proposed method, inspired by cognitive psychology model that explains how relationships between concepts are formed, improves performances in all generative tasks.The dataset and experimental code are available at https://github.com/seokwon99/CCPT.git.
Anthology ID:
2025.naacl-long.456
Volume:
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:
April
Year:
2025
Address:
Albuquerque, New Mexico
Editors:
Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
9028–9048
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.naacl-long.456/
DOI:
Bibkey:
Cite (ACL):
Seokwon Song, Taehyun Lee, Jaewoo Ahn, Jae Hyuk Sung, and Gunhee Kim. 2025. Is a Peeled Apple Still Red? Evaluating LLMs’ Ability for Conceptual Combination with Property Type. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 9028–9048, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):
Is a Peeled Apple Still Red? Evaluating LLMs’ Ability for Conceptual Combination with Property Type (Song et al., NAACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.naacl-long.456.pdf