Why do objects have many names? A study on word informativeness in language use and lexical systems

Eleonora Gualdoni, Gemma Boleda


Abstract
Human lexicons contain many different words that speakers can use to refer to the same object, e.g., *purple* or *magenta* for the same shade of color. On the one hand, studies on language use have explored how speakers adapt their referring expressions to successfully communicate in context, without focusing on properties of the lexical system. On the other hand, studies in language evolution have discussed how competing pressures for informativeness and simplicity shape lexical systems, without tackling in-context communication. We aim at bridging the gap between these traditions, and explore why a soft mapping between referents and words is a good solution for communication, by taking into account both in-context communication and the structure of the lexicon. We propose a simple measure of informativeness for words and lexical systems, grounded in a visual space, and analyze color naming data for English and Mandarin Chinese. We conclude that optimal lexical systems are those where multiple words can apply to the same referent, conveying different amounts of information. Such systems allow speakers to maximize communication accuracy and minimize the amount of information they convey when communicating about referents in contexts.
Anthology ID:
2024.emnlp-main.1009
Volume:
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
18150–18163
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2024.emnlp-main.1009/
DOI:
10.18653/v1/2024.emnlp-main.1009
Bibkey:
Cite (ACL):
Eleonora Gualdoni and Gemma Boleda. 2024. Why do objects have many names? A study on word informativeness in language use and lexical systems. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 18150–18163, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Why do objects have many names? A study on word informativeness in language use and lexical systems (Gualdoni & Boleda, EMNLP 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2024.emnlp-main.1009.pdf