Yibin Ni


2025

pdf bib
Creating a Lens of Chinese Culture: A Multimodal Dataset for Chinese Pun Rebus Art Understanding
Tuo Zhang | Tiantian Feng | Yibin Ni | Mengqin Cao | Ruying Liu | Kiana Avestimehr | Katharine Butler | Yanjun Weng | Mi Zhang | Shrikanth Narayanan | Salman Avestimehr
Findings of the Association for Computational Linguistics: ACL 2025

Large vision-language models (VLMs) have demonstrated remarkable abilities in understanding everyday content. However, their performance in the domain of art, particularly culturally rich art forms, remains less explored. As a pearl of human wisdom and creativity, art encapsulates complex cultural narratives and symbolism. In this paper, we offer the Pun Rebus Art Dataset, a multimodal dataset for art understanding deeply rooted in traditional Chinese culture. We focus on three primary tasks: identifying salient visual elements, matching elements with their symbolic meanings, and explanations for the conveyed messages. Our evaluation reveals that state-of-the-art VLMs struggle with these tasks, often providing biased and hallucinated explanations and showing limited improvement through in-context learning. By releasing the Pun Rebus Art Dataset, we aim to facilitate the development of VLMs that can better understand and interpret culturally specific content, promoting greater inclusiveness beyond English-based corpora. The dataset and evaluation code are available at [this link](https://github.com/zhang-tuo-pdf/Pun-Rebus-Art-Benchmark).

1998

pdf bib
Co-Interpretation Network in English Discourse
Yibin Ni
Proceedings of the 12th Pacific Asia Conference on Language, Information and Computation