Yuang Peng
2025
Modeling Uncertainty in Composed Image Retrieval via Probabilistic Embeddings
Haomiao Tang
|
Jinpeng Wang
|
Yuang Peng
|
GuangHao Meng
|
Ruisheng Luo
|
Bin Chen
|
Long Chen
|
Yaowei Wang
|
Shu-Tao Xia
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Composed Image Retrieval (CIR) enables users to search for images using multimodal queries that combine text and reference images. While metric learning methods have shown promise, they rely on deterministic point embeddings that fail to capture the inherent uncertainty in the input data, in which user intentions may be imprecisely specified or open to multiple interpretations. We address this challenge by reformulating CIR through our proposed Composed Probabilistic Embedding (CoPE) framework, which represents both queries and targets as Gaussian distributions in latent space rather than fixed points. Through careful design of probabilistic distance metrics and hierarchical learning objectives, CoPE explicitly captures uncertainty at both instance and feature levels, enabling more flexible, nuanced, and robust matching that can handle polysemy and ambiguity in search intentions. Extensive experiments across multiple benchmarks demonstrate that CoPE effectively quantifies both quality and semantic uncertainties within Composed Image Retrieval, achieving state-of-the-art performance on recall rate. Code: https://github.com/tanghme0w/ACL25-CoPE.
Search
Fix author
Co-authors
- Bin Chen 1
- Long Chen (陈龙) 1
- Ruisheng Luo 1
- GuangHao Meng 1
- Haomiao Tang 1
- show all...
Venues
- acl1