Beyond Query Bias: Candidate-Aware Iterative Refinement for Zero-Shot Composed Image Retrieval
Nan Sun, Jing Tang, Lei Sun, Rui Chen, Yuxing Lu, Xiangxiang Chu, Hefei Ling, Yujun Cai
Abstract
Zero-Shot Composed Image Retrieval (ZS-CIR) retrieves target images using a reference image and modification text without task-specific training. Existing methods typically rely on multimodal large language models (MLLMs) to generate query vectors with pre-trained models like CLIP. However, these constructed queries suffer from an inherent cognitive bias because the candidate distribution is unknown. We propose CoRR, a training-free framework that reframes ZS-CIR as a self-correcting process through bias-aware query refinement. CoRR uses retrieved results as feedback to perceive the candidate distribution. With carefully designed chain-of-thought (CoT) prompting, the MLLM inspects the retrieved candidates to identify intent misalignments in the query and then corrects them via Historical Query Fusion. We also introduce Retrieval-Driven Caption Optimization to provide context-aligned examples, reducing phrasing and style mismatches. Experiments on public benchmarks show that CoRR significantly outperforms existing state-of-the-art methods.
- Anthology ID:
- 2026.findings-acl.1120
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- Publisher:
- Association for Computational Linguistics
- Pages:
- 22318–22329
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1120/
- Cite (ACL):
- Nan Sun, Jing Tang, Lei Sun, Rui Chen, Yuxing Lu, Xiangxiang Chu, Hefei Ling, and Yujun Cai. 2026. Beyond Query Bias: Candidate-Aware Iterative Refinement for Zero-Shot Composed Image Retrieval. In Findings of the Association for Computational Linguistics: ACL 2026, pages 22318–22329, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Beyond Query Bias: Candidate-Aware Iterative Refinement for Zero-Shot Composed Image Retrieval (Sun et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1120.pdf
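The feedback loop outlined in the abstract (retrieve, inspect the candidates, correct the query, fuse it with earlier queries) can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the prompts, the fusion rule, or the caption optimization step. Here the function `critique` is a hypothetical stand-in for the MLLM inspection step, the exponential-decay weighted average in `fuse` is an assumed fusion rule, and embeddings are plain Python lists rather than CLIP outputs.

```python
import math
from typing import Callable, List, Sequence

Vector = List[float]


def normalize(v: Sequence[float]) -> Vector:
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors."""
    return sum(x * y for x, y in zip(normalize(a), normalize(b)))


def retrieve(query: Sequence[float], gallery: List[Vector], k: int) -> List[int]:
    """Return the indices of the k gallery vectors most similar to the query."""
    scores = [cosine(query, g) for g in gallery]
    return sorted(range(len(gallery)), key=lambda i: scores[i], reverse=True)[:k]


def fuse(history: List[Vector], decay: float = 0.5) -> Vector:
    """Assumed fusion rule: weighted average of all past queries,
    where the newest query has weight 1 and older ones decay geometrically."""
    fused = [0.0] * len(history[0])
    for age, q in enumerate(reversed(history)):
        weight = decay ** age
        for d, value in enumerate(normalize(q)):
            fused[d] += weight * value
    return normalize(fused)


def refine(
    initial_query: Sequence[float],
    gallery: List[Vector],
    critique: Callable[[Vector, List[Vector]], Vector],
    rounds: int = 3,
    k: int = 2,
) -> List[int]:
    """Iteratively refine a query using retrieved candidates as feedback.

    `critique` is a hypothetical placeholder for the MLLM step: it receives
    the current query and the retrieved candidates and returns a corrected
    query vector.
    """
    history = [normalize(initial_query)]
    query = history[0]
    for _ in range(rounds):
        top = retrieve(query, gallery, k)
        corrected = critique(query, [gallery[i] for i in top])
        history.append(normalize(corrected))
        query = fuse(history)
    return retrieve(query, gallery, k)
```

With a gallery of `[[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]`, an initial query of `[1.0, 0.0]`, and a `critique` that always answers `[0.0, 1.0]`, the fused query drifts away from the first gallery item across rounds, so the final ranking is led by the second item. Keeping older queries in the average, rather than replacing the query outright, is one simple way to stop a single bad correction from discarding the original intent.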