Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation
Jiazhou Liang, Yifan Simon Liu, David Guo, Yilun Jiang, Minqi Sun, Scott Sanner
Abstract
The growing ubiquity of Extended Reality (XR) is driving Conversational Recommendation Systems (CRS) toward visually immersive experiences. We formalize this paradigm as Immersive CRS (ICRS), where recommended items are highlighted directly in the user’s scene-based visual environment and augmented with in-situ labels. While item recommendation has been widely studied, the problem of how to select and evaluate which information to present as immersive labels remains an open problem. To this end, we introduce a principled categorization of information needs into explicit intent satisfaction and proactive information needs and use these to define novel evaluation metrics for item label selection. We benchmark IR-, LLM-, and VLM-based methods across three datasets and ICRS scenarios: fashion, movie recommendation, and retail shopping. Our evaluation reveals three important limitations of existing methods: (1) they fail to leverage scenario-specific information modalities (e.g., visual cues for fashion, metadata for retail), (2) they present redundant information that is visually inferable, and (3) they poorly anticipate users’ proactive information needs from explicit dialogue alone. In summary, this work provides both a novel evaluation paradigm for in-situ item labeling in ICRS and highlights key challenges for future work.- Anthology ID:
- 2026.findings-acl.2033
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 40932–40953
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2033/
- DOI:
- Cite (ACL):
- Jiazhou Liang, Yifan Simon Liu, David Guo, Yilun Jiang, Minqi Sun, and Scott Sanner. 2026. Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 40932–40953, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation (Liang et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2033.pdf