Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation

Jiazhou Liang; Yifan Simon Liu; David Guo; Yilun Jiang; Minqi Sun; Scott Sanner

Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation

Jiazhou Liang, Yifan Simon Liu, David Guo, Yilun Jiang, Minqi Sun, Scott Sanner

Abstract

The growing ubiquity of Extended Reality (XR) is driving Conversational Recommendation Systems (CRS) toward visually immersive experiences. We formalize this paradigm as Immersive CRS (ICRS), where recommended items are highlighted directly in the user’s scene-based visual environment and augmented with in-situ labels. While item recommendation has been widely studied, the problem of how to select and evaluate which information to present as immersive labels remains an open problem. To this end, we introduce a principled categorization of information needs into explicit intent satisfaction and proactive information needs and use these to define novel evaluation metrics for item label selection. We benchmark IR-, LLM-, and VLM-based methods across three datasets and ICRS scenarios: fashion, movie recommendation, and retail shopping. Our evaluation reveals three important limitations of existing methods: (1) they fail to leverage scenario-specific information modalities (e.g., visual cues for fashion, metadata for retail), (2) they present redundant information that is visually inferable, and (3) they poorly anticipate users’ proactive information needs from explicit dialogue alone. In summary, this work provides both a novel evaluation paradigm for in-situ item labeling in ICRS and highlights key challenges for future work.

Anthology ID:: 2026.findings-acl.2033
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 40932–40953
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2033/
DOI:
Bibkey:
Cite (ACL):: Jiazhou Liang, Yifan Simon Liu, David Guo, Yilun Jiang, Minqi Sun, and Scott Sanner. 2026. Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 40932–40953, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Evaluating Scene-based In-Situ Item Labeling for Immersive Conversational Recommendation (Liang et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2033.pdf
Checklist:: 2026.findings-acl.2033.checklist.pdf

PDF Cite Search Checklist Fix data