DeepResearch Retail: Benchmarking Tool-Augmented Deep Research in the E-Commerce Domain

Rafael Ferreira, Flavio Di Palo, Huilin Lu, Ayush Jain, Harsha Aduri


Abstract
Deep Research (DR) systems autonomously retrieve and synthesize information from web sources; however, industrial DR applications face a critical gap: effective integration of internal tools with web search. In this work, we introduce DeepResearch Retail, an evaluation framework grounded in real-world e-commerce data for assessing Deep Research with tools (DR+Tools) in realistic commercial settings. The framework evaluates both factual faithfulness and multidimensional response quality when reasoning over heterogeneous web and internal data sources. We further present Hybrid-ReAct, a multi-agent architecture that demonstrates how collaborative reasoning and tool use can produce evidence-grounded answers. Experimental results validate our framework’s utility, showing improvements in agent performance when leveraging web-page information and multi-agent specialization.
Anthology ID:
2026.acl-industry.26
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:
July
Year:
2026
Address:
San Diego, California, USA
Editors:
Yunyao Li, Georg Rehm, Mei Tu
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
386–409
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-industry.26/
Cite (ACL):
Rafael Ferreira, Flavio Di Palo, Huilin Lu, Ayush Jain, and Harsha Aduri. 2026. DeepResearch Retail: Benchmarking Tool-Augmented Deep Research in the E-Commerce Domain. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 386–409, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):
DeepResearch Retail: Benchmarking Tool-Augmented Deep Research in the E-Commerce Domain (Ferreira et al., ACL 2026)
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-industry.26.pdf