Reinforcement Learning for Adversarial Query Generation to Enhance Relevance in Cold-Start Product Search

Akshay Jagatap, Neeraj Anand, Sonali Singh, Prakash Mandayam Comar


Abstract
Accurate mapping of queries to product categories is crucial for efficient retrieval and ranking of relevant products in e-commerce search. Conventionally, such query classification models rely on supervised learning using historical user interactions, but their effectiveness diminishes in cold-start scenarios, where new categories or products lack sufficient training data. This results in poor query-to-category mappings, negatively affecting retrieval and ranking. Synthetic query generation has emerged as a promising solution by augmenting training data; however, existing methods do not incorporate feedback from the query relevance model, limiting their ability to generate queries that enhance product retrieval. To address this, we propose an adversarial reinforcement learning framework that optimizes an LLM-based generator to expose weaknesses in query classification models. The generator produces synthetic queries to augment the classifier’s training set, ultimately improving its performance. Additionally, we introduce a structured reward signal to ensure stable training. Experiments on public datasets show an average PR-AUC improvement of +1.82% on benchmarks and +3.26% on a proprietary dataset, demonstrating the framework’s effectiveness in enhancing query classification and mitigating cold-start challenges.
Anthology ID:
2025.acl-industry.91
Volume:
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Georg Rehm, Yunyao Li
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1300–1307
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.acl-industry.91/
DOI:
Bibkey:
Cite (ACL):
Akshay Jagatap, Neeraj Anand, Sonali Singh, and Prakash Mandayam Comar. 2025. Reinforcement Learning for Adversarial Query Generation to Enhance Relevance in Cold-Start Product Search. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), pages 1300–1307, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Reinforcement Learning for Adversarial Query Generation to Enhance Relevance in Cold-Start Product Search (Jagatap et al., ACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.acl-industry.91.pdf