Advancing E-commerce Merchants Telemarketing with Synthetic Data-Driven LLMs

Qi Gou, Zehua Xia, Li Juan, Qingyang Zhao, Wenjing Yang


Abstract
Telemarketing towards merchants is considerably more complex than traditional dialogue systems. Given a user utterance, the response must not only follow the context but also strategically and naturally guide the conversation toward marketing objectives. A common approach is to fine-tune LLMs using high-quality dialogue data from top sales. However, we find that even after careful data cleaning, these data cannot be used directly due to two issues:(1) Poor strategy-following: Real-world conversations are highly random with much chit-chat topics, easily leading deviation from intended strategy.(2) Insufficient expert knowledge learning: Expert knowledge appears infrequently or not at all in limited collected corpus.To this end, we introduce a hybrid data synthesis framework with two main innovations. First, we unify the input schema with profile and strategy designed by top sales, and extract them via a Multi-task paradigm.In addition, we propose Role-playing Simulation and Session Prefix Completion to complementarily improve the strategy-following and inject long-tail expert knowledge into our fine-tuned model – TeleBot.Comprehensive online and offline evaluations demonstrate its effectiveness.In particular, in terms of the final marketing results – High Intention Rate, TeleBot reaches the performance level of the top 25% of human sales.
Anthology ID:
2025.emnlp-industry.150
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Month:
November
Year:
2025
Address:
Suzhou (China)
Editors:
Saloni Potdar, Lina Rojas-Barahona, Sebastien Montella
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2146–2154
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-industry.150/
DOI:
Bibkey:
Cite (ACL):
Qi Gou, Zehua Xia, Li Juan, Qingyang Zhao, and Wenjing Yang. 2025. Advancing E-commerce Merchants Telemarketing with Synthetic Data-Driven LLMs. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 2146–2154, Suzhou (China). Association for Computational Linguistics.
Cite (Informal):
Advancing E-commerce Merchants Telemarketing with Synthetic Data-Driven LLMs (Gou et al., EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-industry.150.pdf