Sentimatic: Sentiment-guided Automatic Generation of Preference Datasets for Customer Support Dialogue System

Suhyun Lee, ChangHeon Han


Abstract
Supervised Fine-tuning (SFT) and preference optimization (PO) are key methods for enhancing language models and aligning them with human preferences. However, scaling preference datasets for PO training is challenging, leading AI customer support systems to rely on SFT. To address this, we propose the Sentiment-guided Automatic Generation of Preference Datasets (Sentimatic) methodology to automatically generate customer preference datasets without human intervention using a publicly available dataset constructed for SFT. Our approach classifies responses by sentiment, fine-tunes models on them, and applies advanced sampling and evaluation techniques to ensure diversity and quality. Ultimately, we generated 1,174 customer preference datasets based on 357 test datasets, and through experiments, we confirmed that the AI customer support system trained on these datasets is capable of carefully considering customer emotions and generating professional and appropriate responses.
Anthology ID:
2025.naacl-srw.12
Volume:
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 4: Student Research Workshop)
Month:
April
Year:
2025
Address:
Albuquerque, USA
Editors:
Abteen Ebrahimi, Samar Haider, Emmy Liu, Sammar Haider, Maria Leonor Pacheco, Shira Wein
Venues:
NAACL | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
120–128
Language:
URL:
https://preview.aclanthology.org/moar-dois/2025.naacl-srw.12/
DOI:
10.18653/v1/2025.naacl-srw.12
Bibkey:
Cite (ACL):
Suhyun Lee and ChangHeon Han. 2025. Sentimatic: Sentiment-guided Automatic Generation of Preference Datasets for Customer Support Dialogue System. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 4: Student Research Workshop), pages 120–128, Albuquerque, USA. Association for Computational Linguistics.
Cite (Informal):
Sentimatic: Sentiment-guided Automatic Generation of Preference Datasets for Customer Support Dialogue System (Lee & Han, NAACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/moar-dois/2025.naacl-srw.12.pdf