High-Quality Medical Dialogue Synthesis for Improving EMR Generation

Chengze Ge, Yu Xu, Qi Shao, Shengping Liu


Abstract
High-quality doctor–patient dialogues, by which we mean realistic and human-like interactions that are intent-consistent, clinically faithful, and free of contradictions, are crucial for accurate Electronic Medical Record (EMR) generation. However, collecting large-scale real dialogues is costly and constrained by privacy regulations, while existing synthetic methods often yield rigid and medically inconsistent dialogues. We propose a scalable framework integrating (1) Intent Graph Planning for diverse clinical flows, (2) Dual-Agent Simulation for realistic doctor-patient interactions, and (3) Rule-Reward Quality Control combining explicit medical rules with a self-supervised reward model. Experiments across multiple clinical domains demonstrate that our synthesized dialogues significantly enhance realism, diversity, and downstream EMR quality, substantially reducing physician editing efforts. Our framework provides a practical and privacy-compliant solution for deploying robust clinical NLP systems.
Anthology ID:
2025.emnlp-industry.181
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Month:
November
Year:
2025
Address:
Suzhou (China)
Editors:
Saloni Potdar, Lina Rojas-Barahona, Sebastien Montella
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2675–2687
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-industry.181/
DOI:
Bibkey:
Cite (ACL):
Chengze Ge, Yu Xu, Qi Shao, and Shengping Liu. 2025. High-Quality Medical Dialogue Synthesis for Improving EMR Generation. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 2675–2687, Suzhou (China). Association for Computational Linguistics.
Cite (Informal):
High-Quality Medical Dialogue Synthesis for Improving EMR Generation (Ge et al., EMNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-industry.181.pdf