CoAD: Automatic Diagnosis through Symptom and Disease Collaborative Generation

Huimin Wang, Wai Chung Kwan, Kam-Fai Wong, Yefeng Zheng


Abstract
Automatic diagnosis (AD), a critical application of AI in healthcare, employs machine learning techniques to assist doctors in gathering patient symptom information for precise disease diagnosis. The Transformer-based method utilizes an input symptom sequence, predicts itself through auto-regression, and employs the hidden state of the final symptom to determine the disease. Despite its simplicity and superior performance demonstrated, a decline in disease diagnosis accuracy is observed caused by 1) a mismatch between symptoms observed during training and generation, and 2) the effect of different symptom orders on disease prediction. To address the above obstacles, we introduce the CoAD, a novel disease and symptom collaborative generation framework, which incorporates several key innovations to improve AD: 1) aligning sentence-level disease labels with multiple possible symptom inquiry steps to bridge the gap between training and generation; 2) expanding symptom labels for each sub-sequence of symptoms to enhance annotation and eliminate the effect of symptom order; 3) developing a repeated symptom input schema to effectively and efficiently learn the expanded disease and symptom labels. We evaluate the CoAD framework using four datasets, including three public and one private, and demonstrate that it achieves an average 2.3% improvement over previous state-of-the-art results in automatic disease diagnosis. For reproducibility, we release the code and data at https://github.com/KwanWaiChung/coad.
Anthology ID:
2023.acl-long.350
Volume:
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2023
Address:
Toronto, Canada
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
6348–6361
Language:
URL:
https://aclanthology.org/2023.acl-long.350
DOI:
10.18653/v1/2023.acl-long.350
Bibkey:
Cite (ACL):
Huimin Wang, Wai Chung Kwan, Kam-Fai Wong, and Yefeng Zheng. 2023. CoAD: Automatic Diagnosis through Symptom and Disease Collaborative Generation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6348–6361, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
CoAD: Automatic Diagnosis through Symptom and Disease Collaborative Generation (Wang et al., ACL 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/remove-xml-comments/2023.acl-long.350.pdf