Abstract
Health-related speech datasets are often small and varied in focus. This makes it difficult to leverage them to effectively support healthcare goals. Robust transfer of linguistic features across different datasets orbiting the same goal carries potential to address this concern. To test this hypothesis, we experiment with domain adaptation (DA) techniques on heterogeneous spoken language data to evaluate generalizability across diverse datasets for a common task: dementia detection. We find that adapted models exhibit better performance across conversational and task-oriented datasets. The feature-augmented DA method achieves a 22% increase in accuracy adapting from a conversational to task-specific dataset compared to a jointly trained baseline. This suggests promising capacity of these techniques to allow for productive use of disparate data for a complex spoken language healthcare task.- Anthology ID:
- 2023.acl-long.668
- Volume:
- Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month:
- July
- Year:
- 2023
- Address:
- Toronto, Canada
- Editors:
- Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 11965–11978
- Language:
- URL:
- https://aclanthology.org/2023.acl-long.668
- DOI:
- 10.18653/v1/2023.acl-long.668
- Cite (ACL):
- Shahla Farzana and Natalie Parde. 2023. Towards Domain-Agnostic and Domain-Adaptive Dementia Detection from Spoken Language. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 11965–11978, Toronto, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Towards Domain-Agnostic and Domain-Adaptive Dementia Detection from Spoken Language (Farzana & Parde, ACL 2023)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2023.acl-long.668.pdf