Abstract
We present two models for combining word and character embeddings for cause-of-death classification of verbal autopsy reports using the text of the narratives. We find that for smaller datasets (500 to 1000 records), adding character information to the model improves classification, making character-based CNNs a promising method for automated verbal autopsy coding.- Anthology ID:
- W19-5025
- Volume:
- Proceedings of the 18th BioNLP Workshop and Shared Task
- Month:
- August
- Year:
- 2019
- Address:
- Florence, Italy
- Venue:
- BioNLP
- SIG:
- SIGBIOMED
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 234–239
- Language:
- URL:
- https://aclanthology.org/W19-5025
- DOI:
- 10.18653/v1/W19-5025
- Cite (ACL):
- Zhaodong Yan, Serena Jeblee, and Graeme Hirst. 2019. Can Character Embeddings Improve Cause-of-Death Classification for Verbal Autopsy Narratives?. In Proceedings of the 18th BioNLP Workshop and Shared Task, pages 234–239, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Can Character Embeddings Improve Cause-of-Death Classification for Verbal Autopsy Narratives? (Yan et al., BioNLP 2019)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/W19-5025.pdf