Abstract
This paper describes our approach to the CLPsych 2021 Shared Task, in which we aimed to predict suicide attempts based on Twitter feed data. We addressed this challenge by emphasizing reliance on prior domain knowledge. We engineered novel theory-driven features, and integrated prior knowledge with empirical evidence in a principled manner using Bayesian modeling. While this theory-guided approach increases bias and lowers accuracy on the training set, it was successful in preventing over-fitting. The models provided reasonable classification accuracy on unseen test data (0.68<=AUC<= 0.84). Our approach may be particularly useful in prediction tasks trained on a relatively small data set.- Anthology ID:
- 2021.clpsych-1.12
- Volume:
- Proceedings of the Seventh Workshop on Computational Linguistics and Clinical Psychology: Improving Access
- Month:
- June
- Year:
- 2021
- Address:
- Online
- Editors:
- Nazli Goharian, Philip Resnik, Andrew Yates, Molly Ireland, Kate Niederhoffer, Rebecca Resnik
- Venue:
- CLPsych
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 103–109
- Language:
- URL:
- https://aclanthology.org/2021.clpsych-1.12
- DOI:
- 10.18653/v1/2021.clpsych-1.12
- Cite (ACL):
- Avi Gamoran, Yonatan Kaplan, Almog Simchon, and Michael Gilead. 2021. Using Psychologically-Informed Priors for Suicide Prediction in the CLPsych 2021 Shared Task. In Proceedings of the Seventh Workshop on Computational Linguistics and Clinical Psychology: Improving Access, pages 103–109, Online. Association for Computational Linguistics.
- Cite (Informal):
- Using Psychologically-Informed Priors for Suicide Prediction in the CLPsych 2021 Shared Task (Gamoran et al., CLPsych 2021)
- PDF:
- https://preview.aclanthology.org/emnlp-22-attachments/2021.clpsych-1.12.pdf