Recognition of Genuine Polish Suicide Notes

Maciej Piasecki, Ksenia Młynarczyk, Jan Kocoń


Abstract
In this article we present the result of the recent research in the recognition of genuine Polish suicide notes (SNs). We provide useful method to distinguish between SNs and other types of discourse, including counterfeited SNs. The method uses a wide range of word-based and semantic features and it was evaluated using Polish Corpus of Suicide Notes, which contains 1244 genuine SNs, expanded with manually prepared set of 334 counterfeited SNs and 2200 letter-like texts from the Internet. We utilized the algorithm to create the class-related sense dictionaries to improve the result of SNs classification. The obtained results show that there are fundamental differences between genuine SNs and counterfeited SNs. The applied method of the sense dictionary construction appeared to be the best way of improving the model.
Anthology ID:
R17-1076
Volume:
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
Month:
September
Year:
2017
Address:
Varna, Bulgaria
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
583–591
Language:
URL:
https://doi.org/10.26615/978-954-452-049-6_076
DOI:
10.26615/978-954-452-049-6_076
Bibkey:
Cite (ACL):
Maciej Piasecki, Ksenia Młynarczyk, and Jan Kocoń. 2017. Recognition of Genuine Polish Suicide Notes. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 583–591, Varna, Bulgaria. INCOMA Ltd..
Cite (Informal):
Recognition of Genuine Polish Suicide Notes (Piasecki et al., RANLP 2017)
Copy Citation:
PDF:
https://doi.org/10.26615/978-954-452-049-6_076