Leveraging Discourse Information Effectively for Authorship Attribution

Elisa Ferracane; Su Wang; Raymond Mooney

Leveraging Discourse Information Effectively for Authorship Attribution

Elisa Ferracane, Su Wang, Raymond Mooney

[How to correct problems with metadata yourself]

Abstract

We explore techniques to maximize the effectiveness of discourse information in the task of authorship attribution. We present a novel method to embed discourse features in a Convolutional Neural Network text classifier, which achieves a state-of-the-art result by a significant margin. We empirically investigate several featurization methods to understand the conditions under which discourse features contribute non-trivial performance gains, and analyze discourse embeddings.

Anthology ID:: I17-1059
Volume:: Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
Month:: November
Year:: 2017
Address:: Taipei, Taiwan
Editors:: Greg Kondrak, Taro Watanabe
Venue:: IJCNLP
SIG:
Publisher:: Asian Federation of Natural Language Processing
Note:
Pages:: 584–593
Language:
URL:: https://aclanthology.org/I17-1059
DOI:
Bibkey:
Cite (ACL):: Elisa Ferracane, Su Wang, and Raymond Mooney. 2017. Leveraging Discourse Information Effectively for Authorship Attribution. In Proceedings of the Eighth International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 584–593, Taipei, Taiwan. Asian Federation of Natural Language Processing.
Cite (Informal):: Leveraging Discourse Information Effectively for Authorship Attribution (Ferracane et al., IJCNLP 2017)
Copy Citation:
PDF:: https://preview.aclanthology.org/teach-a-man-to-fish/I17-1059.pdf
Presentation:: I17-1059.Presentation.pdf
Code: elisaF/authorship-attribution-discourse

PDF Search Code Presentation