Jointly Learning Author and Annotated Character N-gram Embeddings: A Case Study in Literary Text

Suraj Maharjan, Deepthi Mave, Prasha Shrestha, Manuel Montes, Fabio A. González, Thamar Solorio


Abstract
An author’s way of presenting a story through his/her writing style has a great impact on whether the story will be liked by readers or not. In this paper, we learn representations for authors of literary texts together with representations for character n-grams annotated with their functional roles. We train a neural character n-gram based language model using an external corpus of literary texts and transfer learned representations for use in downstream tasks. We show that augmenting the knowledge from external works of authors produces results competitive with other style-based methods for book likability prediction, genre classification, and authorship attribution.
Anthology ID:
R19-1080
Volume:
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019)
Month:
September
Year:
2019
Address:
Varna, Bulgaria
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
684–692
Language:
URL:
https://aclanthology.org/R19-1080
DOI:
10.26615/978-954-452-056-4_080
Bibkey:
Cite (ACL):
Suraj Maharjan, Deepthi Mave, Prasha Shrestha, Manuel Montes, Fabio A. González, and Thamar Solorio. 2019. Jointly Learning Author and Annotated Character N-gram Embeddings: A Case Study in Literary Text. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019), pages 684–692, Varna, Bulgaria. INCOMA Ltd..
Cite (Informal):
Jointly Learning Author and Annotated Character N-gram Embeddings: A Case Study in Literary Text (Maharjan et al., RANLP 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-4/R19-1080.pdf