Authorship Identification of Romanian Texts with Controversial Paternity

Liviu Dinu, Marius Popescu, Anca Dinu


Abstract
In this work we propose a new strategy for the authorship identification problem and we test it on an example from Romanian literature: did Radu Albala found the continuation of Mateiu Caragiale’s novel Sub pecetea tainei, or did he write himself the respective continuation? The proposed strategy is based on the similarity of rankings of function words; we compare the obtained results with the results obtained by a learning method (namely Support Vector Machines -SVM- with a string kernel).
Anthology ID:
L08-1343
Volume:
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
Month:
May
Year:
2008
Address:
Marrakech, Morocco
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2008/pdf/862_paper.pdf
DOI:
Bibkey:
Cite (ACL):
Liviu Dinu, Marius Popescu, and Anca Dinu. 2008. Authorship Identification of Romanian Texts with Controversial Paternity. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco. European Language Resources Association (ELRA).
Cite (Informal):
Authorship Identification of Romanian Texts with Controversial Paternity (Dinu et al., LREC 2008)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2008/pdf/862_paper.pdf