Stylometric Analysis of Parliamentary Speeches: Gender Dimension

Justina Mandravickaitė, Tomas Krilavičius


Abstract
Relation between gender and language has been studied by many authors, however, there is still some uncertainty left regarding gender influence on language usage in the professional environment. Often, the studied data sets are too small or texts of individual authors are too short in order to capture differences of language usage wrt gender successfully. This study draws from a larger corpus of speeches transcripts of the Lithuanian Parliament (1990-2013) to explore language differences of political debates by gender via stylometric analysis. Experimental set up consists of stylistic features that indicate lexical style and do not require external linguistic tools, namely the most frequent words, in combination with unsupervised machine learning algorithms. Results show that gender differences in the language use remain in professional environment not only in usage of function words, preferred linguistic constructions, but in the presented topics as well.
Anthology ID:
W17-1416
Volume:
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Tomaž Erjavec, Jakub Piskorski, Lidia Pivovarova, Jan Šnajder, Josef Steinberger, Roman Yangarber
Venue:
BSNLP
SIG:
SIGSLAV
Publisher:
Association for Computational Linguistics
Note:
Pages:
102–107
Language:
URL:
https://aclanthology.org/W17-1416
DOI:
10.18653/v1/W17-1416
Bibkey:
Cite (ACL):
Justina Mandravickaitė and Tomas Krilavičius. 2017. Stylometric Analysis of Parliamentary Speeches: Gender Dimension. In Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, pages 102–107, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Stylometric Analysis of Parliamentary Speeches: Gender Dimension (Mandravickaitė & Krilavičius, BSNLP 2017)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/W17-1416.pdf