Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach

Stefania Degaetano-Ortlieb, Ekaterina Lapshinova-Koltunski, Elke Teich


Abstract
In this paper, we present corpus-based procedures for semi-automatically discovering features relevant to the study of recent language change in scientific registers. First, linguistic features potentially associated with recent language change are extracted from the SciTex Corpus. Second, these features are assessed for their relevance to the study of recent language change in scientific registers by means of correspondence analysis. The discovered features will serve as a basis for further investigations of the linguistic evolution of newly emerged scientific registers.
Anthology ID:
L12-1111
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2786–2790
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/268_Paper.pdf
Cite (ACL):
Stefania Degaetano-Ortlieb, Ekaterina Lapshinova-Koltunski, and Elke Teich. 2012. Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2786–2790, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Feature Discovery for Diachronic Register Analysis: a Semi-Automatic Approach (Degaetano-Ortlieb et al., LREC 2012)
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/268_Paper.pdf