Irene Lopez Garcia
2026
SKILL-IR-Discourse: A Large, Annotated Corpus of Argumentation and Domain Discourse on International Relations
Magdalena Wolska | Matti Wiegmann | Sassan Gholiagha | Mitja Sienknecht | Dora Kiesel | Irene Lopez Garcia | Patrick Riehmann | Bernd Fröhlich | Katrin Girgensohn | Jürgen Neyer | Benno Stein
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Magdalena Wolska | Matti Wiegmann | Sassan Gholiagha | Mitja Sienknecht | Dora Kiesel | Irene Lopez Garcia | Patrick Riehmann | Bernd Fröhlich | Katrin Girgensohn | Jürgen Neyer | Benno Stein
Proceedings of the Fifteenth Language Resources and Evaluation Conference
We present a large annotated corpus of scholarly discourse in the domain of International Relations, a subfield of political science. The corpus comprises 190 articles (over 1500K tokens) annotated at the argumentation, basic rhetorical, and domain level. Five of the included articles (ca. 62K tokens) constitute a Gold-standard, coded by domain experts. The remaining articles were coded by annotators trained on the Gold-standard and monitored for annotation quality. We describe our corpus creation methodology, the annotation process and quality assurance, the corpus itself, and present insights into the data: Most argumentative structures in the data are simple premise-conclusion structures, fewer than half of the claims have explicit supporting evidence. Counter-arguments to claims are rare. The claim-to-support ratio varies widely between articles; possibly to some extent due to the topics covered (with clear common ground) or to the differences between authors’ styles. The distribution of theoretical vs. evaluative statements varies strongly between articles; this can be attributed to such factors as different methodological approaches between the articles and the methodological focus of the publishing journal.
2025
Argumentation and Domain Discourse in Scholarly Articles on the Theory of International Relations
Magdalena Wolska | Sassan Gholiagha | Mitja Sienknecht | Dora Kiesel | Irene Lopez Garcia | Patrick Riehmann | Matti Wiegmann | Bernd Froehlich | Katrin Girgensohn | Jürgen Neyer | Benno Stein
Proceedings of the 31st International Conference on Computational Linguistics
Magdalena Wolska | Sassan Gholiagha | Mitja Sienknecht | Dora Kiesel | Irene Lopez Garcia | Patrick Riehmann | Matti Wiegmann | Bernd Froehlich | Katrin Girgensohn | Jürgen Neyer | Benno Stein
Proceedings of the 31st International Conference on Computational Linguistics
We present the first dataset, an annotation scheme, discourse analysis, and baseline experiments on argumentation and domain content types in scholarly articles on political science, specifically on the theory of International Relations (IR). The dataset comprises over 1 600 sentences stemming from three foundational articles on Neo-Realism, Liberalism, and Constructivism. We show that our annotation scheme enables educationally-relevant insight into the scholarly IR discourse and that state-of-the-art classifiers, while effective in distinguishing basic argumentative elements (Claims and Support/Attack relations) reaching up to 0.97 micro F1 , require domain-specific training and fine-tuning on the more fine-grained tasks of relation and content type prediction.