Reproducing the Argument Quality Prediction of Project Debater

Ines Zelch, Matthias Hagen, Benno Stein, Johannes Kiesel


Abstract
A crucial task when analyzing arguments is to determine their quality. Especially when you have to choose from a large number of suitable arguments, the determination of a reliable argument quality value is of great benefit. Probably the best-known model for determining such an argument quality value was developed in IBM’s Project Debater and made available to the research community free of charge via an API. In fact, the model was never open and the API is no longer available. In this paper, IBM’s model is reproduced using the freely available training data and the description in the corresponding publication. Our reproduction achieves similar results on the test data as described in the original publication. Further, the predicted quality scores of reproduction and original show a very high correlation (Pearson’s r=0.9) on external data.
Anthology ID:
2025.argmining-1.17
Volume:
Proceedings of the 12th Argument mining Workshop
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Elena Chistova, Philipp Cimiano, Shohreh Haddadan, Gabriella Lapesa, Ramon Ruiz-Dolz
Venues:
ArgMining | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
181–188
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.argmining-1.17/
DOI:
Bibkey:
Cite (ACL):
Ines Zelch, Matthias Hagen, Benno Stein, and Johannes Kiesel. 2025. Reproducing the Argument Quality Prediction of Project Debater. In Proceedings of the 12th Argument mining Workshop, pages 181–188, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Reproducing the Argument Quality Prediction of Project Debater (Zelch et al., ArgMining 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.argmining-1.17.pdf