The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015

Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Jorge Valderrama, Hugo Zaragoza


Abstract
In this paper we present the OnForumS corpus developed for the shared task of the same name on Online Forum Summarisation (OnForumS at MultiLing’15). The corpus consists of a set of news articles with associated readers’ comments from The Guardian (English) and La Repubblica (Italian). It comes with four levels of annotation: argument structure, comment-article linking, sentiment and coreference. The former three were produced through crowdsourcing, whereas the latter, by an experienced annotator using a mature annotation scheme. Given its annotation breadth, we believe the corpus will prove a useful resource in stimulating and furthering research in the areas of Argumentation Mining, Summarisation, Sentiment, Coreference and the interlinks therein.
Anthology ID:
L16-1131
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
814–818
Language:
URL:
https://aclanthology.org/L16-1131
DOI:
Bibkey:
Cite (ACL):
Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Jorge Valderrama, and Hugo Zaragoza. 2016. The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 814–818, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015 (Kabadjov et al., LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/L16-1131.pdf