The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015
Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Jorge Valderrama, Hugo Zaragoza
Abstract
In this paper we present the OnForumS corpus developed for the shared task of the same name on Online Forum Summarisation (OnForumS at MultiLing’15). The corpus consists of a set of news articles with associated readers’ comments from The Guardian (English) and La Repubblica (Italian). It comes with four levels of annotation: argument structure, comment-article linking, sentiment and coreference. The first three were produced through crowdsourcing, whereas the last was produced by an experienced annotator using a mature annotation scheme. Given its annotation breadth, we believe the corpus will prove a useful resource in stimulating and furthering research in the areas of Argumentation Mining, Summarisation, Sentiment, Coreference and the interlinks therein.
- Anthology ID:
- L16-1131
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Venue:
- LREC
- Publisher:
- European Language Resources Association (ELRA)
- Pages:
- 814–818
- URL:
- https://aclanthology.org/L16-1131
- Cite (ACL):
- Mijail Kabadjov, Udo Kruschwitz, Massimo Poesio, Josef Steinberger, Jorge Valderrama, and Hugo Zaragoza. 2016. The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 814–818, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- The OnForumS corpus from the Shared Task on Online Forum Summarisation at MultiLing 2015 (Kabadjov et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/L16-1131.pdf