Fact vs. Opinion: the Role of Argumentation Features in News Classification

Tariq Alhindi, Smaranda Muresan, Daniel Preotiuc-Pietro


Abstract
A 2018 study led by the Media Insight Project showed that most journalists think that a clearmarking of what is news reporting and what is commentary or opinion (e.g., editorial, op-ed)is essential for gaining public trust. We present an approach to classify news articles into newsstories (i.e., reporting of factual information) and opinion pieces using models that aim to sup-plement the article content representation with argumentation features. Our hypothesis is thatthe nature of argumentative discourse is important in distinguishing between news stories andopinion articles. We show that argumentation features outperform linguistic features used previ-ously and improve on fine-tuned transformer-based models when tested on data from publishersunseen in training. Automatically flagging opinion pieces vs. news stories can aid applicationssuch as fact-checking or event extraction.
Anthology ID:
2020.coling-main.540
Volume:
Proceedings of the 28th International Conference on Computational Linguistics
Month:
December
Year:
2020
Address:
Barcelona, Spain (Online)
Editors:
Donia Scott, Nuria Bel, Chengqing Zong
Venue:
COLING
SIG:
Publisher:
International Committee on Computational Linguistics
Note:
Pages:
6139–6149
Language:
URL:
https://aclanthology.org/2020.coling-main.540
DOI:
10.18653/v1/2020.coling-main.540
Bibkey:
Cite (ACL):
Tariq Alhindi, Smaranda Muresan, and Daniel Preotiuc-Pietro. 2020. Fact vs. Opinion: the Role of Argumentation Features in News Classification. In Proceedings of the 28th International Conference on Computational Linguistics, pages 6139–6149, Barcelona, Spain (Online). International Committee on Computational Linguistics.
Cite (Informal):
Fact vs. Opinion: the Role of Argumentation Features in News Classification (Alhindi et al., COLING 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2020.coling-main.540.pdf
Data
New York Times Annotated Corpus