Categorizing Comparative Sentences
Alexander Panchenko, Alexander Bondarenko, Mirco Franzek, Matthias Hagen, Chris Biemann
Abstract
We tackle the tasks of automatically identifying comparative sentences and categorizing the intended preference (e.g., “Python has better NLP libraries than MATLAB” → Python, better, MATLAB). To this end, we manually annotate 7,199 sentences for 217 distinct target item pairs from several domains (27% of the sentences contain an oriented comparison in the sense of “better” or “worse”). A gradient boosting model based on pre-trained sentence embeddings reaches an F1 score of 85% in our experimental evaluation. The model can be used to extract comparative sentences for pro/con argumentation in comparative / argument search engines or debating technologies.- Anthology ID:
- W19-4516
- Volume:
- Proceedings of the 6th Workshop on Argument Mining
- Month:
- August
- Year:
- 2019
- Address:
- Florence, Italy
- Venue:
- ArgMining
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 136–145
- Language:
- URL:
- https://aclanthology.org/W19-4516
- DOI:
- 10.18653/v1/W19-4516
- Cite (ACL):
- Alexander Panchenko, Alexander Bondarenko, Mirco Franzek, Matthias Hagen, and Chris Biemann. 2019. Categorizing Comparative Sentences. In Proceedings of the 6th Workshop on Argument Mining, pages 136–145, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Categorizing Comparative Sentences (Panchenko et al., ArgMining 2019)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/W19-4516.pdf
- Code
- uhh-lt/comparative + additional community code