Abstract
Paraphrases exist on different granularity levels, the most frequently used one being the sentential level. However, we argue that working on the sentential level is not optimal for both machines and humans, and that it would be easier and more efficient to work on sub-sentential levels. To prove this, we quantify and analyze the difference between paraphrases on both sentence and sub-sentence level in order to show the significance of the problem. First results on a preliminary dataset seem to confirm our hypotheses.
- Anthology ID:
- R17-1014
- Volume:
- Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
- Month:
- September
- Year:
- 2017
- Address:
- Varna, Bulgaria
- Editors:
- Ruslan Mitkov, Galia Angelova
- Venue:
- RANLP
- Publisher:
- INCOMA Ltd.
- Pages:
- 90–96
- URL:
- https://doi.org/10.26615/978-954-452-049-6_014
- DOI:
- 10.26615/978-954-452-049-6_014
- Cite (ACL):
- Darina Benikova and Torsten Zesch. 2017. Same same, but different: Compositionality of paraphrase granularity levels. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 90–96, Varna, Bulgaria. INCOMA Ltd.
- Cite (Informal):
- Same same, but different: Compositionality of paraphrase granularity levels (Benikova & Zesch, RANLP 2017)
- PDF:
- https://doi.org/10.26615/978-954-452-049-6_014