Abstract
The aim of this experiment is to present an easy way to compare fragments of texts in order to detect (supposed) results of copy & paste operations between articles in the domain of Natural Language Processing (NLP). The search space of the comparisons is a corpus labeled as NLP4NLP gathering a large part of the NLP field. The study is centered on LREC papers in both directions, first with an LREC paper borrowing a fragment of text from the collection, and secondly in the reverse direction with fragments of LREC documents borrowed and inserted in the collection.- Anthology ID:
- L16-1298
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 1890–1897
- Language:
- URL:
- https://aclanthology.org/L16-1298
- DOI:
- Cite (ACL):
- Gil Francopoulo, Joseph Mariani, and Patrick Paroubek. 2016. A Study of Reuse and Plagiarism in LREC papers. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1890–1897, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- A Study of Reuse and Plagiarism in LREC papers (Francopoulo et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/L16-1298.pdf