Identifying Hijacked Reviews

Monika Daryani, James Caverlee


Abstract
Fake reviews and review manipulation are growing problems on online marketplaces globally. Review Hijacking is a new review manipulation tactic in which unethical sellers “hijack” an existing product page (usually one with many positive reviews), then update the product details like title, photo, and description with those of an entirely different product. With the earlier reviews still attached, the new item appears well-reviewed. So far, little knowledge about hijacked reviews has resulted in little academic research and an absence of labeled data. Hence, this paper proposes a three-part study: (i) we propose a framework to generate synthetically labeled data for review hijacking by swapping products and reviews; (ii) then, we evaluate the potential of both a Siamese LSTM network and BERT sequence pair classifier to distinguish legitimate reviews from hijacked ones using this data; and (iii) we then deploy the best performing model on a collection of 31K products (with 6.5 M reviews) in the original data, where we find 100s of previously unknown examples of review hijacking.
Anthology ID:
2021.ecnlp-1.9
Volume:
Proceedings of the 4th Workshop on e-Commerce and NLP
Month:
August
Year:
2021
Address:
Online
Venue:
ECNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
70–78
Language:
URL:
https://aclanthology.org/2021.ecnlp-1.9
DOI:
10.18653/v1/2021.ecnlp-1.9
Bibkey:
Cite (ACL):
Monika Daryani and James Caverlee. 2021. Identifying Hijacked Reviews. In Proceedings of the 4th Workshop on e-Commerce and NLP, pages 70–78, Online. Association for Computational Linguistics.
Cite (Informal):
Identifying Hijacked Reviews (Daryani & Caverlee, ECNLP 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/paclic-22-ingestion/2021.ecnlp-1.9.pdf