Product Classification in E-Commerce using Distributional Semantics
Vivek Gupta, Harish Karnick, Ashendra Bansal, Pradhuman Jhala
Abstract
Product classification is the task of automatically predicting a taxonomy path for a product in a predefined taxonomy hierarchy given a textual product description or title. For efficient product classification we require a suitable representation for a document (the textual description of a product) feature vector and efficient and fast algorithms for prediction.To address the above challenges, we propose a new distributional semantics representation for document vector formation. We also develop a new two-level ensemble approach utilising (with respect to the taxonomy tree) path-wise, node-wise and depth-wise classifiers to reduce error in the final product classification task. Our experiments show the effectiveness of the distributional representation and the ensemble approach on data sets from a leading e-commerce platform and achieve improved results on various evaluation metrics compared to earlier approaches.- Anthology ID:
- C16-1052
- Volume:
- Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
- Month:
- December
- Year:
- 2016
- Address:
- Osaka, Japan
- Venue:
- COLING
- SIG:
- Publisher:
- The COLING 2016 Organizing Committee
- Note:
- Pages:
- 536–546
- Language:
- URL:
- https://aclanthology.org/C16-1052
- DOI:
- Cite (ACL):
- Vivek Gupta, Harish Karnick, Ashendra Bansal, and Pradhuman Jhala. 2016. Product Classification in E-Commerce using Distributional Semantics. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 536–546, Osaka, Japan. The COLING 2016 Organizing Committee.
- Cite (Informal):
- Product Classification in E-Commerce using Distributional Semantics (Gupta et al., COLING 2016)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/C16-1052.pdf