Abstract
User reviews provide a significant source of information for companies to understand their market and audience. In order to discover broad trends in this source, researchers have typically used topic models such as Latent Dirichlet Allocation (LDA). However, while there are metrics to choose the “best” number of topics, it is not clear whether the resulting topics can also provide in-depth, actionable product analysis. Our paper examines this issue by analyzing user reviews from the Best Buy US website for smart speakers. Using coherence scores to choose topics, we test whether the results help us to understand user interests and concerns. We find that while coherence scores are a good starting point to identify a number of topics, it still requires manual adaptation based on domain knowledge to provide market insights. We show that the resulting dimensions capture brand performance and differences, and differentiate the market into two distinct groups with different properties.- Anthology ID:
- D19-5510
- Volume:
- Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019)
- Month:
- November
- Year:
- 2019
- Address:
- Hong Kong, China
- Editors:
- Wei Xu, Alan Ritter, Tim Baldwin, Afshin Rahimi
- Venue:
- WNUT
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 76–83
- Language:
- URL:
- https://aclanthology.org/D19-5510
- DOI:
- 10.18653/v1/D19-5510
- Cite (ACL):
- Hanh Nguyen and Dirk Hovy. 2019. Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers. In Proceedings of the 5th Workshop on Noisy User-generated Text (W-NUT 2019), pages 76–83, Hong Kong, China. Association for Computational Linguistics.
- Cite (Informal):
- Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers (Nguyen & Hovy, WNUT 2019)
- PDF:
- https://preview.aclanthology.org/fix-dup-bibkey/D19-5510.pdf