Abstract
Accurate prediction of user attributes from social media is valuable for both social science analysis and consumer targeting. In this paper, we propose a systematic method that leverages users' online social media content to predict their offline restaurant consumption level. We use social login as a bridge to construct a dataset of 8,844 users linked across Dianping (similar to Yelp) and Sina Weibo. More specifically, we construct consumption-level ground truth from users' self-reported spending. We build predictive models using both raw features and, especially, latent features such as topic distributions and celebrity clusters. The employed methods demonstrate that online social media content has strong predictive power for offline spending. Finally, combined with qualitative feature analysis, we present the differences in word usage, topic interests, and following behavior between different consumption-level groups.
- Anthology ID:
- C16-1314
- Volume:
- Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
- Month:
- December
- Year:
- 2016
- Address:
- Osaka, Japan
- Editors:
- Yuji Matsumoto, Rashmi Prasad
- Venue:
- COLING
- Publisher:
- The COLING 2016 Organizing Committee
- Pages:
- 3328–3338
- URL:
- https://aclanthology.org/C16-1314
- Cite (ACL):
- Yang Xiao, Yuan Wang, Hangyu Mao, and Zhen Xiao. 2016. Predicting Restaurant Consumption Level through Social Media Footprints. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 3328–3338, Osaka, Japan. The COLING 2016 Organizing Committee.
- Cite (Informal):
- Predicting Restaurant Consumption Level through Social Media Footprints (Xiao et al., COLING 2016)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/C16-1314.pdf