Abstract
We explore whether social media can provide a window into community real estate (foreclosure rates and price changes) beyond that of traditional economic and demographic variables. We find that language use in Twitter not only predicts real estate outcomes as well as traditional variables across counties, but that including Twitter language in traditional models leads to a significant improvement (e.g., from Pearson r = .50 to r = .59 for price changes). We overcome the challenge of the relative sparsity and noise in Twitter language variables by showing that training on the residual error of the traditional models leads to more accurate overall assessments. Finally, we discover that it is Twitter language related to business (e.g., ‘company’, ‘marketing’) and technology (e.g., ‘technology’, ‘internet’), among others, that yields predictive power over economics.
- Anthology ID:
- E17-2005
- Volume:
- Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
- Month:
- April
- Year:
- 2017
- Address:
- Valencia, Spain
- Editors:
- Mirella Lapata, Phil Blunsom, Alexander Koller
- Venue:
- EACL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 28–33
- URL:
- https://aclanthology.org/E17-2005
- Cite (ACL):
- Mohammadzaman Zamani and H. Andrew Schwartz. 2017. Using Twitter Language to Predict the Real Estate Market. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 28–33, Valencia, Spain. Association for Computational Linguistics.
- Cite (Informal):
- Using Twitter Language to Predict the Real Estate Market (Zamani & Schwartz, EACL 2017)
- PDF:
- https://preview.aclanthology.org/ml4al-ingestion/E17-2005.pdf