Using Twitter Language to Predict the Real Estate Market

Mohammadzaman Zamani, H. Andrew Schwartz


Abstract
We explore whether social media can provide a window into community real estate (foreclosure rates and price changes) beyond that of traditional economic and demographic variables. We find that language use in Twitter not only predicts real estate outcomes as well as traditional variables do across counties, but also that including Twitter language in traditional models leads to a significant improvement (e.g. from Pearson r = .50 to r = .59 for price changes). We overcome the challenge of the relative sparsity and noise in Twitter language variables by showing that training on the residual error of the traditional models leads to more accurate overall assessments. Finally, we discover that it is Twitter language related to business (e.g. ‘company’, ‘marketing’) and technology (e.g. ‘technology’, ‘internet’), among others, that yields predictive power over economics.
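The residual-training idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: it assumes a single traditional variable and a single language feature, uses plain one-predictor least squares, and runs on made-up toy values. The names `income`, `tech_usage`, `fit_line`, and the other helpers are hypothetical and chosen only for illustration.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least squares for one predictor; returns (slope, intercept)."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


def predict(model, x):
    """Apply a (slope, intercept) model to a list of inputs."""
    slope, intercept = model
    return [slope * xi + intercept for xi in x]


def sse(y, y_hat):
    """Sum of squared errors between observed and predicted values."""
    return sum((a - b) ** 2 for a, b in zip(y, y_hat))


# Made-up toy values for six hypothetical counties (not data from the paper).
income = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]      # stand-in traditional variable
tech_usage = [0.2, 0.9, 0.1, 0.8, 0.3, 0.7]  # stand-in Twitter language feature
price_change = [1.3, 2.8, 3.1, 4.9, 5.2, 6.6]

# Stage 1: fit the traditional model on the outcome.
base_model = fit_line(income, price_change)
base_pred = predict(base_model, income)

# Stage 2: fit the language feature on what stage 1 left unexplained.
residuals = [y - p for y, p in zip(price_change, base_pred)]
residual_model = fit_line(tech_usage, residuals)
correction = predict(residual_model, tech_usage)

# Final prediction: traditional estimate plus the language-based correction.
combined_pred = [b + c for b, c in zip(base_pred, correction)]

base_error = sse(price_change, base_pred)
combined_error = sse(price_change, combined_pred)
```

In this sketch the language feature only has to explain what the traditional variable missed, so on the training data `combined_error` can never exceed `base_error`. Whether the gain carries over to held-out counties is an empirical question that this toy example does not address.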
Anthology ID:
E17-2005
Volume:
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Mirella Lapata, Phil Blunsom, Alexander Koller
Venue:
EACL
Publisher:
Association for Computational Linguistics
Pages:
28–33
URL:
https://aclanthology.org/E17-2005
Cite (ACL):
Mohammadzaman Zamani and H. Andrew Schwartz. 2017. Using Twitter Language to Predict the Real Estate Market. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 28–33, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
Using Twitter Language to Predict the Real Estate Market (Zamani & Schwartz, EACL 2017)
PDF:
https://preview.aclanthology.org/ml4al-ingestion/E17-2005.pdf