Abstract
Spatial aggregation refers to merging of documents created at the same spatial location. We show that by spatial aggregation of a large collection of documents and applying a traditional topic discovery algorithm on the aggregated data we can efficiently discover spatially distinct topics. By looking at topic discovery through matrix factorization lenses we show that spatial aggregation allows low rank approximation of the original document-word matrix, in which spatially distinct topics are preserved and non-spatial topics are aggregated into a single topic. Our experiments on synthetic data confirm this observation. Our experiments on 4.7 million tweets collected during the Sandy Hurricane in 2012 show that spatial and temporal aggregation allows rapid discovery of relevant spatial and temporal topics during that period. Our work indicates that different forms of document aggregation might be effective in rapid discovery of various types of distinct topics from large collections of documents.- Anthology ID:
- P19-1025
- Volume:
- Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
- Month:
- July
- Year:
- 2019
- Address:
- Florence, Italy
- Editors:
- Anna Korhonen, David Traum, Lluís Màrquez
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 252–262
- Language:
- URL:
- https://aclanthology.org/P19-1025
- DOI:
- 10.18653/v1/P19-1025
- Cite (ACL):
- Aniruddha Maiti and Slobodan Vucetic. 2019. Spatial Aggregation Facilitates Discovery of Spatial Topics. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 252–262, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Spatial Aggregation Facilitates Discovery of Spatial Topics (Maiti & Vucetic, ACL 2019)
- PDF:
- https://preview.aclanthology.org/naacl-24-ws-corrections/P19-1025.pdf