Abstract
This paper addresses the problem of community membership detection using only text features in a scenario where a small number of positive labeled examples defines the community. The solution introduces an unsupervised proxy task for learning user embeddings: user re-identification. Experiments with 16 different communities show that the resulting embeddings are more effective for community membership identification than common unsupervised representations.- Anthology ID:
- N18-2094
- Volume:
- Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers)
- Month:
- June
- Year:
- 2018
- Address:
- New Orleans, Louisiana
- Editors:
- Marilyn Walker, Heng Ji, Amanda Stent
- Venue:
- NAACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 595–601
- Language:
- URL:
- https://aclanthology.org/N18-2094
- DOI:
- 10.18653/v1/N18-2094
- Cite (ACL):
- Aaron Jaech, Shobhit Hathi, and Mari Ostendorf. 2018. Community Member Retrieval on Social Media Using Textual Information. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), pages 595–601, New Orleans, Louisiana. Association for Computational Linguistics.
- Cite (Informal):
- Community Member Retrieval on Social Media Using Textual Information (Jaech et al., NAACL 2018)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-1/N18-2094.pdf
- Code
- ajaech/twittercommunities