Abstract
The large scale usage of social media, combined with its significant impact, has made it increasingly important to understand it. In particular, identifying user communities, can be helpful for many downstream tasks. However, particularly when models are trained on past data and tested on future, doing this is difficult.In this paper, we hypothesize to take advantage of Large Language Models (LLMs), to better identify user communities. Due to the fact that many LLMs, such as ChatGPT, are fixed and must be treated as black-boxes, we propose an approach to better prompt them, by training a smaller LLM to do this. We devise strategies to train this smaller model, showing how it can improve the larger LLMs ability to detect communities. Experimental results show improvements on Reddit and Twitter data, and the tasks of community detection, bot detection, and news media profiling.- Anthology ID:
- 2024.findings-emnlp.309
- Volume:
- Findings of the Association for Computational Linguistics: EMNLP 2024
- Month:
- November
- Year:
- 2024
- Address:
- Miami, Florida, USA
- Editors:
- Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 5371–5390
- Language:
- URL:
- https://preview.aclanthology.org/build-pipeline-with-new-library/2024.findings-emnlp.309/
- DOI:
- 10.18653/v1/2024.findings-emnlp.309
- Cite (ACL):
- Nikhil Mehta and Dan Goldwasser. 2024. Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 5371–5390, Miami, Florida, USA. Association for Computational Linguistics.
- Cite (Informal):
- Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media (Mehta & Goldwasser, Findings 2024)
- PDF:
- https://preview.aclanthology.org/build-pipeline-with-new-library/2024.findings-emnlp.309.pdf