Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media

Nikhil Mehta, Dan Goldwasser


Abstract
The large scale usage of social media, combined with its significant impact, has made it increasingly important to understand it. In particular, identifying user communities, can be helpful for many downstream tasks. However, particularly when models are trained on past data and tested on future, doing this is difficult.In this paper, we hypothesize to take advantage of Large Language Models (LLMs), to better identify user communities. Due to the fact that many LLMs, such as ChatGPT, are fixed and must be treated as black-boxes, we propose an approach to better prompt them, by training a smaller LLM to do this. We devise strategies to train this smaller model, showing how it can improve the larger LLMs ability to detect communities. Experimental results show improvements on Reddit and Twitter data, and the tasks of community detection, bot detection, and news media profiling.
Anthology ID:
2024.findings-emnlp.309
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2024
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
5371–5390
Language:
URL:
https://preview.aclanthology.org/build-pipeline-with-new-library/2024.findings-emnlp.309/
DOI:
10.18653/v1/2024.findings-emnlp.309
Bibkey:
Cite (ACL):
Nikhil Mehta and Dan Goldwasser. 2024. Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 5371–5390, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Using RL to Identify Divisive Perspectives Improves LLMs Abilities to Identify Communities on Social Media (Mehta & Goldwasser, Findings 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/build-pipeline-with-new-library/2024.findings-emnlp.309.pdf
Software:
 2024.findings-emnlp.309.software.zip
Data:
 2024.findings-emnlp.309.data.zip