Topic Modeling in Brazilian Portuguese Documents on Antimicrobial Resistance

Enrique Reis Susin, Lilian Berton


Abstract
This study analyzes texts from multiple sources, including social media and news portals, to observe how different sectors of Brazilian society discuss the antimicrobial resistance. The main goal is to support epidemiological surveillance and public policy decisions through computational tools. Three datasets were used: tweets collected between 2008 and 2025 (64,225 documents), news articles from G1 (4,363 documents), and official government publications (.gov.br, 1,515 documents). These sources enable comparative analysis between informal discourse (social media) and institutional or journalistic discourse (official and media outlets). The study applies and compares topic modeling techniques, particularly those designed for Short Text Topic Modeling (STTM), such as GSDMM and BERTopic, to identify discursive trends, semantic patterns, and emerging topics related to antimicrobial resistance. By exploring these distinct contexts, this work demonstrates the potential of Natural Language Processing (NLP) and AI methods as instruments for integrated analysis of public health data in both informal and formal environments.
Anthology ID:
2026.propor-1.10
Volume:
Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Month:
April
Year:
2026
Address:
Salvador, Brazil
Editors:
Marlo Souza, Iria de-Dios-Flores, Diana Santos, Larissa Freitas, Jackson Wilke da Cruz Souza, Eugénio Ribeiro
Venue:
PROPOR
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
103–110
Language:
URL:
https://preview.aclanthology.org/ingest-dnd/2026.propor-1.10/
DOI:
Bibkey:
Cite (ACL):
Enrique Reis Susin and Lilian Berton. 2026. Topic Modeling in Brazilian Portuguese Documents on Antimicrobial Resistance. In Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1, pages 103–110, Salvador, Brazil. Association for Computational Linguistics.
Cite (Informal):
Topic Modeling in Brazilian Portuguese Documents on Antimicrobial Resistance (Susin & Berton, PROPOR 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-dnd/2026.propor-1.10.pdf