CentaurTA: A Self-Improving Human-Agents Collaboration Framework for Thematic Analysis

Lei Wang, Min Huang, Eduard Dragut


Abstract
Qualitative analysis is essential for studying complex social and behavioral phenomena, yet existing large language model (LLM) approaches face key limitations. Fully automated pipelines often compromise methodological rigor, while fully manual coding remains costly and labor-intensive. Although recent work emphasizes human–AI collaboration, existing multi-agent systems focus primarily on theme-level outputs, provide limited human oversight, and overlook fine-grained, data-level coding quality.We introduce CentaurTA, an iterative, self-improving human–agent framework for scalable thematic analysis. CentaurTA places humans in the loop to oversee and guide analysis, using expert feedback as a persistent learning signal to drive prompt-level refinement. By combining structured human feedback with rubric-based evaluation, CentaurTA provides fine-grained supervision for both open coding and theme construction while preserving methodological rigor. Experiments across multiple datasets, baselines, and LLM families show that CentaurTA improves coding alignment and transparency, highlighting the central role of human feedback in reliable qualitative analysis. Our code and data are available at https://github.com/Tom-Owl/CentaurTA.
Anthology ID:
2026.findings-acl.778
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
15871–15884
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.778/
DOI:
Bibkey:
Cite (ACL):
Lei Wang, Min Huang, and Eduard Dragut. 2026. CentaurTA: A Self-Improving Human-Agents Collaboration Framework for Thematic Analysis. In Findings of the Association for Computational Linguistics: ACL 2026, pages 15871–15884, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
CentaurTA: A Self-Improving Human-Agents Collaboration Framework for Thematic Analysis (Wang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.778.pdf
Checklist:
 2026.findings-acl.778.checklist.pdf