@inproceedings{el-baff-etal-2025-criticalbrew,
title = "{C}ritical{B}rew at {CQ}s-Gen 2025: Collaborative Multi-Agent Generation and Evaluation of Critical Questions for Arguments",
author = "El Baff, Roxanne and
Opitz, Dominik and
Diallo, Diaoul{\'e}",
editor = "Chistova, Elena and
Cimiano, Philipp and
Haddadan, Shohreh and
Lapesa, Gabriella and
Ruiz-Dolz, Ramon",
booktitle = "Proceedings of the 12th Argument mining Workshop",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/landing_page/2025.argmining-1.30/",
doi = "10.18653/v1/2025.argmining-1.30",
pages = "314--321",
ISBN = "979-8-89176-258-9",
abstract = "This paper presents the \textit{CriticalBrew} submission to the CQs-Gen 2025 shared task, which focuses on generating critical questions (CQs) for a given argument. Our approach employs a multi-agent framework containing two sequential components: 1) \textbf{Generation}: machine society simulation for generating CQs and 2) \textbf{Evaluation}: LLM-based evaluation for selecting the top three questions. The first models collaboration as a sequence of thinking patterns (e.g., \textit{debate} {\textrightarrow} \textit{reflect}). The second assesses the generated questions using zero-shot prompting, evaluating them against several criteria (e.g., depth). Experiments with different open-weight LLMs (small vs. large) consistently outperformed the baseline, a single LLM with zero-shot prompting. Two configurations, agent count and thinking patterns, significantly impacted the performance in the shared task{'}s CQ-usefulness evaluation, whereas different LLM-based evaluation strategies (e.g., scoring) had no impact. Our code is available on GitHub."
}
Markdown (Informal)
[CriticalBrew at CQs-Gen 2025: Collaborative Multi-Agent Generation and Evaluation of Critical Questions for Arguments](https://preview.aclanthology.org/landing_page/2025.argmining-1.30/) (El Baff et al., ArgMining 2025)
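The abstract describes a two-stage pipeline: multi-agent generation of critical questions via simulated collaboration (thinking patterns such as debate → reflect), followed by zero-shot LLM evaluation that keeps the top three questions. The sketch below is a minimal illustration of that idea, not the authors' implementation; the `llm` callable, prompt wording, thinking-pattern list, and evaluation criteria beyond "depth" are assumptions made for illustration.

```python
"""Illustrative sketch of a two-stage CQ pipeline: multi-agent generation,
then LLM-based scoring to select the top three questions. All prompts,
names, and criteria here are hypothetical, not the CriticalBrew code."""

from typing import Callable

# An LLM is abstracted as "prompt in, text out" so the sketch works with any
# backend (assumption; the paper uses open-weight LLMs of different sizes).
LLM = Callable[[str], str]

THINKING_PATTERNS = ["debate", "reflect"]        # e.g., debate -> reflect (from the abstract)
CRITERIA = ["depth", "relevance", "clarity"]     # "depth" is from the abstract; the rest are assumed


def generate_cqs(argument: str, llm: LLM, n_agents: int = 3) -> list[str]:
    """Stage 1: simulate a small 'machine society' whose agents draft critical
    questions and revise them through a sequence of thinking patterns."""
    drafts = [
        llm(f"Agent {i}: pose one critical question about this argument:\n{argument}")
        for i in range(n_agents)
    ]
    for pattern in THINKING_PATTERNS:
        drafts = [
            llm(f"Apply the '{pattern}' pattern to improve this question.\n"
                f"Argument: {argument}\nQuestion: {draft}")
            for draft in drafts
        ]
    return drafts


def select_top_three(argument: str, questions: list[str], llm: LLM) -> list[str]:
    """Stage 2: zero-shot LLM scoring against the criteria; keep the best three."""
    def score(question: str) -> float:
        total = 0.0
        for criterion in CRITERIA:
            reply = llm(f"Rate the {criterion} of this critical question for the "
                        f"argument on a 1-5 scale. Reply with a number only.\n"
                        f"Argument: {argument}\nQuestion: {question}")
            try:
                total += float(reply.strip())
            except ValueError:
                pass  # ignore unparsable ratings in this sketch
        return total

    return sorted(questions, key=score, reverse=True)[:3]
```

Usage would amount to `select_top_three(arg, generate_cqs(arg, llm), llm)` with any `llm` function wrapping a local or hosted model; agent count and the pattern sequence are the two knobs the abstract reports as most influential.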