0x.Yuan at SemEval-2024 Task 2: Agents Debating can reach consensus and produce better outcomes in Medical NLI task

Yu-An Lu; Hung-Yu Kao

doi:10.18653/v1/2024.semeval-1.47

0x.Yuan at SemEval-2024 Task 2: Agents Debating can reach consensus and produce better outcomes in Medical NLI task

Abstract

In this paper, we introduce a multi-agent debating framework, experimenting on SemEval 2024 Task 2. This innovative system employs a collaborative approach involving expert agents from various medical fields to analyze Clinical Trial Reports (CTRs). Our methodology emphasizes nuanced and comprehensive analysis by leveraging the diverse expertise of agents like Biostatisticians and Medical Linguists. Results indicate that our collaborative model surpasses the performance of individual agents in terms of Macro F1-score. Additionally, our analysis suggests that while initial debates often mirror majority decisions, the debating process refines these outcomes, demonstrating the system’s capability for in-depth analysis beyond simple majority rule. This research highlights the potential of AI collaboration in specialized domains, particularly in medical text interpretation.

Anthology ID:: 2024.semeval-1.47
Volume:: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
Month:: June
Year:: 2024
Address:: Mexico City, Mexico
Editors:: Atul Kr. Ojha, A. Seza Doğruöz, Harish Tayyar Madabushi, Giovanni Da San Martino, Sara Rosenthal, Aiala Rosá
Venue:: SemEval
SIG:: SIGLEX
Publisher:: Association for Computational Linguistics
Note:
Pages:: 305–310
Language:
URL:: https://preview.aclanthology.org/fix-sig-urls/2024.semeval-1.47/
DOI:: 10.18653/v1/2024.semeval-1.47
Bibkey:
Cite (ACL):: Yu-an Lu and Hung-yu Kao. 2024. 0x.Yuan at SemEval-2024 Task 2: Agents Debating can reach consensus and produce better outcomes in Medical NLI task. In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pages 305–310, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):: 0x.Yuan at SemEval-2024 Task 2: Agents Debating can reach consensus and produce better outcomes in Medical NLI task (Lu & Kao, SemEval 2024)
Copy Citation:
PDF:: https://preview.aclanthology.org/fix-sig-urls/2024.semeval-1.47.pdf
Supplementarymaterial:: 2024.semeval-1.47.SupplementaryMaterial.txt
Supplementarymaterial:: 2024.semeval-1.47.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Supplementarymaterial Fix data