MasonTigers at SemEval-2024 Task 9: Solving Puzzles with an Ensemble of Chain-of-Thought Prompts

Nishat Raihan; Dhiman Goswami; Al Nahian Bin Emran; Sadiya Sayara Chowdhury Puspo; Amrita Ganguly; Marcos Zampieri

Abstract

Our paper presents team MasonTigers' submission to SemEval-2024 Task 9, which provides a dataset of puzzles for testing natural language understanding. We employ large language models (LLMs) to solve this task through several prompting techniques. Zero-shot and few-shot prompting yield reasonably good results with proprietary LLMs, better than with open-source models. We obtain further improved results with chain-of-thought prompting, an iterative prompting method that breaks down the reasoning process step by step. We obtain our best results with an ensemble of chain-of-thought prompts, placing 2nd in the word puzzle subtask and 13th in the sentence puzzle subtask. The strong performance of prompted LLMs demonstrates their capability for complex reasoning when provided with a decomposition of the thought process. Our work sheds light on how step-wise explanatory prompts can unlock more of the knowledge encoded in the parameters of large models.
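
The abstract does not say how the answers from the individual chain-of-thought prompts are combined. As a rough illustration only, the sketch below assumes simple majority voting over multiple-choice letters. The prompt templates, the `ask_model` callable, and the `fake_model` stub are all hypothetical placeholders, not the authors' prompts or any real LLM API.

```python
from collections import Counter
from typing import Callable, Sequence


def majority_vote(answers: Sequence[str]) -> str:
    """Return the most frequent answer; ties go to the answer seen first."""
    if not answers:
        raise ValueError("need at least one answer to vote on")
    counts = Counter(answers)
    top = max(counts.values())
    return next(a for a in answers if counts[a] == top)


def ensemble_answer(
    question: str,
    choices: Sequence[str],
    templates: Sequence[str],
    ask_model: Callable[[str], str],
) -> str:
    """Query the model once per prompt template and vote on the letters returned."""
    # Label the choices A, B, C, ... so the model can answer with one letter.
    options = "\n".join(f"{chr(65 + i)}. {c}" for i, c in enumerate(choices))
    votes = [
        ask_model(t.format(question=question, options=options)).strip().upper()
        for t in templates
    ]
    return majority_vote(votes)


# Hypothetical chain-of-thought templates (assumed wording, for illustration).
TEMPLATES = [
    "Think step by step.\n{question}\n{options}\nAnswer with one letter.",
    "Let's reason step by step about hidden assumptions.\n{question}\n{options}\nAnswer with one letter.",
    "{question}\n{options}\nAnswer with one letter.",
]


def fake_model(prompt: str) -> str:
    """Stand-in for an LLM call so the sketch runs offline."""
    return "B" if "step by step" in prompt else "A"


if __name__ == "__main__":
    puzzle = "What has keys but cannot open locks?"
    print(ensemble_answer(puzzle, ["A door", "A piano"], TEMPLATES, fake_model))
```

In practice `ask_model` would wrap a real LLM call and parse the final answer letter out of the generated reasoning; that parsing step is omitted here.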

Anthology ID: 2024.semeval-1.196
Volume: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
Month: June
Year: 2024
Address: Mexico City, Mexico
Editors: Atul Kr. Ojha, A. Seza Doğruöz, Harish Tayyar Madabushi, Giovanni Da San Martino, Sara Rosenthal, Aiala Rosá
Venue: SemEval
SIG: SIGLEX
Publisher: Association for Computational Linguistics
Pages: 1358–1363
URL: https://aclanthology.org/2024.semeval-1.196
Cite (ACL): Nishat Raihan, Dhiman Goswami, Al Nahian Bin Emran, Sadiya Sayara Chowdhury Puspo, Amrita Ganguly, and Marcos Zampieri. 2024. MasonTigers at SemEval-2024 Task 9: Solving Puzzles with an Ensemble of Chain-of-Thought Prompts. In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pages 1358–1363, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal): MasonTigers at SemEval-2024 Task 9: Solving Puzzles with an Ensemble of Chain-of-Thought Prompts (Raihan et al., SemEval 2024)
PDF: https://preview.aclanthology.org/bionlp-24-ingestion/2024.semeval-1.196.pdf
Supplementary material: 2024.semeval-1.196.SupplementaryMaterial.zip
Supplementary material: 2024.semeval-1.196.SupplementaryMaterial.txt
