@inproceedings{khubaib-etal-2025-oath,
title = "Oath Breakers at {S}em{E}val-2025 Task 06: {P}romise{E}val",
author = "Khubaib, Muhammad and
Aijaz, Owais and
Enayat, Ayesha",
editor = "Rosenthal, Sara and
Ros{\'a}, Aiala and
Ghosh, Debanjan and
Zampieri, Marcos",
booktitle = "Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.225/",
pages = "1718--1723",
ISBN = "979-8-89176-273-2",
abstract = "SemEval Task 6: Promise Eval, was designed to evaluate a company{'}s adherence to its ESG commitments. Using Natural Language Processing (NLP) and Deep Learning techniques, the task involves analyzing ESG reports to identify, classify, and verify corporate promises. The verification process follows a structured pipeline with four subtasks: Promise Classification, Evidence Verification, Evidence Classification, and Timeline Verification. These subtasks ensure that identified promises are well-defined, supported by credible evidence, and time-bound.For model implementation, BERT was initially used for most of the classification tasks but was later replaced with DeBERTa, which improved performance due to its superior contextual understanding. To enhance model generalization, contrastive learning was applied alongside standard classification loss, helping the model differentiate between positive and negative examples. Oversampling techniques were used to address class imbalance issues, particularly for the Misleading evidence category. For timeline verification, BART was chosen initially but then shifted to DeBERTa again, as it better captures sequential dependencies in text.The dataset consists of ESG reports containing labeled promise statements, evidence snippets, and timeline information. The data was preprocessed by tokenizing text, handling imbalanced classes through oversampling, and incorporating domain-specific embeddings to improve understanding.By implementing these techniques, the research aims to provide a transparent and accountable framework for assessing corporate promises, ensuring that companies are held accountable for their ESG commitments."
}
Markdown (Informal)
[Oath Breakers at SemEval-2025 Task 06: PromiseEval](https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.225/) (Khubaib et al., SemEval 2025)
ACL
- Muhammad Khubaib, Owais Aijaz, and Ayesha Enayat. 2025. Oath Breakers at SemEval-2025 Task 06: PromiseEval. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 1718–1723, Vienna, Austria. Association for Computational Linguistics.