Evaluating Retrieval Augmented Generation to Communicate UK Climate Change Information

Arjun Biswas, Hatim Chahout, Tristan Pigram, Hang Dong, Hywel T.p. Williams, Fai Fung, Hailun Xie


Abstract
There is a huge demand for information about climate change across all sectors as societies seek to mitigate and adapt to its impacts. However, the volume and complexity of climate information, which takes many formats including numerical, text, and tabular data, can make good information hard to access. Here we use Large Language Models (LLMs) and Retrieval Augmented Generation (RAG) to create an AI agent that provides accurate and complete information from the United Kingdom Climate Projections 2018 (UKCP18) data archive. To overcome the problematic hallucinations associated with LLMs, four phases of experiments were performed to optimize different components of our RAG framework, combining various recent retrieval strategies. Performance was evaluated using three statistical metrics (faithfulness, relevance, coverage) as well as human evaluation by subject matter experts. Results show that the best model significantly outperforms a generic LLM (GPT-3.5) and has high-quality outputs with positive ratings by human experts. The UKCP Chatbot developed here will enable access at scale to the UKCP18 climate archives, offering an important case study of using RAG-based LLM systems to communicate climate information.
Anthology ID:
2025.climatenlp-1.9
Volume:
Proceedings of the 2nd Workshop on Natural Language Processing Meets Climate Change (ClimateNLP 2025)
Month:
July
Year:
2025
Address:
Bangkok, Thailand
Editors:
Kalyan Dutia, Peter Henderson, Markus Leippold, Christoper Manning, Gaku Morio, Veruska Muccione, Jingwei Ni, Tobias Schimanski, Dominik Stammbach, Alok Singh, Alba (Ruiran) Su, Saeid A. Vaghefi
Venues:
ClimateNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
126–141
Language:
URL:
https://preview.aclanthology.org/display_plenaries/2025.climatenlp-1.9/
DOI:
Bibkey:
Cite (ACL):
Arjun Biswas, Hatim Chahout, Tristan Pigram, Hang Dong, Hywel T.p. Williams, Fai Fung, and Hailun Xie. 2025. Evaluating Retrieval Augmented Generation to Communicate UK Climate Change Information. In Proceedings of the 2nd Workshop on Natural Language Processing Meets Climate Change (ClimateNLP 2025), pages 126–141, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Evaluating Retrieval Augmented Generation to Communicate UK Climate Change Information (Biswas et al., ClimateNLP 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/display_plenaries/2025.climatenlp-1.9.pdf