Michael Hind
2026
BenchNavigator: A Discovery Interface for Comparing LLM Benchmarks
Anna Sokol | Inge Vejsbjerg | Elizabeth M. Daly | David Piorkowski | Michael Hind | Nuno Moniz | Nitesh V. Chawla
Proceedings of the Workshop on Evaluating Evaluations (EvalEval)
Evaluating large language models (LLMs) requires selecting benchmarks that fit the intended use case. However, the rapid growth of benchmarks has made discovery and comparison difficult, because practitioners must assemble information across papers, repositories, and dataset cards with heterogeneous metadata, inconsistent terminology, and uneven documentation. Prior work improves individual benchmark documentation and quality assessment, but does not provide a uniform way to compare benchmarks during discovery. We survey practitioners, analyze multi-source benchmark metadata, and identify the fields needed for effective benchmark discovery. We introduce BenchNavigator, a prototype that organizes heterogeneous metadata into a coherent, provenance-preserving interface aligned with practitioner priorities. Our results show that benchmark metadata can be presented in a comparable form without imposing new reporting burdens on benchmark producers. We frame this contribution as discovery infrastructure, not as a method for scoring benchmark quality or replacing contextual evaluation.
2025
Granite Guardian: Comprehensive LLM Safeguarding
Inkit Padhi | Manish Nagireddy | Giandomenico Cornacchia | Subhajit Chaudhury | Tejaswini Pedapati | Pierre Dognin | Keerthiram Murugesan | Erik Miehling | Martín Santillán Cooper | Kieran Fraser | Giulio Zizzo | Muhammad Zaid Hameed | Mark Purcell | Michael Desmond | Qian Pan | Inge Vejsbjerg | Elizabeth M. Daly | Michael Hind | Werner Geyer | Ambrish Rawat | Kush R. Varshney | Prasanna Sattigeri
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: Industry Track)
The deployment of language models in real-world applications exposes users to various risks, including hallucinations and harmful or unethical content. These challenges highlight the urgent need for robust safeguards to ensure safe and responsible AI. To address this, we introduce Granite Guardian, a suite of advanced models designed to detect and mitigate risks associated with prompts and responses, enabling seamless integration with any large language model (LLM). Unlike existing open-source solutions, our Granite Guardian models provide comprehensive coverage across a wide range of risk dimensions, including social bias, profanity, violence, sexual content, unethical behavior, jailbreaking, and hallucination-related issues such as context relevance, groundedness, and answer accuracy in retrieval-augmented generation (RAG) scenarios. Trained on a unique dataset combining diverse human annotations and synthetic data, Granite Guardian excels in identifying risks often overlooked by traditional detection systems, particularly jailbreak attempts and RAG-specific challenges. Repository: https://github.com/ibm-granite/granite-guardian
Co-authors
- Elizabeth M. Daly 2
- Inge Vejsbjerg 2
- Subhajit Chaudhury 1
- Nitesh V. Chawla 1
- Giandomenico Cornacchia 1
- Michael Desmond 1
- Pierre Dognin 1
- Kieran Fraser 1
- Werner Geyer 1
- Muhammad Zaid Hameed 1
- Erik Miehling 1
- Nuno Moniz 1
- Keerthiram Murugesan 1
- Manish Nagireddy 1
- Inkit Padhi 1
- Qian Pan 1
- Tejaswini Pedapati 1
- David Piorkowski 1
- Mark Purcell 1
- Ambrish Rawat 1
- Martín Santillán Cooper 1
- Prasanna Sattigeri 1
- Anna Sokol 1
- Kush R. Varshney 1
- Giulio Zizzo 1