Interpreto: An Explainability Library for Transformers
Antonin Poché, Thomas Mullor, Gabriele Sarti, Frédéric Boisnard, Corentin Friedrich, Charlotte Claye, Francois Hoofd, Raphael Bernas, Nicholas Asher, Celine Hudelot, Fanny Jourdan
Abstract
Interpreto is an open-source Python library for interpreting HuggingFace language models, from early BERT variants to LLMs. It provides two complementary families of methods: attribution methods and concept-based explanations. The library bridges recent research and practical tooling by exposing explanation workflows through a unified API for both classification and text generation. A key differentiator is its end-to-end concept-based pipeline (from activation extraction to concept learning, interpretation, and scoring), which goes beyond feature-level attributions and is uncommon in existing libraries.- Anthology ID:
- 2026.acl-demo.1
- Volume:
- Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Greg Durrett, Ping Jian
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 1–13
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.acl-demo.1/
- DOI:
- Cite (ACL):
- Antonin Poché, Thomas Mullor, Gabriele Sarti, Frédéric Boisnard, Corentin Friedrich, Charlotte Claye, Francois Hoofd, Raphael Bernas, Nicholas Asher, Celine Hudelot, and Fanny Jourdan. 2026. Interpreto: An Explainability Library for Transformers. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 1–13, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Interpreto: An Explainability Library for Transformers (Poché et al., ACL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.acl-demo.1.pdf