Dialz: A Python Toolkit for Steering Vectors

Zara Siddique, Liam Turner, Luis Espinosa-Anke


Abstract
We introduce *Dialz*, a Python library for advancing research on steering vectors for open-source LMs. Steering vectors allow users to modify activations at inference time to amplify or weaken a ‘concept’, e.g. honesty or positivity, providing a more powerful alternative to prompting or fine-tuning. Dialz supports a diverse set of tasks, including creating contrastive pair datasets, computing and applying steering vectors, and visualizations. Unlike existing libraries, Dialz emphasizes modularity and usability, enabling both rapid prototyping and in-depth analysis. We demonstrate how Dialz can be used to reduce harmful outputs such as stereotypes, while also providing insights into model behaviour across different layers. We release Dialz with full documentation, tutorials, and support for popular open-source models to encourage further research in safe and controllable language generation. Dialz enables faster research cycles and facilitates insights into model interpretability, paving the way for safer, more transparent, and more reliable AI systems.
Anthology ID:
2025.acl-demo.35
Volume:
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Pushkar Mishra, Smaranda Muresan, Tao Yu
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
363–375
Language:
URL:
https://preview.aclanthology.org/ingestion-acl-25/2025.acl-demo.35/
DOI:
Bibkey:
Cite (ACL):
Zara Siddique, Liam Turner, and Luis Espinosa-Anke. 2025. Dialz: A Python Toolkit for Steering Vectors. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 363–375, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Dialz: A Python Toolkit for Steering Vectors (Siddique et al., ACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-acl-25/2025.acl-demo.35.pdf
Copyright agreement:
 2025.acl-demo.35.copyright_agreement.pdf