Quantifying Logical Consistency in Transformers via Query-Key Alignment

Eduard Tulchinskii, Laida Kushnareva, Anastasia Voznyuk, Andrei Andriiainen, Irina Piontkovskaya, Evgeny Burnaev, Serguei Barannikov


Abstract
Large language models (LLMs) excel at many NLP tasks, yet their multi-step logical reasoning remains unreliable. Existing solutions such as Chain-of-Thought prompting generate intermediate steps but provide no internal check of their logical coherence. In this paper, we use the “QK-score”, a lightweight metric based on query–key alignments within transformer attention heads, to evaluate the logical reasoning capabilities of LLMs. Our method automatically identifies attention heads that play a key role in distinguishing valid from invalid logical inferences, enabling efficient inference-time evaluation via a single forward pass. It reveals latent reasoning structure in LLMs and provides a scalable mechanistic alternative to ablation-based analysis. Across three benchmarks (ProntoQA-OOD, PARARULE-Plus, and MultiLogicEval) and models ranging from 1.5B to 70B parameters, the selected heads predict logical validity up to 14% better than the models' own output probabilities, and remain robust under distractors and at reasoning depths d ≤ 6.
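
To make the idea concrete, below is a minimal, illustrative sketch of how a query–key alignment score for a single attention head could be computed; the function and variable names are hypothetical and the setup is an assumption, not the authors' released implementation.

```python
import torch

def qk_score(hidden_states, W_Q, W_K, query_pos, key_pos):
    """Alignment between the query at `query_pos` (e.g. a candidate answer
    token) and the key at `key_pos` (e.g. a premise or statement token),
    for one attention head.

    hidden_states: (seq_len, d_model) residual-stream activations
    W_Q, W_K:      (d_model, d_head) per-head query/key projection matrices
    """
    q = hidden_states[query_pos] @ W_Q          # (d_head,) query vector
    k = hidden_states[key_pos] @ W_K            # (d_head,) key vector
    d_head = q.shape[-1]
    return (q @ k) / d_head ** 0.5              # scaled dot-product alignment

# Illustrative usage (assumed workflow): score each answer option against the
# statement tokens, pick the option with the larger alignment, and keep the
# heads whose scores best track logical validity on a held-out validation set.
```

The point of the sketch is that the score requires only the query and key activations from a single forward pass, which is what makes per-head selection cheap compared to ablation-based analysis.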
Anthology ID:
2025.emnlp-main.1785
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
Publisher:
Association for Computational Linguistics
Pages:
35184–35199
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1785/
Cite (ACL):
Eduard Tulchinskii, Laida Kushnareva, Anastasia Voznyuk, Andrei Andriiainen, Irina Piontkovskaya, Evgeny Burnaev, and Serguei Barannikov. 2025. Quantifying Logical Consistency in Transformers via Query-Key Alignment. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 35184–35199, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Quantifying Logical Consistency in Transformers via Query-Key Alignment (Tulchinskii et al., EMNLP 2025)
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1785.pdf
Checklist:
 2025.emnlp-main.1785.checklist.pdf