Bertram Højer
Also published as: Bertram H{\o}jer
2026
Language Models Learn Universal Representations of Numbers and Here’s Why You Should Care
Michal \v{S}tef\'anik | Timothee Mickus | Marek Kadl\v{c}{\'\i}k | Bertram H{\o}jer | Michal Spiegel | Ra\'ul V\'azquez | Aman Sinha | Josef Kucha\v{r} | Philipp Mondorf | Pontus Stenetorp
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Michal \v{S}tef\'anik | Timothee Mickus | Marek Kadl\v{c}{\'\i}k | Bertram H{\o}jer | Michal Spiegel | Ra\'ul V\'azquez | Aman Sinha | Josef Kucha\v{r} | Philipp Mondorf | Pontus Stenetorp
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Prior work has shown that large language models (LLMs) often converge to accurate input embedding for numbers, based on sinusoidal representations.In this work, we demonstrate that these representations are in fact strikingly systematic, to the point of being almost perfectly universal: different LLM families develop equivalent sinusoidal structures, and number representations are broadly interchangeable in a large swathe of experimental setups.We show that properly factoring in this characteristic is crucial when it comes to assessing how accurately LLMs encode numeric and other ordinal information, and that mechanistically enhancing this sinusoidality can also lead to reductions of LLMs’ arithmetic errors.
2025
Research Community Perspectives on “Intelligence” and Large Language Models
Bertram Højer | Terne Sasha Thorn Jakobsen | Anna Rogers | Stefan Heinrich
Findings of the Association for Computational Linguistics: ACL 2025
Bertram Højer | Terne Sasha Thorn Jakobsen | Anna Rogers | Stefan Heinrich
Findings of the Association for Computational Linguistics: ACL 2025
Despite the widespread use of ‘artificial intelligence’ (AI) framing in Natural Language Processing (NLP) research, it is not clear what researchers mean by ”intelligence”. To that end, we present the results of a survey on the notion of ”intelligence” among researchers and its role in the research agenda. The survey elicited complete responses from 303 researchers from a variety of fields including NLP, Machine Learning (ML), Cognitive Science, Linguistics, and Neuroscience.We identify 3 criteria of intelligence that the community agrees on the most: generalization, adaptability, & reasoning.Our results suggests that the perception of the current NLP systems as ”intelligent” is a minority position (29%).Furthermore, only 16.2% of the respondents see developing intelligent systems as a research goal, and these respondents are more likely to consider the current systems intelligent.