How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Yixin Ou; Yunzhi Yao; Ningyu Zhang; Hui Jin; Jiacheng Sun; Shumin Deng; Zhenguo Li; Huajun Chen

doi:10.18653/v1/2025.findings-acl.1021

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Yixin Ou, Yunzhi Yao, Ningyu Zhang, Hui Jin, Jiacheng Sun, Shumin Deng, Zhenguo Li, Huajun Chen

Abstract

Despite exceptional capabilities in knowledge-intensive tasks, Large Language Models (LLMs) face a critical gap in understanding how they internalize new knowledge, particularly how acquired knowledge becomes structurally embedded in their neural computations. We address this issue through the lens of knowledge circuit evolution, identifying computational subgraphs that facilitate knowledge storage and processing. Our systematic analysis of circuit evolution throughout continual pre-training reveals several key findings: (1) the acquisition of new knowledge is influenced by its relevance to pre-existing knowledge; (2) the evolution of knowledge circuits exhibits a distinct phase shift from formation to optimization; (3) the evolution of knowledge circuits follows a deep-to-shallow pattern. These insights not only advance our theoretical understanding of the mechanisms of new knowledge acquisition in LLMs, but also provide potential implications for improving continual pre-training strategies to enhance model performance.

Anthology ID:: 2025.findings-acl.1021
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 19889–19913
Language:
URL:: https://preview.aclanthology.org/mtsummit-25-ingestion/2025.findings-acl.1021/
DOI:: 10.18653/v1/2025.findings-acl.1021
Bibkey:
Cite (ACL):: Yixin Ou, Yunzhi Yao, Ningyu Zhang, Hui Jin, Jiacheng Sun, Shumin Deng, Zhenguo Li, and Huajun Chen. 2025. How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training. In Findings of the Association for Computational Linguistics: ACL 2025, pages 19889–19913, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training (Ou et al., Findings 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/mtsummit-25-ingestion/2025.findings-acl.1021.pdf

PDF Cite Search Fix data