NeuRAG: End-to-End Neural Knowledge Augmentation via Hyper-Neurons

Liwei Zheng, Xuemin Liu, Jie Liu


Abstract
Retrieval-Augmented Generation (RAG) systems have become a standard approach for grounding large language models in external knowledge. However, they are constrained by a decoupled architecture: retrieval and reasoning operate as separate stages, with retrieved text merely prepended as passive context. This prevents deep integration of knowledge into the model’s parametric reasoning, leading to fragmented responses for complex queries requiring multi-document synthesis or conflict resolution. To bridge this gap, we propose NeuRAG, an end-to-end Neuralized RAG framework that unifies knowledge retrieval and fusion through Hyper-Neurons—parameterized modules encoding entire documents directly into the model’s parameter space. In NeuRAG, each document is encoded as a lightweight LoRA module, conceptualized as a knowledge neuron. These neurons collectively form a document-adaptive Hyper-Layer, which dynamically activates and fuses knowledge neurons via an attention mechanism conditioned on the input hidden-state query. This enables the model to jointly retrieve and reason within a single forward pass, seamlessly integrating external knowledge into its inference pathway. Extensive experiments across multiple datasets and LLMs demonstrate NeuRAG’s strong and consistent performance as a promising novel RAG paradigm.
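The abstract describes the Hyper-Layer only at a high level, so the following is a minimal sketch of one way attention-weighted fusion of per-document LoRA modules could look, not the authors' implementation. Several details are assumptions that the abstract does not state: each knowledge neuron carries a hypothetical `key` vector, scores are scaled dot products between the hidden state and those keys, the weights come from a softmax, and the fused low-rank updates are added to a frozen base projection. The names `hyper_layer`, `neurons`, `key`, `A`, and `B` are illustrative.

```python
import math

def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def hyper_layer(h, base_w, neurons):
    """Fuse per-document LoRA updates, weighted by attention on h.

    h       : hidden-state query, length d
    base_w  : frozen base projection, d x d
    neurons : one dict per document with hypothetical fields
              'key' (length d), 'A' (r x d), 'B' (d x r)
    """
    d = len(h)
    # Assumed scoring rule: scaled dot product between h and each key.
    scores = [sum(x * k for x, k in zip(h, n["key"])) / math.sqrt(d)
              for n in neurons]
    weights = softmax(scores)
    out = matvec(base_w, h)
    for w, n in zip(weights, neurons):
        # Low-rank update for this document: B (A h).
        delta = matvec(n["B"], matvec(n["A"], h))
        out = [o + w * dl for o, dl in zip(out, delta)]
    return out, weights

# Toy usage: two documents, hidden size 2, LoRA rank 1.
h = [1.0, 0.0]
base_w = [[1.0, 0.0], [0.0, 1.0]]
neurons = [
    {"key": [4.0, 0.0], "A": [[1.0, 0.0]], "B": [[0.0], [1.0]]},
    {"key": [-4.0, 0.0], "A": [[1.0, 0.0]], "B": [[0.0], [-1.0]]},
]
out, weights = hyper_layer(h, base_w, neurons)
```

In this toy setup the first document's key aligns with the hidden state, so it receives nearly all of the attention weight and its update dominates the fused output. Retrieval (the weighting) and fusion (the summed update) happen in the same forward computation, which is the property the abstract emphasizes.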
Anthology ID:
2026.findings-acl.1516
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
30324–30343
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.findings-acl.1516/
Cite (ACL):
Liwei Zheng, Xuemin Liu, and Jie Liu. 2026. NeuRAG: End-to-End Neural Knowledge Augmentation via Hyper-Neurons. In Findings of the Association for Computational Linguistics: ACL 2026, pages 30324–30343, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
NeuRAG: End-to-End Neural Knowledge Augmentation via Hyper-Neurons (Zheng et al., Findings 2026)
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.findings-acl.1516.pdf
Checklist:
2026.findings-acl.1516.checklist.pdf