Structure-Aware Quantized Retrieval for Long-Document Question Answering

Hui Huang, Julien Velcin, Yacine Kessaci


Abstract
Long-document question answering is challenging because relevant evidence is often scattered across distant sections. Traditional long-document QA/RAG pipelines often suffer from context fragmentation, retrieving locally plausible but structurally misaligned passages. We present the Hierarchical Quantized Document Retriever (HQDR), a framework that aligns hierarchical graph representations with a universal token vocabulary and integrates explicit structure into retrieval. By grounding continuous structural features in a fixed, discrete semantic space, HQDR captures universal hierarchical patterns rather than overfitting to specific layouts. We further propose a hybrid scoring mechanism that decouples semantic matching from structural alignment. Extensive experiments on QASPER and Natural Questions demonstrate that HQDR achieves consistent gains over strong baselines and exhibits superior robustness when transferring between datasets with distinct structural characteristics.
Anthology ID:
2026.findings-acl.209
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4291–4304
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.209/
DOI:
Bibkey:
Cite (ACL):
Hui Huang, Julien Velcin, and Yacine Kessaci. 2026. Structure-Aware Quantized Retrieval for Long-Document Question Answering. In Findings of the Association for Computational Linguistics: ACL 2026, pages 4291–4304, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Structure-Aware Quantized Retrieval for Long-Document Question Answering (Huang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.209.pdf
Checklist:
 2026.findings-acl.209.checklist.pdf