SAG: Enhancing Domain-Specific Information Retrieval with Semantic-Augmented Graphs

Carol-Luca Gasan, Vasile Pais


Abstract
Retrieval-Augmented Generation (RAG) systems rely on high-quality embeddings to retrieve relevant context for large language models. This paper introduces the Semantic-Augmented Graph (SAG), a new architecture that improves domain-specific embeddings by capturing hierarchical semantic relationships between text segments. Inspired by human information processing, SAG organizes content from general to specific concepts using a graph-based structure. By combining static embeddings with dynamic semantic graphs, it generates context-aware representations that reflect both lexical and conceptual links. Experiments on text similarity and domain-specific question answering show that SAG consistently outperforms standard embedding methods within RAG pipelines.
Anthology ID:
2025.starsem-1.29
Volume:
Proceedings of the 14th Joint Conference on Lexical and Computational Semantics (*SEM 2025)
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Lea Frermann, Mark Stevenson
Venue:
*SEM
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
362–371
Language:
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.starsem-1.29/
DOI:
Bibkey:
Cite (ACL):
Carol-Luca Gasan and Vasile Pais. 2025. SAG: Enhancing Domain-Specific Information Retrieval with Semantic-Augmented Graphs. In Proceedings of the 14th Joint Conference on Lexical and Computational Semantics (*SEM 2025), pages 362–371, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
SAG: Enhancing Domain-Specific Information Retrieval with Semantic-Augmented Graphs (Gasan & Pais, *SEM 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.starsem-1.29.pdf