Graph Explorer: Training Faithful KG Agents with Visibility-Grounded Supervision

Yifeng Chen; Sicheng Wan; Tianyi Zhang; Xuezhou Zhang

Graph Explorer: Training Faithful KG Agents with Visibility-Grounded Supervision

Yifeng Chen, Sicheng Wan, Tianyi Zhang, Xuezhou Zhang

Abstract

Large language models (LLMs) are strong reasoners but still hallucinate and make unreliable decisions on knowledge-intensive questions. Knowledge graphs (KGs) provide explicit, auditable facts, motivating KGQA agents that interact with KGs via tool calls to reduce hallucinations. However, LLM agents often struggle to reliably manipulate KG-specific symbols (entity IDs and relation names), leading to invalid or hallucinated tool-call arguments, and high-quality step-by-step supervision for such tool use is scarce. Meanwhile, large datasets of expert SPARQL programs exist for Freebase KGQA, but naively converting them into action supervision is brittle: SPARQL assumes a global view of the KG, while an agent acts from a truncated, local prompt, so expert steps can reference KG IDs (entity/relation/attribute symbols) that are not visible at decision time. We present Graph Explorer, a fully automatic data synthesis pipeline that turns expert SPARQL into executable, visibility-grounded (actions may use only IDs shown in the prompt) tool supervision without manual trace labeling. Graph Explorer compiles SPARQL into tool-call plans, executes them under the same context-control policy used at inference, and retains only tool-interaction traces whose tool-call arguments are visible at decision time, yielding clean (context, next-action) pairs for action-centric fine-tuning. We evaluate with a strict finish-or-fail protocol (success only if the agent issues a valid within budget). Under this protocol, our fine-tuned Qwen3-8B reaches 74.0/80.2 Hit@1 on CWQ/WebQSP, improving over a reproduced prompting baseline by +22.5/+16.2 points, indicating more faithful multi-step graph exploration from visible evidence.

Anthology ID:: 2026.findings-acl.387
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 7875–7890
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.387/
DOI:
Bibkey:
Cite (ACL):: Yifeng Chen, Sicheng Wan, Tianyi Zhang, and Xuezhou Zhang. 2026. Graph Explorer: Training Faithful KG Agents with Visibility-Grounded Supervision. In Findings of the Association for Computational Linguistics: ACL 2026, pages 7875–7890, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Graph Explorer: Training Faithful KG Agents with Visibility-Grounded Supervision (Chen et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.387.pdf
Checklist:: 2026.findings-acl.387.checklist.pdf

PDF Cite Search Checklist Fix data