OntoGuard: Enforcing Action Admissibility for LLM Agents in Complex Interactive Environments

Pengxiang Liu; Tao Ren; Wei Xiong; Tingrui Yang; Junjie Wang; Jun HU

OntoGuard: Enforcing Action Admissibility for LLM Agents in Complex Interactive Environments

Pengxiang Liu, Tao Ren, Wei Xiong, Tingrui Yang, Junjie Wang, Jun HU

Abstract

Large Language Models (LLMs) have shown impressive reasoning capabilities in agents for complex interactive environments. However, these agents often suffer from hallucinations and lack grounding, leading to unreliable actions that conflict with real-world constraints. Existing approaches mitigate some issues through implicit imitation or sparse reinforcement learning but rely on fitting data distributions without explicitly understanding environmental constraints, often generating actions that are behaviorally distorted or environmentally impermissible. To address this, we introduce OntoGuard, an ontological framework designed to guard LLM agents by enforcing environmental and behavioral admissibility. These constraints are constructed by extracting knowledge from oracle demonstrations, supplemented with world knowledge inherent in LLMs and general knowledge bases. During inference, OntoGuard functions as an active interceptor, using a graph-based constraint-checking mechanism to reject invalid actions and prompt self-correction before acting. Experiments on both ScienceWorld and VirtualHome demonstrate OntoGuard’s advantage over state-of-the-art methods, validating its ability to enforce physical and behavioral constraints while preventing invalid actions.

Anthology ID:: 2026.findings-acl.1051
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 20937–20952
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1051/
DOI:
Bibkey:
Cite (ACL):: Pengxiang Liu, Tao Ren, Wei Xiong, Tingrui Yang, Junjie Wang, and Jun HU. 2026. OntoGuard: Enforcing Action Admissibility for LLM Agents in Complex Interactive Environments. In Findings of the Association for Computational Linguistics: ACL 2026, pages 20937–20952, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: OntoGuard: Enforcing Action Admissibility for LLM Agents in Complex Interactive Environments (Liu et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1051.pdf
Checklist:: 2026.findings-acl.1051.checklist.pdf

PDF Cite Search Checklist Fix data