PO-KGQA: Preference Optimization for Low-Resource Complex Knowledge Graph Question Answering

Prerna Agarwal, Ayushman Kumar Singh, Srikanta Bedathur


Abstract
Existing low-resource, in-context-learning-based knowledge graph question answering (KGQA) methods rely heavily on large language models (LLMs) to convert a natural language question into its corresponding logical form (LF), such as SPARQL or KoPL. Recently, a few alignment techniques have been introduced that enable instruction-based fine-tuning of language models. These techniques use preference optimization to provide explicit negative signals and comparative objectives, so that the model learns to avoid the negative examples. Exploring such fine-tuning techniques with LLMs is very challenging because of their high computational resource requirements. As a result, the focus has shifted towards Small Language Models (SLMs), which offer advantages such as ease of (i) deployment for practical applications and (ii) instruction fine-tuning for specialized tasks. Motivated by this, we propose PO-KGQA, an SLM-based preference optimization framework for the complex KGQA task in a low-resource setting. Our extensive experiments demonstrate that PO-KGQA outperforms other fine-tuning alignment techniques on complex benchmarks such as KQA Pro by approximately 9% on average.
Anthology ID:
2026.findings-acl.2083
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
41976–41997
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2083/
Cite (ACL):
Prerna Agarwal, Ayushman Kumar Singh, and Srikanta Bedathur. 2026. PO-KGQA: Preference Optimization for Low-Resource Complex Knowledge Graph Question Answering. In Findings of the Association for Computational Linguistics: ACL 2026, pages 41976–41997, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
PO-KGQA: Preference Optimization for Low-Resource Complex Knowledge Graph Question Answering (Agarwal et al., Findings 2026)
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2083.pdf
Checklist:
 2026.findings-acl.2083.checklist.pdf