Creating ConLangs to Probe the Metalinguistic Grammatical Knowledge of LLMs

Chihiro Taguchi, Richard Sproat


Abstract
We present IASC (Interactive Agentic System for ConLangs), a system that uses LLMs as a tool in the development of constructed languages (ConLangs). The system is modular in that it creates each of the components (phonology, morphology and syntax, lexicon, orthography, and grammatical handbook) using module-specific sets of prompts. The approach is agentic in that various modules allow for refining the output given automatically generated commentary on a previous step. Our goals are twofold. First, we aim to provide tools that facilitate an engaging and enjoyable experience in creating artificially constructed languages. Second, and this is the focus of the paper, we use our ConLang framework as a novel way to explore what LLMs ‘know’ about language: not what they know about any particular language or about encyclopedic facts, but how much they know about and understand language and linguistic concepts. In the experiments, we focus on the morphosyntax module and show that there is a fairly wide gulf in capabilities both among different LLMs and among different linguistic specifications: systems handle typologically common patterns notably more easily than rarer ones. All code is released: https://github.com/SakanaAI/IASC.
Anthology ID:
2026.findings-acl.1455
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
29096–29148
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1455/
Cite (ACL):
Chihiro Taguchi and Richard Sproat. 2026. Creating ConLangs to Probe the Metalinguistic Grammatical Knowledge of LLMs. In Findings of the Association for Computational Linguistics: ACL 2026, pages 29096–29148, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Creating ConLangs to Probe the Metalinguistic Grammatical Knowledge of LLMs (Taguchi & Sproat, Findings 2026)
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1455.pdf
Checklist:
 2026.findings-acl.1455.checklist.pdf