Formal Basis of a Language Universal

Miloš Stanojević, Mark Steedman


Abstract
Abstract Steedman (2020) proposes as a formal universal of natural language grammar that grammatical permutations of the kind that have given rise to transformational rules are limited to a class known to mathematicians and computer scientists as the “separable” permutations. This class of permutations is exactly the class that can be expressed in combinatory categorial grammars (CCGs). The excluded non-separable permutations do in fact seem to be absent in a number of studies of crosslinguistic variation in word order in nominal and verbal constructions. The number of permutations that are separable grows in the number n of lexical elements in the construction as the Large Schröder Number Sn−1. Because that number grows much more slowly than the n! number of all permutations, this generalization is also of considerable practical interest for computational applications such as parsing and machine translation. The present article examines the mathematical and computational origins of this restriction, and the reason it is exactly captured in CCG without the imposition of any further constraints.
Anthology ID:
2021.cl-1.2
Volume:
Computational Linguistics, Volume 47, Issue 1 - March 2021
Month:
March
Year:
2021
Address:
Cambridge, MA
Venue:
CL
SIG:
Publisher:
MIT Press
Note:
Pages:
9–42
Language:
URL:
https://aclanthology.org/2021.cl-1.2
DOI:
10.1162/coli_a_00394
Bibkey:
Cite (ACL):
Miloš Stanojević and Mark Steedman. 2021. Formal Basis of a Language Universal. Computational Linguistics, 47(1):9–42.
Cite (Informal):
Formal Basis of a Language Universal (Stanojević & Steedman, CL 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2021.cl-1.2.pdf
Video:
 https://preview.aclanthology.org/auto-file-uploads/2021.cl-1.2.mp4