A grammatical analyser for Tokelau

Trond Trosterud, Arnfinn Muruvik Vonen


Abstract
This article will present a grammatical aunalyser, disambiguator and dependency analysis of Tokelau. The grammatical analyser is written as a finite-state transducer (FST), whereas the disambiguator and dependency analyser are written in Constraint Grammar (CG), both within the GiellaLT infrastructure. Contrary to most languages analyzed within this framework, Being a Polynesian language, Tokelau is a predominantly isolating language, with reduplication and affixation as the main morphological processes. The article will discuss how FST and CG deal with Polynesian languages.
Anthology ID:
2025.cgmta-1.6
Volume:
Proceedings of the 9th Workshop on Constraint Grammar and Finite State NLP
Month:
march
Year:
2025
Address:
Tallinn, Estonia
Editors:
Trond Trosterud, Linda Wiechetek, Flammie Pirinen
Venues:
cgmta | WS
SIG:
Publisher:
University of Tartu Library
Note:
Pages:
38–44
Language:
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.cgmta-1.6/
DOI:
Bibkey:
Cite (ACL):
Trond Trosterud and Arnfinn Muruvik Vonen. 2025. A grammatical analyser for Tokelau. In Proceedings of the 9th Workshop on Constraint Grammar and Finite State NLP, pages 38–44, Tallinn, Estonia. University of Tartu Library.
Cite (Informal):
A grammatical analyser for Tokelau (Trosterud & Vonen, cgmta 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.cgmta-1.6.pdf