Flammie A. Pirinen

Also published as: Tommi A. Pirinen, Flammie A Pirinen, Tommi Pirinen, Tommi A Pirinen, Flammie Pirinen

2025

Proceedings of the 9th Workshop on Constraint Grammar and Finite State NLP
Trond Trosterud | Linda Wiechetek | Flammie Pirinen
Proceedings of the 9th Workshop on Constraint Grammar and Finite State NLP

pdf bib abs

Divvunspell—Finite-State Spell-Checking and Correction on Modern Platforms
Flammie A Pirinen | Sjur Nørstebø Moshagen
Proceedings of the 9th Workshop on Constraint Grammar and Finite State NLP

Spell-checking and correction is one of the key applications of natural language support. Historically, for the biggest, less morphologically complex languages, spell-checking and correction could be implemented by relatively simple means; however, for morphologically complex and low-resource languages, the solutions were often suboptimal. Finite-state methods are the state of the art in rule-based natural language processing and also for spell-checking and correction they have been effectively used. In this article, we show some recent developments of a finite-state spell-checker implementation that works with modern operating systems and platforms.

pdf bib abs

Exploring Limitations and Risks of LLM-Based Grammatical Error Correction for Indigenous Languages
Flammie A Pirinen | Linda Wiechetek
Proceedings of the Eight Workshop on the Use of Computational Methods in the Study of Endangered Languages

Rule-based grammatical error correction has long been seen as the most effective way to create user-friendly end-user systems for gram- matical error correction (GEC). However, in the recent years the large language models and generative AI systems based on that technol- ogy have been progressed fast to challenge the traditional GEC approach. In this article we show which possibilities and limitations this approach bears for Indigenous languages that have more limited digital presence in the large language model data and a different literacy background than English. We show experi- ments in North Sámi, an Indigenous language of Northern Europe.

pdf bib abs

Can advances in NLP lead to worse results for Uralic languages and how can we fight back? Experiences from the world of automatic spell-checking and correction for Finnish
Flammie A Pirinen
Proceedings of the 10th International Workshop on Computational Linguistics for Uralic Languages

Spell-checking and correction is a ubiquitous application within text input in modern technology, and in some ways or another, if you type texts on a keyboard or a mobile phone, there will probably be an underlying spelling corrector running. The spell checkers have been around for decades, initially based on dictionaries and grammar rules, nowadays increasingly based on statistical data or large language models. In recent years, however, there has been a growing concern about the quality of these modern spell-checkers. In this article, we show that the spell-checkers for Finnish have gotten significantly worse in their modern implementations compared to their traditional knowledge-driven versions. We propose that this can have critical consequences for the quality of texts produced, as well as literacy overall.We furthermore speculate if it would be possible to get spell-checking and correction back on track for Uralic languages in modern systems.

pdf bib abs

Language technology for the minority Finnic languages
Flammie A Pirinen | Trond Trosterud | Jack Rueter
Proceedings of the 10th International Workshop on Computational Linguistics for Uralic Languages

This article gives an overview of the state of the art in language technology tools for Balto-Finnic minority languages, i.e., Balto-Finnic languages other than Estonian and Finnish. For simplicity, we will use the term Finnic in this article when referring to all members of this language branch except the Estonian and Finnish literary languages. All in all, there are nine standardised languages represented in existing language technology infrastructures with keyboards, grammatical language models, proofing tools, annotated corpora and (for one of the langauges) extensive ICALL programs. This article presents these tools and resources, discusses the relation between language models and proofing tool quality, as well as the (potential) impact of these tools on the respective language communities. The article rounds off with a discussion on prospects for future development.

Flammie A. Pirinen

2025

2024

2023

2022

2021

2020

2019

2018

2017

2015

2014

2013

2012

2011

2009

Co-authors

Venues