Trond Trosterud


2022

pdf
Unmasking the Myth of Effortless Big Data - Making an Open Source Multi-lingual Infrastructure and Building Language Resources from Scratch
Linda Wiechetek | Katri Hiovain-Asikainen | Inga Lill Sigga Mikkelsen | Sjur Moshagen | Flammie Pirinen | Trond Trosterud | Børre Gaup
Proceedings of the Thirteenth Language Resources and Evaluation Conference

Machine learning (ML) approaches have dominated NLP during the last two decades. From machine translation and speech technology, ML tools are now also in use for spellchecking and grammar checking, with a blurry distinction between the two. We unmask the myth of effortless big data by illuminating the efforts and time that lay behind building a multi-purpose corpus with regard to collecting, mark-up and building from scratch. We also discuss what kind of language technology minority languages actually need, and to what extent the dominating paradigm has been able to deliver these tools. In this context we present our alternative to corpus-based language technology, which is knowledge-based language technology, and we show how this approach can provide language technology solutions for languages being outside the reach of machine learning procedures. We present a stable and mature infrastructure (GiellaLT) containing more than hundred languages and building a number of language technology tools that are useful for language communities.

2021

pdf
Proceedings of the Seventh International Workshop on Computational Linguistics of Uralic Languages
Flammie A Pirinen | Timofey Arhangelskiy | Trond Trosterud | Michael Rießler
Proceedings of the Seventh International Workshop on Computational Linguistics of Uralic Languages

pdf
Overview of Open-Source Morphology Development for the Komi-Zyrian Language: Past and future
Jack Rueter | Niko Partanen | Mika Hämäläinen | Trond Trosterud
Proceedings of the Seventh International Workshop on Computational Linguistics of Uralic Languages

2018

pdf
Proceedings of the Fourth International Workshop on Computational Linguistics of Uralic Languages
Tommi A. Pirinen | Michael Rießler | Jack Rueter | Trond Trosterud | Francis M. Tyers
Proceedings of the Fourth International Workshop on Computational Linguistics of Uralic Languages

pdf
Modeling Northern Haida Verb Morphology
Jordan Lachler | Lene Antonsen | Trond Trosterud | Sjur Moshagen | Antti Arppe
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

pdf
Building a Constraint Grammar Parser for Plains Cree Verbs and Arguments
Katherine Schmirler | Antti Arppe | Trond Trosterud | Lene Antonsen
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2017

pdf
A Morphological Parser for Odawa
Dustin Bowers | Antti Arppe | Jordan Lachler | Sjur Moshagen | Trond Trosterud
Proceedings of the 2nd Workshop on the Use of Computational Methods in the Study of Endangered Languages

pdf
North-Sámi to Finnish rule-based machine translation system
Tommi Pirinen | Francis M. Tyers | Trond Trosterud | Ryan Johnson | Kevin Unhammer | Tiina Puolakainen
Proceedings of the 21st Nordic Conference on Computational Linguistics

pdf
Machine translation with North Saami as a pivot language
Lene Antonsen | Ciprian Gerstenberger | Maja Kappfjell | Sandra Nystø Rahka | Marja-Liisa Olthuis | Trond Trosterud | Francis M. Tyers
Proceedings of the 21st Nordic Conference on Computational Linguistics

pdf
Proceedings of the Third Workshop on Computational Linguistics for Uralic Languages
Francis M. Tyers | Michael Rießler | Tommi A. Pirinen | Trond Trosterud
Proceedings of the Third Workshop on Computational Linguistics for Uralic Languages

pdf
A morphological analyser for Kven
Sindre Reino Trosterud | Trond Trosterud | Anna-Kaisa Räisänen | Leena Niiranen | Mervi Haavisto | Kaisa Maliniemi
Proceedings of the Third Workshop on Computational Linguistics for Uralic Languages

2014

pdf
Modeling the Noun Morphology of Plains Cree
Conor Snoek | Dorothy Thunder | Kaidi Lõo | Antti Arppe | Jordan Lachler | Sjur Moshagen | Trond Trosterud
Proceedings of the 2014 Workshop on the Use of Computational Methods in the Study of Endangered Languages

2013

pdf
Using Finite State Transducers for Making Efficient Reading Comprehension Dictionaries
Ryan Johnson | Lene Antonsen | Trond Trosterud
Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013)

pdf
Building an Open-Source Development Infrastructure for Language Technology Projects
Sjur N. Moshagen | Tommi Pirinen | Trond Trosterud
Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013)

2012

pdf
Evaluating North Sámi to Norwegian assimilation RBMT
Trond Trosterud | Kevin Brubeck Unhammer
Proceedings of the Third International Workshop on Free/Open-Source Rule-Based Machine Translation

2010

pdf
Reusing Grammatical Resources for New Languages
Lene Antonsen | Trond Trosterud | Linda Wiechetek
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)

Grammatical approaches to language technology are often considered less optimal than statistical approaches in multilingual settings, where large-scale portability becomes an important issue. The present paper argues that there is a notable gain in reusing grammatical resources when porting technology to new languages. The pivot language is North Sámi, and the paper discusses portability with respect to the closely related Lule and South Sámi, and to the unrelated Faroese and Greenlandic languages.

2009

pdf
Interactive pedagogical programs based on constraint grammar
Lene Antonsen | Saara Huhmarniemi | Trond Trosterud
Proceedings of the 17th Nordic Conference of Computational Linguistics (NODALIDA 2009)

pdf
Reuse of free resources in machine translation between Nynorsk and Bokmål
Kevin Unhammer | Trond Trosterud
Proceedings of the First International Workshop on Free/Open-Source Rule-Based Machine Translation

We describe the development of a two-way shallow-transfer machine translation system between Norwegian Nynorsk and Norwegian Bokma ̊l built on the Apertium platform, using the Free and Open Source resources Norsk Ordbank and the Oslo–Bergen Constraint Grammar tagger. We detail the integration of these and other resources in the system along with the construction of the lexical and structural transfer, and evaluate the translation quality in comparison with another system. Finally, some future work is suggested.

pdf
Developing Prototypes for Machine Translation between Two Sami Languages
Francis M. Tyers | Linda Wiechetek | Trond Trosterud
Proceedings of the 13th Annual conference of the European Association for Machine Translation

2008

pdf
Finite State Solutions For Reduplication In Kinyarwanda Language
Jackson Muhirwe | Trond Trosterud
Proceedings of the IJCNLP-08 Workshop on NLP for Less Privileged Languages

2007

pdf
Usage of XSL Stylesheets for the Annotation of the Sámi Language Corpora.
Saara Huhmarniemi | Sjur N. Moshagen | Trond Trosterud
Proceedings of the Linguistic Annotation Workshop