Krista Liin
2025
Paragraph-level Error Correction and Explanation Generation: Case Study for Estonian
Martin Vainikko
|
Taavi Kamarik
|
Karina Kert
|
Krista Liin
|
Silvia Maine
|
Kais Allkivi
|
Annekatrin Kaivapalu
|
Mark Fishel
Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)
We present a case study on building task-specific models for grammatical error correction and explanation generation tailored to learners of Estonian. Our approach handles whole paragraphs instead of sentences and leverages prompting proprietary large language models for generating synthetic training data, addressing the limited availability of error correction data and the complete absence of correction justification/explanation data in Estonian. We describe the chosen approach and pipeline and provide technical details for the experimental part. The final outcome is a set of open-weight models, which are released with a permissive license along with the generated synthetic error correction and explanation data.
2020
CLARIN: Distributed Language Resources and Technology in a European Infrastructure
Maria Eskevich
|
Franciska de Jong
|
Alexander König
|
Darja Fišer
|
Dieter Van Uytvanck
|
Tero Aalto
|
Lars Borin
|
Olga Gerassimenko
|
Jan Hajic
|
Henk van den Heuvel
|
Neeme Kahusk
|
Krista Liin
|
Martin Matthiesen
|
Stelios Piperidis
|
Kadri Vider
Proceedings of the 1st International Workshop on Language Technology Platforms
CLARIN is a European Research Infrastructure providing access to digital language resources and tools from across Europe and beyond to researchers in the humanities and social sciences. This paper focuses on CLARIN as a platform for the sharing of language resources. It zooms in on the service offer for the aggregation of language repositories and the value proposition for a number of communities that benefit from the enhanced visibility of their data and services as a result of integration in CLARIN. The enhanced findability of language resources is serving the social sciences and humanities (SSH) community at large and supports research communities that aim to collaborate based on virtual collections for a specific domain. The paper also addresses the wider landscape of service platforms based on language technologies which has the potential of becoming a powerful set of interoperable facilities to a variety of communities of use.
Search
Fix author
Co-authors
- Tero Aalto 1
- Kais Allkivi 1
- Lars Borin 1
- Maria Eskevich 1
- Mark Fishel 1
- show all...