Global Open Resources and Information for Language and Linguistic Analysis (GORILLA)

Damir Cavar, Malgorzata Cavar, Lwin Moe


Abstract
The infrastructure Global Open Resources and Information for Language and Linguistic Analysis (GORILLA) was created as a resource that provides a bridge between disciplines such as documentary, theoretical, and corpus linguistics, speech and language technologies, and digital language archiving services. GORILLA is designed as an interface between digital language archive services and language data producers. It addresses various problems of common digital language archive infrastructures. At the same time it serves the speech and language technology communities by providing a platform to create and share speech and language data from low-resourced and endangered languages. It hosts an initial collection of language models for speech and natural language processing (NLP), and technologies or software tools for corpus creation and annotation. GORILLA is designed to address the Transcription Bottleneck in language documentation, and, at the same time to provide solutions to the general Language Resource Bottleneck in speech and language technologies. It does so by facilitating the cooperation between documentary and theoretical linguistics, and speech and language technologies research and development, in particular for low-resourced and endangered languages.
Anthology ID:
L16-1710
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
4484–4491
Language:
URL:
https://aclanthology.org/L16-1710
DOI:
Bibkey:
Cite (ACL):
Damir Cavar, Malgorzata Cavar, and Lwin Moe. 2016. Global Open Resources and Information for Language and Linguistic Analysis (GORILLA). In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 4484–4491, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Global Open Resources and Information for Language and Linguistic Analysis (GORILLA) (Cavar et al., LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/paclic-22-ingestion/L16-1710.pdf