Scaling Language Data Import/Export with a Data Transformer Interface

Nicholas Buckeridge, Ben Foley


Abstract
This paper focuses on the technical improvement of Elpis, a language technology which assists people in the process of transcription, particularly for low-resource language documentation situations. To provide better support for the diversity of file formats encountered by people working to document the world’s languages, a Data Transformer interface has been developed to abstract the complexities of designing individual data import scripts. This work took place as part of a larger project of code quality improvement and the publication of template code that can be used for development of other language technologies.
Anthology ID: 2020.sltu-1.16
Volume: Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL)
Month: May
Year: 2020
Address: Marseille, France
Editors: Dorothee Beermann, Laurent Besacier, Sakriani Sakti, Claudia Soria
Venue: SLTU
Publisher: European Language Resources Association
Pages: 121–125
Language: English
URL: https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2020.sltu-1.16/
Cite (ACL): Nicholas Buckeridge and Ben Foley. 2020. Scaling Language Data Import/Export with a Data Transformer Interface. In Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL), pages 121–125, Marseille, France. European Language Resources Association.
Cite (Informal): Scaling Language Data Import/Export with a Data Transformer Interface (Buckeridge & Foley, SLTU 2020)
PDF: https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2020.sltu-1.16.pdf