Emanuel Dima


2018

pdf bib
Handling Big Data and Sensitive Data Using EUDAT’s Generic Execution Framework and the WebLicht Workflow Engine.
Claus Zinn | Wei Qui | Marie Hinrichs | Emanuel Dima | Alexandr Chernov
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2016

pdf bib
Crosswalking from CMDI to Dublin Core and MARC 21
Claus Zinn | Thorsten Trippel | Steve Kaminski | Emanuel Dima
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

The Component MetaData Infrastructure (CMDI) is a framework for the creation and usage of metadata formats to describe all kinds of resources in the CLARIN world. To better connect to the library world, and to allow librarians to enter metadata for linguistic resources into their catalogues, a crosswalk from CMDI-based formats to bibliographic standards is required. The general and rather fluid nature of CMDI, however, makes it hard to map arbitrary CMDI schemas to metadata standards such as Dublin Core (DC) or MARC 21, which have a mature, well-defined and fixed set of field descriptors. In this paper, we address the issue and propose crosswalks between CMDI-based profiles originating from the NaLiDa project and DC and MARC 21, respectively.

2012

pdf bib
A Metadata Editor to Support the Description of Linguistic Resources
Emanuel Dima | Christina Hoppermann | Erhard Hinrichs | Thorsten Trippel | Claus Zinn
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

Creating and maintaining metadata for various kinds of resources requires appropriate tools to assist the user. The paper presents the metadata editor ProFormA for the creation and editing of CMDI (Component Metadata Infrastructure) metadata in web forms. This editor supports a number of CMDI profiles currently being provided for different types of resources. Since the editor is based on XForms and server-side processing, users can create and modify CMDI files in their standard browser without the need for further processing. Large parts of ProFormA are implemented as web services in order to reuse them in other contexts and programs.

pdf bib
A Repository for the Sustainable Management of Research Data
Emanuel Dima | Verena Henrich | Erhard Hinrichs | Marie Hinrichs | Christina Hoppermann | Thorsten Trippel | Thomas Zastrow | Claus Zinn
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

This paper presents the system architecture as well as the underlying workflow of the Extensible Repository System of Digital Objects (ERDO) which has been developed for the sustainable archiving of language resources within the Tübingen CLARIN-D project. In contrast to other approaches focusing on archiving experts, the described workflow can be used by researchers without required knowledge in the field of long-term storage for transferring data from their local file systems into a persistent repository.