Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology
Maria Gavrilidou, Penny Labropoulou, Stelios Piperidis, Voula Giouli, Nicoletta Calzolari, Monica Monachini, Claudia Soria, Khalid Choukri
Abstract
This paper reports on the multilingual Language Resources (MLRs), i.e. parallel corpora and terminological lexicons for less widely digitally available languages, that have been developed in the INTERA project and the methodology adopted for their production. Special emphasis is given to the reality factors that have influenced the MLRs development approach and their final constitution. Building on the experience gained in the project, a production model has been elaborated, suggesting ways and techniques that can be exploited in order to improve LRs production taking into account realistic issues.- Anthology ID:
- L06-1321
- Volume:
- Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
- Month:
- May
- Year:
- 2006
- Address:
- Genoa, Italy
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2006/pdf/534_pdf.pdf
- DOI:
- Cite (ACL):
- Maria Gavrilidou, Penny Labropoulou, Stelios Piperidis, Voula Giouli, Nicoletta Calzolari, Monica Monachini, Claudia Soria, and Khalid Choukri. 2006. Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy. European Language Resources Association (ELRA).
- Cite (Informal):
- Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology (Gavrilidou et al., LREC 2006)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2006/pdf/534_pdf.pdf