Next Generation Language Resources using Grid

Federico Calzolari, Eva Sassolini, Manuela Sassi, Sebastiana Cucurullo, Eugenio Picchi, Francesca Bertagna, Alessandro Enea, Monica Monachini, Claudia Soria, Nicoletta Calzolari


Abstract
This paper presents a case study concerning the challenges and requirements posed by next generation language resources, realized as an overall model of open, distributed and collaborative language infrastructure. If a sort of “new paradigm” for language resource sharing is required, we think that the emerging and still evolving technology connected to Grid computing is a very interesting and suitable one for a concrete realization of this vision. Given the current limitations of Grid computing, it is very important to test the new environment on basic language analysis tools, in order to get the feeling of what are the potentialities and possible limitations connected to its use in NLP. For this reason, we have done some experiments on a module of the Linguistic Miner, i.e. the extraction of linguistic patterns from restricted domain corpora. The Grid environment has produced the expected results (reduction of the processing time, huge storage capacity, data redundancy) without any additional cost for the final user.
Anthology ID:
L06-1388
Volume:
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
Month:
May
Year:
2006
Address:
Genoa, Italy
Editors:
Nicoletta Calzolari, Khalid Choukri, Aldo Gangemi, Bente Maegaard, Joseph Mariani, Jan Odijk, Daniel Tapias
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2006/pdf/631_pdf.pdf
DOI:
Bibkey:
Cite (ACL):
Federico Calzolari, Eva Sassolini, Manuela Sassi, Sebastiana Cucurullo, Eugenio Picchi, Francesca Bertagna, Alessandro Enea, Monica Monachini, Claudia Soria, and Nicoletta Calzolari. 2006. Next Generation Language Resources using Grid. In Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06), Genoa, Italy. European Language Resources Association (ELRA).
Cite (Informal):
Next Generation Language Resources using Grid (Calzolari et al., LREC 2006)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2006/pdf/631_pdf.pdf