The LIMA Multilingual Analyzer Made Free: FLOSS Resources Adaptation and Correction

Gaël de Chalendar


Abstract
At CEA LIST, we have decided to release our multilingual analyzer LIMA as Free software. Because we did not own all of the language resources it used, we had to select and adapt free ones to obtain results that are good enough and equivalent to those obtained with our previous resources. For English and French, we found and adapted a full-form dictionary and an annotated corpus for learning part-of-speech tagging models.
Anthology ID:
L14-1313
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2932–2937
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/362_Paper.pdf
Cite (ACL):
Gaël de Chalendar. 2014. The LIMA Multilingual Analyzer Made Free: FLOSS Resources Adaptation and Correction. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 2932–2937, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
The LIMA Multilingual Analyzer Made Free: FLOSS Resources Adaptation and Correction (de Chalendar, LREC 2014)