Julien Nioche


2008

pdf
The BNC Parsed with RASP4UIMA
Øistein E. Andersen | Julien Nioche | Ted Briscoe | John Carroll
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

We have integrated the RASP system with the UIMA framework (RASP4UIMA) and used this to parse the XML-encoded version of the British National Corpus (BNC). All original annotation is preserved, and parsing information, mainly in the form of grammatical relations, is added in an XML format. A few specific adaptations of the system to give better results with the BNC are discussed briefly. The RASP4UIMA system is publicly available and can be used to parse other corpora or document collections, and the final parsed version of the BNC will be deposited with the Oxford Text Archive.

2000

pdf
TyPTex: Inductive Typological Text Classification by Multivariate Statistical Analysis for NLP Systems Tuning/Evaluation
Helka Folch | Serge Heiden | Benoît Habert | Serge Fleury | Gabriel Illouz | Pierre Lafon | Julien Nioche | Sophie Prévost
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)