Hard Time Parsing Questions: Building a QuestionBank for French

Djamé Seddah, Marie Candito


Abstract
We present the French Question Bank, a treebank of 2600 questions. We show that the performance of classical parsing models drops when facing out-of-domain data with strong structural divergences, while the inclusion of this data set is highly beneficial without harming the parsing of non-question data. With two thirds of it aligned with the QB (Judge et al., 2006), and being freely available, this treebank will prove useful for building robust NLP systems.
Anthology ID:
L16-1375
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
Pages:
2366–2370
URL:
https://aclanthology.org/L16-1375
Cite (ACL):
Djamé Seddah and Marie Candito. 2016. Hard Time Parsing Questions: Building a QuestionBank for French. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2366–2370, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Hard Time Parsing Questions: Building a QuestionBank for French (Seddah & Candito, LREC 2016)
PDF:
https://preview.aclanthology.org/nschneid-patch-1/L16-1375.pdf