Introducing CQuAE : A New French Contextualised Question-Answering Corpus for the Education Domain
Thomas Gerald, Anne Vilnat, Sofiane Ettayeb, Louis Tamames, Patrick Paroubek
Abstract
We present a new question answering corpus in French designed to educational domain. To be useful in such domain, we have to propose more complex questions and to be able to justify the answers on validated material. We analyze some properties of this corpus. The last part of this paper will be devoted to present the first experiments we have carried out to demonstrate the value of this dataset for learning a Retrieval Augmented Genration framework. Different experiments are proposed, with an automatic evaluation. A human evaluation is proposed to confirm or infirm this automatic evaluation.- Anthology ID:
- 2024.lrec-main.808
- Volume:
- Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
- Month:
- May
- Year:
- 2024
- Address:
- Torino, Italia
- Editors:
- Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
- Venues:
- LREC | COLING
- SIG:
- Publisher:
- ELRA and ICCL
- Note:
- Pages:
- 9234–9244
- Language:
- URL:
- https://aclanthology.org/2024.lrec-main.808
- DOI:
- Cite (ACL):
- Thomas Gerald, Anne Vilnat, Sofiane Ettayeb, Louis Tamames, and Patrick Paroubek. 2024. Introducing CQuAE : A New French Contextualised Question-Answering Corpus for the Education Domain. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 9234–9244, Torino, Italia. ELRA and ICCL.
- Cite (Informal):
- Introducing CQuAE : A New French Contextualised Question-Answering Corpus for the Education Domain (Gerald et al., LREC-COLING 2024)
- PDF:
- https://preview.aclanthology.org/add_acl24_videos/2024.lrec-main.808.pdf