Exploring Social Sciences Archives with Explainable Document Linkage through Question Generation

Elie Antoine, Hyun Jung Kang, Ismaël Rousseau, Ghislaine Azémard, Frederic Bechet, Geraldine Damnati


Abstract
This paper proposes a new approach for exploring digitized humanities and social sciences collections based on explainable links built from questions. Our experiments show the quality of our automatically generated questions and their relevance in a local context, as well as the originality of the links produced by embeddings based on these questions. We also analyze the types of questions generated on our corpus and the related uses that can enrich exploration. Finally, we discuss the relationships between co-references, the generated questions, and the answers extracted from the text, which open a path for future improvements to our system in co-reference resolution.
Anthology ID:
2023.latechclfl-1.16
Volume:
Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Venue:
LaTeCHCLfL
Publisher:
Association for Computational Linguistics
Pages:
141–151
URL:
https://aclanthology.org/2023.latechclfl-1.16
Cite (ACL):
Elie Antoine, Hyun Jung Kang, Ismaël Rousseau, Ghislaine Azémard, Frederic Bechet, and Geraldine Damnati. 2023. Exploring Social Sciences Archives with Explainable Document Linkage through Question Generation. In Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 141–151, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
Exploring Social Sciences Archives with Explainable Document Linkage through Question Generation (Antoine et al., LaTeCHCLfL 2023)
PDF:
https://preview.aclanthology.org/remove-xml-comments/2023.latechclfl-1.16.pdf
Video:
https://preview.aclanthology.org/remove-xml-comments/2023.latechclfl-1.16.mp4