Antoine Isaac


2025

Don’t Erase, Inform! Detecting and Contextualizing Harmful Language in Cultural Heritage Collections
Orfeas Menis Mastromichalakis | Jason Liartis | Kristina Rose | Antoine Isaac | Giorgos Stamou
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Cultural Heritage (CH) data hold invaluable knowledge, reflecting the history, traditions, and identities of societies, and shaping our understanding of the past and present. However, many CH collections contain outdated or offensive descriptions that reflect historical biases. CH Institutions (CHIs) face significant challenges in curating these data due to the vast scale and complexity of the task. To address this, we develop an AI-powered tool that detects offensive terms in CH metadata and provides contextual insights into their historical background and contemporary perception. We leverage a multilingual vocabulary co-created with marginalized communities, researchers, and CH professionals, along with traditional NLP techniques and Large Language Models (LLMs). Available as a standalone web app and integrated with major CH platforms, the tool has processed over 7.9 million records, contextualizing the contentious terms detected in their metadata. Rather than erasing these terms, our approach seeks to inform, making biases visible and providing actionable insights for creating more inclusive and accessible CH collections.

2022

Europeana Translate: Providing multilingual access to digital cultural heritage
Eirini Kaldeli | Mercedes García-Martínez | Antoine Isaac | Paolo Sebastiano Scalia | Arne Stabenau | Iván Lena Almor | Carmen Grau Lacal | Martín Barroso Ordóñez | Amando Estela | Manuel Herranz
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation

Europeana Translate is a project funded under the Connecting Europe Facility that aims to use state-of-the-art machine translation to increase the multilinguality of resources in the cultural heritage domain.

2007

Anchoring Dutch Cultural Heritage Thesauri to WordNet: Two Case Studies
Véronique Malaisé | Antoine Isaac | Luit Gazendam | Hennie Brugman
Proceedings of the Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2007)