Targum — a Multilingual New Testament Translation Corpus

Maciej Rapacz, Aleksander Smywiński-Pohl


Abstract
Many European languages possess rich biblical translation histories, yet existing corpora, which prioritize linguistic breadth, often fail to capture this depth. To address this gap, we introduce a multilingual corpus of 651 New Testament translations, of which 334 are unique, spanning five languages with 2.4–5.0× more translations per language than any prior corpus: English (194 unique versions from 390 total), French (41 from 78), Italian (17 from 33), Polish (29 from 48), and Spanish (53 from 102). Aggregated from 12 online biblical libraries and one preexisting corpus, each translation is annotated with metadata that maps the text to a standardized identifier for the work, its specific edition, and its year of revision. This canonicalization allows researchers to define "uniqueness" for their own needs: they can perform micro-level analyses on translation families, such as the KJV lineage, or conduct macro-level studies by deduplicating closely related texts. By providing the first multilingual resource with sufficient depth per language for flexible, multilevel analysis, the corpus fills a gap in the quantitative study of translation history.
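The idea of user-defined "uniqueness" in the abstract can be illustrated with a minimal sketch. It assumes each translation record carries a work identifier, an edition identifier, and a revision year; the field names (`work_id`, `edition_id`, `revision_year`), the `deduplicate` helper, and the sample rows are all hypothetical and are not taken from the corpus or its actual schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Translation:
    # Hypothetical field names; the corpus's real metadata schema may differ.
    source: str         # online library the text was collected from
    language: str
    work_id: str        # standardized identifier for the work
    edition_id: str     # identifier for the specific edition
    revision_year: int  # year of revision


def deduplicate(records, key):
    """Keep the first record seen for each distinct value of key(record)."""
    unique = {}
    for record in records:
        unique.setdefault(key(record), record)
    return list(unique.values())


# Illustrative rows only, not data from the corpus.
records = [
    Translation("library_a", "en", "KJV", "KJV-1769", 1769),
    Translation("library_b", "en", "KJV", "KJV-1769", 1769),
    Translation("library_a", "en", "KJV", "KJV-1611", 1611),
    Translation("library_c", "en", "ASV", "ASV-1901", 1901),
]

# Micro level: every distinct edition of a work counts as unique.
by_edition = deduplicate(
    records, key=lambda r: (r.work_id, r.edition_id, r.revision_year)
)

# Macro level: collapse each translation family to a single text.
by_work = deduplicate(records, key=lambda r: r.work_id)

print(len(records), len(by_edition), len(by_work))
```

Passing the key as a function keeps the notion of uniqueness outside the data, so the same records could serve both a study of one translation family and a cross-work comparison.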
Anthology ID:
2026.lrec-main.564
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
Publisher:
ELRA Language Resource Association
Pages:
7092–7105
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.564/
Cite (ACL):
Maciej Rapacz and Aleksander Smywiński-Pohl. 2026. Targum — a Multilingual New Testament Translation Corpus. International Conference on Language Resources and Evaluation, main:7092–7105.
Cite (Informal):
Targum — a Multilingual New Testament Translation Corpus (Rapacz & Smywiński-Pohl, LREC 2026)
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.564.pdf