From Shijing to English and German: Resources and Evaluation for LLM Translation of Early Chinese Poetry

Ying Jiao, Meng Sun


Abstract
While large language models (LLMs) show promise in literary translation, Shijing (The Book of Songs) serves as a rigorous yet under-explored testbed for testing their limits, given its linguistic antiquity and complex poetic constraints. Automated evaluation in this domain is currently hindered by a scarcity of multilingual resources and the inadequacy of existing metrics in capturing both semantic fidelity and aesthetic quality. In this paper, we bridge these gaps by curating a Shijing parallel corpus with line-by-line Chinese-English-German alignments, together with a fine-grained lexical knowledge base (KB) for archaic expressions. Based on these resources, we propose a hybrid evaluation framework that integrates knowledge-driven, rule-based, and LLM-as-judge metrics. Experimental results show that our framework achieves significantly higher human correlation than traditional metrics and demonstrates high statistical stability. By applying this framework to evaluate representative LLMs, we reveal that while top-tier models like Gemini-2.5-Pro and DeepSeek-3.1 show potential, achieving semantic precision and aesthetic sophistication—particularly in lower-resource directions like German—remains a persistent challenge. Our code, lexical KB, and corpus reconstruction protocols are available at https://github.com/ML-KULeuven/ShijingLLMTrans.
Anthology ID:
2026.findings-acl.542
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
11143–11162
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.542/
DOI:
Bibkey:
Cite (ACL):
Ying Jiao and Meng Sun. 2026. From Shijing to English and German: Resources and Evaluation for LLM Translation of Early Chinese Poetry. In Findings of the Association for Computational Linguistics: ACL 2026, pages 11143–11162, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
From Shijing to English and German: Resources and Evaluation for LLM Translation of Early Chinese Poetry (Jiao & Sun, Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.542.pdf
Checklist:
 2026.findings-acl.542.checklist.pdf