Wikidata-Driven Entity-Aware Translation: Boosting LLMs with External Knowledge

Lu Xu


Abstract
This paper presents an entity-aware machine translation system that significantly improves named entity translation by integrating external knowledge from Wikidata with Large Language Models (LLMs). While LLMs demonstrate strong general translation capabilities, they struggle with named entities that require specific cultural or domain knowledge. We address this challenge through two approaches: retrieving multilingual entity representations using gold Wikidata IDs, and employing Relik, an information extraction tool, to automatically detect and link entities without gold annotations. Experiments across multiple language pairs show our system outperforms baselines by up to 63 percentage points in entity translation accuracy (m-ETA) while maintaining high overall translation quality. Our approach ranked 3rd overall and 1st among non-finetuned systems on the SemEval-2025 Task 2 leaderboard. Additionally, we introduced language-specific post-processing further enhances performance, particularly for Traditional Chinese translations.
Anthology ID:
2025.semeval-1.238
Volume:
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1802–1809
Language:
URL:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.238/
DOI:
Bibkey:
Cite (ACL):
Lu Xu. 2025. Wikidata-Driven Entity-Aware Translation: Boosting LLMs with External Knowledge. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 1802–1809, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Wikidata-Driven Entity-Aware Translation: Boosting LLMs with External Knowledge (Xu, SemEval 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.238.pdf