Leveraging LLM For Synchronizing Information Across Multilingual Tables
Siddharth Khincha, Tushar Kataria, Ankita Anand, Dan Roth, Vivek Gupta
Abstract
The vast amount of online information today poses challenges for non-English speakers, as much of it is concentrated in high-resource languages such as English and French. Wikipedia reflects this imbalance, with content in low-resource languages frequently outdated or incomplete. Recent research has sought to improve cross-language synchronization of Wikipedia tables using rule-based methods. These approaches can be effective, but they struggle with complexity and generalization. This paper explores large language models (LLMs) for multilingual information synchronization, using zero-shot prompting as a scalable solution. We introduce the Information Updation dataset, simulating the real-world process of updating outdated Wikipedia tables, and evaluate LLM performance. Our findings reveal that single-prompt approaches often produce suboptimal results, prompting us to introduce a task decomposition strategy that enhances coherence and accuracy. Our proposed method outperforms existing baselines, particularly in Information Updation (1.79%) and Information Addition (20.58%), highlighting the model’s strength in dynamically updating and enriching data across architectures.- Anthology ID:
- 2025.naacl-long.329
- Volume:
- Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
- Month:
- April
- Year:
- 2025
- Address:
- Albuquerque, New Mexico
- Editors:
- Luis Chiruzzo, Alan Ritter, Lu Wang
- Venue:
- NAACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 6474–6492
- Language:
- URL:
- https://preview.aclanthology.org/fix-sig-urls/2025.naacl-long.329/
- DOI:
- Cite (ACL):
- Siddharth Khincha, Tushar Kataria, Ankita Anand, Dan Roth, and Vivek Gupta. 2025. Leveraging LLM For Synchronizing Information Across Multilingual Tables. In Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 6474–6492, Albuquerque, New Mexico. Association for Computational Linguistics.
- Cite (Informal):
- Leveraging LLM For Synchronizing Information Across Multilingual Tables (Khincha et al., NAACL 2025)
- PDF:
- https://preview.aclanthology.org/fix-sig-urls/2025.naacl-long.329.pdf