Jiatong He
2025
Multimodal Neural Machine Translation: A Survey of the State of the Art
Yi Feng
|
Chuanyi Li
|
Jiatong He
|
Zhenyu Hou
|
Vincent Ng
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Multimodal neural machine translation (MNMT) has received increasing attention due to its widespread applications in various fields such as cross-border e-commerce and cross-border social media platforms. The task aims to integrate other modalities, such as the visual modality, with textual data to enhance translation performance. We survey the major milestones in MNMT research, providing a comprehensive overview of relevant datasets and recent methodologies, and discussing key challenges and promising research directions.