Patent Claim Translation via Continual Pre-training of Large Language Models with Parallel Data

Haruto Azami, Minato Kondo, Takehito Utsuro, Masaaki Nagata


Abstract
Recent advancements in large language models (LLMs) have enabled their application across a wide range of domains. In patent translation, however, Transformer encoder-decoder models remain the standard approach, and the potential of LLMs for this task has not been thoroughly explored. In this study, we translated patent claims with an LLM fine-tuned on parallel data through continual pre-training and supervised fine-tuning, following the methodology proposed by Guo et al. (2024) and Kondo et al. (2024). A comparative evaluation against Transformer encoder-decoder translations showed that the LLM achieved high BLEU and COMET scores, demonstrating improvements in addressing issues such as omissions and repetitions. Nonetheless, hallucination errors not observed in the conventional models occurred in some cases and degraded translation quality. This study highlights the promise of LLMs for patent translation while identifying challenges that warrant further investigation.
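The evaluation described in the abstract (scoring system outputs against references with BLEU and COMET) can be reproduced with standard open-source tooling. Below is a minimal sketch using the sacrebleu and unbabel-comet libraries; the file names and the Unbabel/wmt22-comet-da checkpoint are illustrative assumptions, since this page does not specify the paper's exact evaluation configuration.

```python
# Minimal sketch: corpus-level BLEU and COMET scoring of MT output.
# Assumptions (not from the paper): file paths and the wmt22-comet-da
# checkpoint. Requires: pip install sacrebleu unbabel-comet
import sacrebleu
from comet import download_model, load_from_checkpoint


def read_lines(path: str) -> list[str]:
    with open(path, encoding="utf-8") as f:
        return [line.strip() for line in f]


sources = read_lines("test.src")      # source patent claims
hypotheses = read_lines("test.hyp")   # system translations
references = read_lines("test.ref")   # reference translations

# Corpus-level BLEU; sacrebleu expects a list of reference streams.
bleu = sacrebleu.corpus_bleu(hypotheses, [references])
print(f"BLEU:  {bleu.score:.2f}")

# COMET is a neural metric that conditions on the source segment
# as well as the reference, hence the three-field input records.
comet_path = download_model("Unbabel/wmt22-comet-da")
comet = load_from_checkpoint(comet_path)
data = [
    {"src": s, "mt": h, "ref": r}
    for s, h, r in zip(sources, hypotheses, references)
]
result = comet.predict(data, batch_size=8, gpus=1)  # gpus=0 for CPU
print(f"COMET: {result.system_score:.4f}")
```

Note that BLEU uses only hypotheses and references, while COMET also reads the source side, which is why the sketch loads all three files.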
Anthology ID:
2025.mtsummit-1.23
Volume:
Proceedings of Machine Translation Summit XX: Volume 1
Month:
June
Year:
2025
Address:
Geneva, Switzerland
Editors:
Pierrette Bouillon, Johanna Gerlach, Sabrina Girletti, Lise Volkart, Raphael Rubino, Rico Sennrich, Ana C. Farinha, Marco Gaido, Joke Daems, Dorothy Kenny, Helena Moniz, Sara Szoc
Venue:
MTSummit
Publisher:
European Association for Machine Translation
Pages:
300–314
URL:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.mtsummit-1.23/
Cite (ACL):
Haruto Azami, Minato Kondo, Takehito Utsuro, and Masaaki Nagata. 2025. Patent Claim Translation via Continual Pre-training of Large Language Models with Parallel Data. In Proceedings of Machine Translation Summit XX: Volume 1, pages 300–314, Geneva, Switzerland. European Association for Machine Translation.
Cite (Informal):
Patent Claim Translation via Continual Pre-training of Large Language Models with Parallel Data (Azami et al., MTSummit 2025)
PDF:
https://preview.aclanthology.org/mtsummit-25-ingestion/2025.mtsummit-1.23.pdf