Abstract
This paper presents an Italian→Catalan RBMT system automatically built by combining the linguistic data of the existing pairs Spanish–Catalan and Spanish–Italian. A lightweight manual postprocessing is carried out in order to fix inconsistencies in the automatically derived dictionaries and to add very frequent words that are missing according to a corpus analysis. The system is evaluated on the KDE4 corpus and outperforms Google Translate by approximately ten absolute points in terms of both TER and GTM.
- Anthology ID:
- 2011.freeopmt-1.12
- Volume:
- Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation
- Month:
- January 20-21
- Year:
- 2011
- Address:
- Barcelona, Spain
- Venue:
- FreeOpMT
- Pages:
- 77–81
- URL:
- https://aclanthology.org/2011.freeopmt-1.12
- Cite (ACL):
- Antonio Toral, Mireia Ginestí-Rosell, and Francis Tyers. 2011. An Italian to Catalan RBMT system reusing data from existing language pairs. In Proceedings of the Second International Workshop on Free/Open-Source Rule-Based Machine Translation, pages 77–81, Barcelona, Spain.
- Cite (Informal):
- An Italian to Catalan RBMT system reusing data from existing language pairs (Toral et al., FreeOpMT 2011)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/2011.freeopmt-1.12.pdf
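
The abstract describes deriving an Italian→Catalan dictionary from the existing Spanish–Italian and Spanish–Catalan pairs. The abstract does not specify the procedure, so the following is only a minimal sketch of one common way to do this: joining two bilingual dictionaries on the shared Spanish side. The function names `pivot_dictionary` and `ambiguous_entries`, the plain-dict data layout, and the toy entries are all assumptions for illustration; they are not the authors' tooling or data format.

```python
from collections import defaultdict


def pivot_dictionary(it_es, es_ca):
    """Compose Italian->Spanish and Spanish->Catalan into Italian->Catalan.

    Hypothetical helper: both inputs map a source word to a set of
    target words. Entries are joined on the shared Spanish word.
    """
    it_ca = defaultdict(set)
    for it_word, es_words in it_es.items():
        for es_word in es_words:
            # Spanish words missing from the second dictionary contribute nothing.
            for ca_word in es_ca.get(es_word, ()):
                it_ca[it_word].add(ca_word)
    return dict(it_ca)


def ambiguous_entries(it_ca):
    """Return entries with more than one candidate translation.

    Such entries are the kind of inconsistency that would need a
    manual check after automatic derivation.
    """
    return {word: targets for word, targets in it_ca.items() if len(targets) > 1}


# Toy data, assumed for illustration only.
it_es = {
    "cane": {"perro"},
    "nipote": {"sobrino", "nieto"},  # one Italian word, two Spanish words
    "zio": {"tío"},
}
es_ca = {
    "perro": {"gos"},
    "sobrino": {"nebot"},
    "nieto": {"nét"},
    # "tío" is deliberately missing, so "zio" gets no Catalan entry.
}

it_ca = pivot_dictionary(it_es, es_ca)
to_review = ambiguous_entries(it_ca)
missing = sorted(set(it_es) - set(it_ca))
```

In this sketch, `to_review` collects the entries where pivoting produced several candidates, and `missing` lists the Italian words that could not be carried over because the Spanish side had no match in the second dictionary. Both are cases that the kind of manual postprocessing mentioned in the abstract would plausibly need to address.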