Abstract
This paper reports on a translation engine designed for the needs of the Polish State Border Guard. The engine is a component of the AI Searcher system, whose aim is to search for Internet texts, written in Polish, Russian, Ukrainian or Belarusian, which may lead to criminal acts at the eastern border of the European Union. The system is intended for Polish users, and the translation engine should serve to assist understanding of non-Polish documents. The engine was trained on general-domain texts. The adaptation for the criminal domain consisted in the appropriate translation of criminal terms and proper names, such as forenames, surnames and geographical objects. The translation process needs to take into account the rich inflection found in all of the languages of interest. To this end, a method based on constrained decoding that incorporates an inflected lexicon into a neural translation process was applied in the engine.- Anthology ID:
- 2021.mtsummit-up.5
- Volume:
- Proceedings of Machine Translation Summit XVIII: Users and Providers Track
- Month:
- August
- Year:
- 2021
- Address:
- Virtual
- Venue:
- MTSummit
- SIG:
- Publisher:
- Association for Machine Translation in the Americas
- Note:
- Pages:
- 38–43
- Language:
- URL:
- https://aclanthology.org/2021.mtsummit-up.5
- DOI:
- Cite (ACL):
- Artur Nowakowski and Krzysztof Jassem. 2021. Neural Translator Designed to Protect the Eastern Border of the European Union. In Proceedings of Machine Translation Summit XVIII: Users and Providers Track, pages 38–43, Virtual. Association for Machine Translation in the Americas.
- Cite (Informal):
- Neural Translator Designed to Protect the Eastern Border of the European Union (Nowakowski & Jassem, MTSummit 2021)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/2021.mtsummit-up.5.pdf