Abstract
In this paper, we introduce a coverage-based scoring function that discriminates between parallel and non-parallel sentences. When plugged into Bleualign, a state-of-the-art sentence aligner, our function improves both precision and recall of alignments over the originally proposed BLEU score. Furthermore, since our scoring function uses Moses phrase tables directly we avoid the need to translate the texts to be aligned, which is time-consuming and a potential source of alignment errors.- Anthology ID:
- L16-1354
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 2228–2231
- Language:
- URL:
- https://aclanthology.org/L16-1354
- DOI:
- Cite (ACL):
- Luís Gomes and Gabriel Pereira Lopes. 2016. First Steps Towards Coverage-Based Sentence Alignment. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2228–2231, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- First Steps Towards Coverage-Based Sentence Alignment (Gomes & Lopes, LREC 2016)
- PDF:
- https://preview.aclanthology.org/paclic-22-ingestion/L16-1354.pdf