Low-resource Post Processing of Noisy OCR Output for Historical Corpus Digitisation

Caitlin Richter, Matthew Wickes, Deniz Beser, Mitch Marcus


Anthology ID:
L18-1369
Volume:
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Month:
May
Year:
2018
Address:
Miyazaki, Japan
Editors:
Nicoletta Calzolari, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Koiti Hasida, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis, Takenobu Tokunaga
Venue:
LREC
Publisher:
European Language Resources Association (ELRA)
URL:
https://aclanthology.org/L18-1369
Cite (ACL):
Caitlin Richter, Matthew Wickes, Deniz Beser, and Mitch Marcus. 2018. Low-resource Post Processing of Noisy OCR Output for Historical Corpus Digitisation. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association (ELRA).
Cite (Informal):
Low-resource Post Processing of Noisy OCR Output for Historical Corpus Digitisation (Richter et al., LREC 2018)
PDF:
https://preview.aclanthology.org/nschneid-patch-2/L18-1369.pdf