Orthographic Errors in Web Pages: Toward Cleaner Web Corpora

Christoph Ringlstetter, Klaus U. Schulz, Stoyan Mihov


Anthology ID:
J06-3001
Volume:
Computational Linguistics, Volume 32, Number 3, September 2006
Month:
September
Year:
2006
Venue:
CL
Pages:
295–340
URL:
https://aclanthology.org/J06-3001
DOI:
10.1162/coli.2006.32.3.295
Cite (ACL):
Christoph Ringlstetter, Klaus U. Schulz, and Stoyan Mihov. 2006. Orthographic Errors in Web Pages: Toward Cleaner Web Corpora. Computational Linguistics, 32(3):295–340.
Cite (Informal):
Orthographic Errors in Web Pages: Toward Cleaner Web Corpora (Ringlstetter et al., CL 2006)
PDF:
https://preview.aclanthology.org/ingestion-script-update/J06-3001.pdf