A modular open-source focused crawler for mining monolingual and bilingual corpora from the web

Vassilis Papavassiliou, Prokopis Prokopidis, Gregor Thurmair


Anthology ID:
W13-2506
Volume:
Proceedings of the Sixth Workshop on Building and Using Comparable Corpora
Month:
August
Year:
2013
Address:
Sofia, Bulgaria
Venue:
BUCC
Publisher:
Association for Computational Linguistics
Pages:
43–51
URL:
https://aclanthology.org/W13-2506
Cite (ACL):
Vassilis Papavassiliou, Prokopis Prokopidis, and Gregor Thurmair. 2013. A modular open-source focused crawler for mining monolingual and bilingual corpora from the web. In Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, pages 43–51, Sofia, Bulgaria. Association for Computational Linguistics.
Cite (Informal):
A modular open-source focused crawler for mining monolingual and bilingual corpora from the web (Papavassiliou et al., BUCC 2013)
PDF:
https://preview.aclanthology.org/ingestion-script-update/W13-2506.pdf