A modular open-source focused crawler for mining monolingual and bilingual corpora from the web
Vassilis Papavassiliou, Prokopis Prokopidis, Gregor Thurmair
- Anthology ID:
- W13-2506
- Volume:
- Proceedings of the Sixth Workshop on Building and Using Comparable Corpora
- Month:
- August
- Year:
- 2013
- Address:
- Sofia, Bulgaria
- Venue:
- BUCC
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 43–51
- Language:
- URL:
- https://aclanthology.org/W13-2506
- DOI:
- Cite (ACL):
- Vassilis Papavassiliou, Prokopis Prokopidis, and Gregor Thurmair. 2013. A modular open-source focused crawler for mining monolingual and bilingual corpora from the web. In Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, pages 43–51, Sofia, Bulgaria. Association for Computational Linguistics.
- Cite (Informal):
- A modular open-source focused crawler for mining monolingual and bilingual corpora from the web (Papavassiliou et al., BUCC 2013)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/W13-2506.pdf