Proceedings of the 9th Web as Corpus Workshop (WaC-9)
Felix Bildhauer, Roland Schäfer (Editors)
- Anthology ID:
- W14-04
- Month:
- April
- Year:
- 2014
- Address:
- Gothenburg, Sweden
- Venue:
- WAC
- SIG:
- SIGWAC
- Publisher:
- Association for Computational Linguistics
- URL:
- https://aclanthology.org/W14-04
- DOI:
- 10.3115/v1/W14-04
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/W14-04.pdf
Proceedings of the 9th Web as Corpus Workshop (WaC-9)
Felix Bildhauer
|
Roland Schäfer
Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources
Adrien Barbaresi
Focused Web Corpus Crawling
Roland Schäfer
|
Adrien Barbaresi
|
Felix Bildhauer
Less Destructive Cleaning of Web Documents by Using Standoff Annotation
Maik Stührenberg
Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese
Magali Sanches Duran
|
Lucas Avanço
|
Sandra Aluísio
|
Thiago Pardo
|
Maria da Graça Volpe Nunes
{bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian
Nikola Ljubešić
|
Filip Klubička
The PAISÀ Corpus of Italian Web Texts
Verena Lyding
|
Egon Stemle
|
Claudia Borghetti
|
Marco Brunello
|
Sara Castagnoli
|
Felice Dell’Orletta
|
Henrik Dittmann
|
Alessandro Lenci
|
Vito Pirrelli