Abstract
In this paper, we present an annotation campaign of football (soccer) matches, from a heterogeneous text corpus of both match minutes and video commentary transcripts, in French. The data, annotations and evaluation process are detailed, and the quality of the annotated corpus is discussed. In particular, we propose a new technique to better estimate the annotator agreement when few elements of a text are to be annotated. Based on that, we show how the source medium influenced the process and the quality.- Anthology ID:
- L12-1357
- Volume:
- Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
- Month:
- May
- Year:
- 2012
- Address:
- Istanbul, Turkey
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 2567–2572
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/623_Paper.pdf
- DOI:
- Cite (ACL):
- Karën Fort and Vincent Claveau. 2012. Annotating Football Matches: Influence of the Source Medium on Manual Annotation. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2567–2572, Istanbul, Turkey. European Language Resources Association (ELRA).
- Cite (Informal):
- Annotating Football Matches: Influence of the Source Medium on Manual Annotation (Fort & Claveau, LREC 2012)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/623_Paper.pdf