Annotating Football Matches: Influence of the Source Medium on Manual Annotation

Karën Fort, Vincent Claveau


Abstract
In this paper, we present an annotation campaign of football (soccer) matches, from a heterogeneous text corpus of both match minutes and video commentary transcripts, in French. The data, annotations and evaluation process are detailed, and the quality of the annotated corpus is discussed. In particular, we propose a new technique to better estimate the annotator agreement when few elements of a text are to be annotated. Based on that, we show how the source medium influenced the process and the quality.
Anthology ID:
L12-1357
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2567–2572
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/623_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Karën Fort and Vincent Claveau. 2012. Annotating Football Matches: Influence of the Source Medium on Manual Annotation. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2567–2572, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Annotating Football Matches: Influence of the Source Medium on Manual Annotation (Fort & Claveau, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/623_Paper.pdf