@inproceedings{salmon-vallet-2014-effortless,
    title = "An Effortless Way To Create Large-Scale Datasets For Famous Speakers",
    author = "Salmon, Fran{\c{c}}ois  and
      Vallet, F{\'e}licien",
    editor = "Calzolari, Nicoletta  and
      Choukri, Khalid  and
      Declerck, Thierry  and
      Loftsson, Hrafn  and
      Maegaard, Bente  and
      Mariani, Joseph  and
      Moreno, Asuncion  and
      Odijk, Jan  and
      Piperidis, Stelios",
    booktitle = "Proceedings of the Ninth International Conference on Language Resources and Evaluation ({LREC}'14)",
    month = may,
    year = "2014",
    address = "Reykjavik, Iceland",
    publisher = "European Language Resources Association (ELRA)",
    url = "https://preview.aclanthology.org/ingest-emnlp/L14-1283/",
    pages = "348--352",
    abstract = "The creation of large-scale multimedia datasets has become a scientific matter in itself. Indeed, the fully-manual annotation of hundreds or thousands of hours of video and/or audio turns out to be practically infeasible. In this paper, we propose an extremely handy approach to automatically construct a database of famous speakers from TV broadcast news material. We then run a user experiment with a correctly designed tool that demonstrates that very reliable results can be obtained with this method. In particular, a thorough error analysis demonstrates the value of the approach and provides hints for the improvement of the quality of the dataset."
}