@inproceedings{pighin-etal-2012-faust,
    title = "The {FAUST} Corpus of Adequacy Assessments for Real-World Machine Translation Output",
    author = "Pighin, Daniele  and
      M{\`a}rquez, Llu{\'i}s  and
      Formiga, Llu{\'i}s",
    editor = "Calzolari, Nicoletta  and
      Choukri, Khalid  and
      Declerck, Thierry  and
      Do{\u{g}}an, Mehmet U{\u{g}}ur  and
      Maegaard, Bente  and
      Mariani, Joseph  and
      Moreno, Asuncion  and
      Odijk, Jan  and
      Piperidis, Stelios",
    booktitle = "Proceedings of the Eighth International Conference on Language Resources and Evaluation ({LREC}'12)",
    month = may,
    year = "2012",
    address = "Istanbul, Turkey",
    publisher = "European Language Resources Association (ELRA)",
    url = "https://preview.aclanthology.org/ingest-emnlp/L12-1181/",
    pages = "29--35",
    abstract = "We present a corpus consisting of 11,292 real-world English to Spanish automatic translations annotated with relative (ranking) and absolute (adequate/non-adequate) quality assessments. The translation requests, collected through the popular translation portal \url{http://reverso.net}, provide a most variated sample of real-world machine translation (MT) usage, from complete sentences to units of one or two words, from well-formed to hardly intelligible texts, from technical documents to colloquial and slang snippets. In this paper, we present 1) a preliminary annotation experiment that we carried out to select the most appropriate quality criterion to be used for these data, 2) a graph-based methodology inspired by Interactive Genetic Algorithms to reduce the annotation effort, and 3) the outcomes of the full-scale annotation experiment, which result in a valuable and original resource for the analysis and characterization of MT-output quality."
}