@inproceedings{labaka-etal-2016-domain,
    title = "Domain Adaptation in {MT} Using Titles in {W}ikipedia as a Parallel Corpus: Resources and Evaluation",
    author = "Labaka, Gorka  and
      Alegria, I{\~n}aki  and
      Sarasola, Kepa",
    editor = "Calzolari, Nicoletta  and
      Choukri, Khalid  and
      Declerck, Thierry  and
      Goggi, Sara  and
      Grobelnik, Marko  and
      Maegaard, Bente  and
      Mariani, Joseph  and
      Mazo, Helene  and
      Moreno, Asuncion  and
      Odijk, Jan  and
      Piperidis, Stelios",
    booktitle = "Proceedings of the Tenth International Conference on Language Resources and Evaluation ({LREC}'16)",
    month = may,
    year = "2016",
    address = "Portoro{\v{z}}, Slovenia",
    publisher = "European Language Resources Association (ELRA)",
    url = "https://preview.aclanthology.org/landing_page/L16-1351/",
    pages = "2209--2213",
    abstract = "This paper presents how an state-of-the-art SMT system is enriched by using an extra in-domain parallel corpora extracted from Wikipedia. We collect corpora from parallel titles and from parallel fragments in comparable articles from Wikipedia. We carried out an evaluation with a double objective: evaluating the quality of the extracted data and evaluating the improvement due to the domain-adaptation. We think this can be very useful for languages with limited amount of parallel corpora, where in-domain data is crucial to improve the performance of MT sytems. The experiments on the Spanish-English language pair improve a baseline trained with the Europarl corpus in more than 2 points of BLEU when translating in the Computer Science domain."
}Markdown (Informal)
[Domain Adaptation in MT Using Titles in Wikipedia as a Parallel Corpus: Resources and Evaluation](https://preview.aclanthology.org/landing_page/L16-1351/) (Labaka et al., LREC 2016)
ACL