@inproceedings{yu-etal-2018-multilingual,
    title = "Multilingual Seq2seq Training with Similarity Loss for Cross-Lingual Document Classification",
    author = "Yu, Katherine  and
      Li, Haoran  and
      Oguz, Barlas",
    editor = "Augenstein, Isabelle  and
      Cao, Kris  and
      He, He  and
      Hill, Felix  and
      Gella, Spandana  and
      Kiros, Jamie  and
      Mei, Hongyuan  and
      Misra, Dipendra",
    booktitle = "Proceedings of the Third Workshop on Representation Learning for {NLP}",
    month = jul,
    year = "2018",
    address = "Melbourne, Australia",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/iwcs-25-ingestion/W18-3023/",
    doi = "10.18653/v1/W18-3023",
    pages = "175--179",
    abstract = "In this paper we continue experiments where neural machine translation training is used to produce joint cross-lingual fixed-dimensional sentence embeddings. In this framework we introduce a simple method of adding a loss to the learning objective which penalizes distance between representations of bilingually aligned sentences. We evaluate cross-lingual transfer using two approaches, cross-lingual similarity search on an aligned corpus (Europarl) and cross-lingual document classification on a recently published benchmark Reuters corpus, and we find the similarity loss significantly improves performance on both. Furthermore, we notice that while our Reuters results are very competitive, our English results are not as competitive, showing room for improvement in the current cross-lingual state-of-the-art. Our results are based on a set of 6 European languages."
}Markdown (Informal)
[Multilingual Seq2seq Training with Similarity Loss for Cross-Lingual Document Classification](https://preview.aclanthology.org/iwcs-25-ingestion/W18-3023/) (Yu et al., RepL4NLP 2018)
ACL