@inproceedings{basaldella-collier-2019-bioreddit,
    title = "{B}io{R}eddit: Word Embeddings for User-Generated Biomedical {NLP}",
    author = "Basaldella, Marco  and
      Collier, Nigel",
    editor = "Holderness, Eben  and
      Jimeno Yepes, Antonio  and
      Lavelli, Alberto  and
      Minard, Anne-Lyse  and
      Pustejovsky, James  and
      Rinaldi, Fabio",
    booktitle = "Proceedings of the Tenth International Workshop on Health Text Mining and Information Analysis (LOUHI 2019)",
    month = nov,
    year = "2019",
    address = "Hong Kong",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/iwcs-25-ingestion/D19-6205/",
    doi = "10.18653/v1/D19-6205",
    pages = "34--38",
    abstract = "Word embeddings, in their different shapes and iterations, have changed the natural language processing research landscape in the last years. The biomedical text processing field is no stranger to this revolution; however, scholars in the field largely trained their embeddings on scientific documents only, even when working on user-generated data. In this paper we show how training embeddings from a corpus collected from user-generated text from medical forums heavily influences the performance on downstream tasks, outperforming embeddings trained both on general purpose data or on scientific papers when applied on user-generated content."
}