@inproceedings{henry-etal-2017-evaluating,
    title = "Evaluating Feature Extraction Methods for Knowledge-based Biomedical Word Sense Disambiguation",
    author = "Henry, Sam  and
      Cuffy, Clint  and
      McInnes, Bridget",
    editor = "Cohen, Kevin Bretonnel  and
      Demner-Fushman, Dina  and
      Ananiadou, Sophia  and
      Tsujii, Junichi",
    booktitle = "Proceedings of the 16th {B}io{NLP} Workshop",
    month = aug,
    year = "2017",
    address = "Vancouver, Canada,",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/iwcs-25-ingestion/W17-2334/",
    doi = "10.18653/v1/W17-2334",
    pages = "272--281",
    abstract = "In this paper, we present an analysis of feature extraction methods via dimensionality reduction for the task of biomedical Word Sense Disambiguation (WSD). We modify the vector representations in the 2-MRD WSD algorithm, and evaluate four dimensionality reduction methods: Word Embeddings using Continuous Bag of Words and Skip Gram, Singular Value Decomposition (SVD), and Principal Component Analysis (PCA). We also evaluate the effects of vector size on the performance of each of these methods. Results are evaluated on five standard evaluation datasets (Abbrev.100, Abbrev.200, Abbrev.300, NLM-WSD, and MSH-WSD). We find that vector sizes of 100 are sufficient for all techniques except SVD, for which a vector size of 1500 is referred. We also show that SVD performs on par with Word Embeddings for all but one dataset."
}Markdown (Informal)
[Evaluating Feature Extraction Methods for Knowledge-based Biomedical Word Sense Disambiguation](https://preview.aclanthology.org/iwcs-25-ingestion/W17-2334/) (Henry et al., BioNLP 2017)
ACL