Shubin Zhao


2020

pdf bib
Embedding Semantic Taxonomies
Alyssa Lees | Chris Welty | Shubin Zhao | Jacek Korycki | Sara Mc Carthy
Proceedings of the 28th International Conference on Computational Linguistics

A common step in developing an understanding of a vertical domain, e.g. shopping, dining, movies, medicine, etc., is curating a taxonomy of categories specific to the domain. These human created artifacts have been the subject of research in embeddings that attempt to encode aspects of the partial ordering property of taxonomies. We compare Box Embeddings, a natural containment representation of category taxonomies, to partial-order embeddings and a baseline Bayes Net, in the context of representing the Medical Subject Headings (MeSH) taxonomy given a set of 300K PubMed articles with subject labels from MeSH. We deeply explore the experimental properties of training box embeddings, including preparation of the training data, sampling ratios and class balance, initialization strategies, and propose a fix to the original box objective. We then present first results in using these techniques for representing a bipartite learning problem (i.e. collaborative filtering) in the presence of taxonomic relations within each partition, inferring disease (anatomical) locations from their use as subject labels in journal articles. Our box model substantially outperforms all baselines for taxonomic reconstruction and bipartite relationship experiments. This performance improvement is observed both in overall accuracy and the weighted spread by true taxonomic depth.

2005

pdf bib
Extracting Relations with Integrated Information Using Kernel Methods
Shubin Zhao | Ralph Grishman
Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05)

2004

pdf bib
Discriminative Slot Detection Using Kernel Methods
Shubin Zhao | Adam Meyers | Ralph Grishman
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics

2001

pdf bib
Covering Treebanks with GLARF
A. Meyers | Ralph Grishman | Michiko Kosaka | Shubin Zhao
Proceedings of the ACL 2001 Workshop on Sharing Tools and Resources