Michelle Gregory
Also published as: Michelle L. Gregory, M. L. Gregory
2019
Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings
Zenan Zhai | Dat Quoc Nguyen | Saber Akhondi | Camilo Thorne | Christian Druckenbrodt | Trevor Cohn | Michelle Gregory | Karin Verspoor
Proceedings of the 18th BioNLP Workshop and Shared Task
Zenan Zhai | Dat Quoc Nguyen | Saber Akhondi | Camilo Thorne | Christian Druckenbrodt | Trevor Cohn | Michelle Gregory | Karin Verspoor
Proceedings of the 18th BioNLP Workshop and Shared Task
Chemical patents are an important resource for chemical information. However, few chemical Named Entity Recognition (NER) systems have been evaluated on patent documents, due in part to their structural and linguistic complexity. In this paper, we explore the NER performance of a BiLSTM-CRF model utilising pre-trained word embeddings, character-level word representations and contextualized ELMo word representations for chemical patents. We compare word embeddings pre-trained on biomedical and chemical patent corpora. The effect of tokenizers optimized for the chemical domain on NER performance in chemical patents is also explored. The results on two patent corpora show that contextualized word representations generated from ELMo substantially improve chemical NER performance w.r.t. the current state-of-the-art. We also show that domain-specific resources such as word embeddings trained on chemical patents and chemical-specific tokenizers, have a positive impact on NER performance.
2017
Tagging Funding Agencies and Grants in Scientific Articles using Sequential Learning Models
Subhradeep Kayal | Zubair Afzal | George Tsatsaronis | Sophia Katrenko | Pascal Coupet | Marius Doornenbal | Michelle Gregory
Proceedings of the 16th BioNLP Workshop
Subhradeep Kayal | Zubair Afzal | George Tsatsaronis | Sophia Katrenko | Pascal Coupet | Marius Doornenbal | Michelle Gregory
Proceedings of the 16th BioNLP Workshop
In this paper we present a solution for tagging funding bodies and grants in scientific articles using a combination of trained sequential learning models, namely conditional random fields (CRF), hidden markov models (HMM) and maximum entropy models (MaxEnt), on a benchmark set created in-house. We apply the trained models to address the BioASQ challenge 5c, which is a newly introduced task that aims to solve the problem of funding information extraction from scientific articles. Results in the dry-run data set of BioASQ task 5c show that the suggested approach can achieve a micro-recall of more than 85% in tagging both funding bodies and grants.
2007
PNNL: A Supervised Maximum Entropy Approach to Word Sense Disambiguation
Stephen Tratz | Antonio Sanfilippo | Michelle Gregory | Alan Chappell | Christian Posse | Paul Whitney
Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)
Stephen Tratz | Antonio Sanfilippo | Michelle Gregory | Alan Chappell | Christian Posse | Paul Whitney
Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)
2006
Word Domain Disambiguation via Word Sense Disambiguation
Antonio Sanfilippo | Stephen Tratz | Michelle Gregory
Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Antonio Sanfilippo | Stephen Tratz | Michelle Gregory
Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
User-directed Sentiment Analysis: Visualizing the Affective Content of Documents
Michelle L. Gregory | Nancy Chinchor | Paul Whitney | Richard Carter | Elizabeth Hetzler | Alan Turner
Proceedings of the Workshop on Sentiment and Subjectivity in Text
Michelle L. Gregory | Nancy Chinchor | Paul Whitney | Richard Carter | Elizabeth Hetzler | Alan Turner
Proceedings of the Workshop on Sentiment and Subjectivity in Text
Integrating Ontological Knowledge and Textual Evidence in Estimating Gene and Gene Product Similarity
Antonio Sanfilippo | Christian Posse | Banu Gopalan | Stephen Tratz | Michelle Gregory
Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology
Antonio Sanfilippo | Christian Posse | Banu Gopalan | Stephen Tratz | Michelle Gregory
Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology
ChAT: A Time-Linked System for Conversational Analysis
Michelle L. Gregory | Douglas Love | Stuart Rose | Anne Schur
Proceedings of the Analyzing Conversations in Text and Speech
Michelle L. Gregory | Douglas Love | Stuart Rose | Anne Schur
Proceedings of the Analyzing Conversations in Text and Speech
2005
Bridging the Gap between Technology and Users: Leveraging Machine
Thomas Hoeft | Nick Cramer | M. L. Gregory | Elizabeth Hetzler
Proceedings of HLT/EMNLP 2005 Interactive Demonstrations
Thomas Hoeft | Nick Cramer | M. L. Gregory | Elizabeth Hetzler
Proceedings of HLT/EMNLP 2005 Interactive Demonstrations
2004
Search
Fix author
Co-authors
- Antonio Sanfilippo 3
- Stephen Tratz 3
- Elizabeth Hetzler 2
- Christian Posse 2
- Paul Whitney 2
- Zubair Afzal 1
- Saber Akhondi 1
- Yasemin Altun 1
- Richard Carter 1
- Alan Chappell 1
- Eugene Charniak 1
- Nancy Chinchor 1
- Trevor Cohn 1
- Pascal Coupet 1
- Nick Cramer 1
- Marius Doornenbal 1
- Christian Druckenbrodt 1
- Banu Gopalan 1
- Thomas Hoeft 1
- Mark Johnson 1
- Sophia Katrenko 1
- Subhradeep Kayal 1
- Douglas Love 1
- Dat Quoc Nguyen 1
- Stuart Rose 1
- Anne Schur 1
- Camilo Thorne 1
- George Tsatsaronis 1
- Alan Turner 1
- Karin Verspoor 1
- Zenan Zhai 1