Robert Munro
2020
Detecting Independent Pronoun Bias with Partially-Synthetic Data Generation
Robert Munro | Alex (Carmen) Morrison
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Robert Munro | Alex (Carmen) Morrison
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
We report that state-of-the-art parsers consistently failed to identify “hers” and “theirs” as pronouns but identified the masculine equivalent “his”. We find that the same biases exist in recent language models like BERT. While some of the bias comes from known sources, like training data with gender imbalances, we find that the bias is _amplified_ in the language models and that linguistic differences between English pronouns that are not inherently biased can become biases in some machine learning models. We introduce a new technique for measuring bias in models, using Bayesian approximations to generate partially-synthetic data from the model itself.
Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020
Karin Verspoor | Kevin Bretonnel Cohen | Mark Dredze | Emilio Ferrara | Jonathan May | Robert Munro | Cecile Paris | Byron Wallace
Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020
Karin Verspoor | Kevin Bretonnel Cohen | Mark Dredze | Emilio Ferrara | Jonathan May | Robert Munro | Cecile Paris | Byron Wallace
Proceedings of the 1st Workshop on NLP for COVID-19 at ACL 2020
2012
Accurate Unsupervised Joint Named-Entity Extraction from Unaligned Parallel Text
Robert Munro | Christopher D. Manning
Proceedings of the 4th Named Entity Workshop (NEWS) 2012
Robert Munro | Christopher D. Manning
Proceedings of the 4th Named Entity Workshop (NEWS) 2012
2011
Subword and Spatiotemporal Models for Identifying Actionable Information in Haitian Kreyol
Robert Munro
Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Robert Munro
Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Crisis MT: Developing A Cookbook for MT in Crisis Situations
William Lewis | Robert Munro | Stephan Vogel
Proceedings of the Sixth Workshop on Statistical Machine Translation
William Lewis | Robert Munro | Stephan Vogel
Proceedings of the Sixth Workshop on Statistical Machine Translation
2010
Crowdsourced translation for emergency response in Haiti: the global collaboration of local knowledge
Robert Munro
Proceedings of the Workshop on Collaborative Translation: technology, crowdsourcing, and the translator perspective
Robert Munro
Proceedings of the Workshop on Collaborative Translation: technology, crowdsourcing, and the translator perspective
In the wake of the January 12 earthquake in Haiti it quickly became clear that the existing emergency response services had failed but text messages were still getting through. A number of people quickly came together to establish a text-message based emergency reporting system. There was one hurdle: the majority of the messages were in Haitian Kreyol, which for the most part was not understood by the primary emergency responders, the US Military. We therefore crowdsourced the translation of messages, allowing volunteers from within the Haitian Kreyol and French-speaking communities to translate, categorize and geolocate the messages in real-time. Collaborating online, they employed their local knowledge of locations, regional slang, abbreviations and spelling variants to process more than 40,000 messages in the first six weeks alone. According the responders this saved hundreds of lives and helped direct the first food and aid to tens of thousands. The average turn-around from a message arriving in Kreyol to it being translated, categorized, geolocated and streamed back to the responders was 10 minutes. Collaboration among translators was crucial for data-quality, motivation and community contacts, enabling richer value-adding in the translation than would have been possible from any one person.
Subword Variation in Text Message Classification
Robert Munro | Christopher D. Manning
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Robert Munro | Christopher D. Manning
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Crowdsourcing and language studies: the new generation of linguistic data
Robert Munro | Steven Bethard | Victor Kuperman | Vicky Tzuyin Lai | Robin Melnick | Christopher Potts | Tyler Schnoebelen | Harry Tily
Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk
Robert Munro | Steven Bethard | Victor Kuperman | Vicky Tzuyin Lai | Robin Melnick | Christopher Potts | Tyler Schnoebelen | Harry Tily
Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk
2003
A queueing-theory model of word frequency distributions
Robert Munro
Proceedings of the Australasian Language Technology Workshop 2003
Robert Munro
Proceedings of the Australasian Language Technology Workshop 2003
Meta-Learning Orthographic and Contextual Models for Language Independent Named Entity Recognition
Robert Munro | Daren Ler | Jon Patrick
Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003
Robert Munro | Daren Ler | Jon Patrick
Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003