Robert Munro

2020

Detecting Independent Pronoun Bias with Partially-Synthetic Data Generation
Robert Munro | Alex (Carmen) Morrison
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

We report that state-of-the-art parsers consistently failed to identify “hers” and “theirs” as pronouns but identified the masculine equivalent “his”. We find that the same biases exist in recent language models like BERT. While some of the bias comes from known sources, like training data with gender imbalances, we find that the bias is _amplified_ in the language models and that linguistic differences between English pronouns that are not inherently biased can become biases in some machine learning models. We introduce a new technique for measuring bias in models, using Bayesian approximations to generate partially-synthetic data from the model itself.

pdf bib

2012

pdf bib

Accurate Unsupervised Joint Named-Entity Extraction from Unaligned Parallel Text
Robert Munro | Christopher D. Manning
Proceedings of the 4th Named Entity Workshop (NEWS) 2012

2011

pdf bib

Subword and Spatiotemporal Models for Identifying Actionable Information in Haitian Kreyol
Robert Munro
Proceedings of the Fifteenth Conference on Computational Natural Language Learning

pdf bib

Crisis MT: Developing A Cookbook for MT in Crisis Situations
William Lewis | Robert Munro | Stephan Vogel
Proceedings of the Sixth Workshop on Statistical Machine Translation

2010

pdf bib abs

Crowdsourced translation for emergency response in Haiti: the global collaboration of local knowledge
Robert Munro
Proceedings of the Workshop on Collaborative Translation: technology, crowdsourcing, and the translator perspective

In the wake of the January 12 earthquake in Haiti it quickly became clear that the existing emergency response services had failed but text messages were still getting through. A number of people quickly came together to establish a text-message based emergency reporting system. There was one hurdle: the majority of the messages were in Haitian Kreyol, which for the most part was not understood by the primary emergency responders, the US Military. We therefore crowdsourced the translation of messages, allowing volunteers from within the Haitian Kreyol and French-speaking communities to translate, categorize and geolocate the messages in real-time. Collaborating online, they employed their local knowledge of locations, regional slang, abbreviations and spelling variants to process more than 40,000 messages in the first six weeks alone. According the responders this saved hundreds of lives and helped direct the first food and aid to tens of thousands. The average turn-around from a message arriving in Kreyol to it being translated, categorized, geolocated and streamed back to the responders was 10 minutes. Collaboration among translators was crucial for data-quality, motivation and community contacts, enabling richer value-adding in the translation than would have been possible from any one person.

pdf bib

Subword Variation in Text Message Classification
Robert Munro | Christopher D. Manning
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics

pdf bib

Robert Munro

2020

2012

2011

2010

2003

2002

Co-authors

Venues