Jeff Bilmes
Also published as: Jeff A. Bilmes
2025
OMNIGUARD: An Efficient Approach for AI Safety Moderation Across Languages and Modalities
Sahil Verma | Keegan Hines | Jeff Bilmes | Charlotte Siska | Luke Zettlemoyer | Hila Gonen | Chandan Singh
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
The emerging capabilities of large language models (LLMs) have sparked concerns about their immediate potential for harmful misuse. The core approach to mitigate these concerns is the detection of harmful queries to the model. Current detection approaches are fallible, and are particularly susceptible to attacks that exploit mismatched generalization of model capabilities (e.g., prompts in low-resource languages or prompts provided in non-text modalities such as image and audio). To tackle this challenge, we propose OMNIGUARD, an approach for detecting harmful prompts across languages and modalities. Our approach (i) identifies internal representations of an LLM/MLLM that are aligned across languages or modalities and then (ii) uses them to build a language-agnostic or modality-agnostic classifier for detecting harmful prompts. OMNIGUARD improves harmful prompt classification accuracy by 11.57% over the strongest baseline in a multilingual setting, by 20.44% for image-based prompts, and sets a new SOTA for audio-based prompts. By repurposing embeddings computed during generation, OMNIGUARD is also very efficient (≈ 120× faster than the next fastest baseline). Code and data are available at https://github.com/vsahil/OmniGuard
2024
An End-to-End Submodular Framework for Data-Efficient In-Context Learning
Lilly Kumari | Shengjie Wang | Arnav Das | Tianyi Zhou | Jeff Bilmes
Findings of the Association for Computational Linguistics: NAACL 2024
An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models
Gantavya Bhatt | Yifang Chen | Arnav Das | Jifan Zhang | Sang Truong | Stephen Mussmann | Yinglun Zhu | Jeff Bilmes | Simon Du | Kevin Jamieson | Jordan Ash | Robert Nowak
Findings of the Association for Computational Linguistics: ACL 2024
Supervised finetuning (SFT) on instruction datasets has played a crucial role in achieving the remarkable zero-shot generalization capabilities observed in modern large language models (LLMs). However, the annotation efforts required to produce high quality responses for instructions are becoming prohibitively expensive, especially as the number of tasks spanned by instruction datasets continues to increase. Active learning is effective in identifying useful subsets of samples to annotate from an unlabeled pool, but its high computational cost remains a barrier to its widespread applicability in the context of LLMs. To mitigate the annotation cost of SFT and circumvent the computational bottlenecks of active learning, we propose using experimental design. Experimental design techniques select the most informative samples to label, and typically maximize some notion of uncertainty and/or diversity. In our work, we implement a framework that evaluates several existing and novel experimental design techniques and find that these methods consistently yield significant gains in label efficiency with little computational overhead. On generative tasks, to reach the same generalization performance, our methods save 50% of the annotation cost compared to random sampling.
2015
Summarization of Multi-Document Topic Hierarchies using Submodular Mixtures
Ramakrishna Bairi | Rishabh Iyer | Ganesh Ramakrishnan | Jeff Bilmes
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
2014
Submodularity for Data Selection in Machine Translation
Katrin Kirchhoff | Jeff Bilmes
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)
2013
Using Document Summarization Techniques for Speech Data Subset Selection
Kai Wei | Yuzong Liu | Katrin Kirchhoff | Jeff Bilmes
Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
2011
A Class of Submodular Functions for Document Summarization
Hui Lin | Jeff Bilmes
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
Word Alignment via Submodular Maximization over Matroids
Hui Lin | Jeff Bilmes
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
2010
Multi-document Summarization via Budgeted Maximization of Submodular Functions
Hui Lin | Jeff Bilmes
Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
2009
Compiling a Massive, Multilingual Dictionary via Probabilistic Inference
Mausam | Stephen Soderland | Oren Etzioni | Daniel Weld | Michael Skinner | Jeff Bilmes
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP
2008
Soft-Supervised Learning for Text Classification
Amarnag Subramanya | Jeff Bilmes
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing
2007
Generalized Graphical Abstractions for Statistical Machine Translation
Karim Filali | Jeff Bilmes
Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Virtual Evidence for Training Speech Recognizers Using Partially Labeled Data
Amarnag Subramanya | Jeff Bilmes
Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
2006
Proceedings of the Human Language Technology Conference of the NAACL, Main Conference
Robert C. Moore | Jeff Bilmes | Jennifer Chu-Carroll | Mark Sanderson
Proceedings of the Human Language Technology Conference of the NAACL, Main Conference
Backoff Model Training using Partially Observed Data: Application to Dialog Act Tagging
Gang Ji | Jeff Bilmes
Proceedings of the Human Language Technology Conference of the NAACL, Main Conference
Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Robert C. Moore | Jeff Bilmes | Jennifer Chu-Carroll | Mark Sanderson
Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Proceedings of the Workshop on Computationally Hard Problems and Joint Inference in Speech and Language Processing
Ryan McDonald | Charles Sutton | Hal Daumé III | Andrew McCallum | Fernando Pereira | Jeff Bilmes
Proceedings of the Workshop on Computationally Hard Problems and Joint Inference in Speech and Language Processing
2005
Part-of-Speech Tagging using Virtual Evidence and Negative Training
Sheila M. Reynolds | Jeff A. Bilmes
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing
The Vocal Joystick: A Voice-Based Human-Computer Interface for Individuals with Motor Impairments
Jeff A. Bilmes | Xiao Li | Jonathan Malkin | Kelley Kilanski | Richard Wright | Katrin Kirchhoff | Amar Subramanya | Susumu Harada | James Landay | Patricia Dowden | Howard Chizeck
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing
A Dynamic Bayesian Framework to Model Context and Memory in Edit Distance Learning: An Application to Pronunciation Classification
Karim Filali | Jeff Bilmes
Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05)
2003
Co-authors
- Katrin Kirchhoff 4
- Hui Lin 3
- Amarnag Subramanya 3
- Jennifer Chu-Carroll 2
- Arnav Das 2
- Karim Filali 2
- Robert C. Moore 2
- Mark Sanderson 2
- Mausam 1
- Jordan Ash 1
- Ramakrishna Bairi 1
- Gantavya Bhatt 1
- Yifang Chen 1
- Howard Chizeck 1
- Hal Daumé III 1
- Patricia Dowden 1
- Simon Du 1
- Oren Etzioni 1
- Hila Gonen 1
- Susumu Harada 1
- Keegan Hines 1
- Rishabh Iyer 1
- Kevin Jamieson 1
- Gang Ji 1
- Kelley Kilanski 1
- Lilly Kumari 1
- James Landay 1
- Xiao Li 1
- Yuzong Liu 1
- Jonathan Malkin 1
- Andrew McCallum 1
- Ryan McDonald 1
- Stephen Mussmann 1
- Robert Nowak 1
- Fernando Pereira 1
- Ganesh Ramakrishnan 1
- Sheila M. Reynolds 1
- Chandan Singh 1
- Charlotte Siska 1
- Michael Skinner 1
- Stephen Soderland 1
- Charles Sutton 1
- Sang Truong 1
- Sahil Verma 1
- Shengjie Wang 1
- Kai Wei 1
- Daniel S. Weld 1
- Richard Wright 1
- Luke Zettlemoyer 1
- Jifan Zhang 1
- Tianyi Zhou 1
- Yinglun Zhu 1