Ethan Selfridge
Also published as: Ethan O. Selfridge
2024
Two-tiered Encoder-based Hallucination Detection for Retrieval-Augmented Generation in the Wild
Ilana Zimmerman | Jadin Tredup | Ethan Selfridge | Joseph Bradley
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track
Ilana Zimmerman | Jadin Tredup | Ethan Selfridge | Joseph Bradley
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track
Detecting hallucinations, where Large Language Models (LLMs) are not factually consistent with a Knowledge Base (KB), is a challenge for Retrieval-Augmented Generation (RAG) systems. Current solutions rely on public datasets to develop prompts or fine-tune a Natural Language Inference (NLI) model. However, these approaches are not focused on developing an enterprise RAG system; they do not consider latency, train or evaluate on production data, nor do they handle non-verifiable statements such as small talk or questions. To address this, we leverage the customer service conversation data of four large brands to evaluate existing solutions and propose a set of small encoder models trained on a new dataset. We find the proposed models to outperform existing methods and highlight the value of combining a small amount of in-domain data with public datasets.
2023
The economic trade-offs of large language models: A case study
Kristen Howell | Gwen Christian | Pavel Fomitchov | Gitit Kehat | Julianne Marzulla | Leanne Rolston | Jadin Tredup | Ilana Zimmerman | Ethan Selfridge | Joseph Bradley
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track)
Kristen Howell | Gwen Christian | Pavel Fomitchov | Gitit Kehat | Julianne Marzulla | Leanne Rolston | Jadin Tredup | Ilana Zimmerman | Ethan Selfridge | Joseph Bradley
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track)
Contacting customer service via chat is a common practice. Because employing customer service agents is expensive, many companies are turning to NLP that assists human agents by auto-generating responses that can be used directly or with modifications. With their ability to handle large context windows, Large Language Models (LLMs) are a natural fit for this use case. However, their efficacy must be balanced with the cost of training and serving them. This paper assesses the practical cost and impact of LLMs for the enterprise as a function of the usefulness of the responses that they generate. We present a cost framework for evaluating an NLP model’s utility for this use case and apply it to a single brand as a case study in the context of an existing agent assistance product. We compare three strategies for specializing an LLM — prompt engineering, fine-tuning, and knowledge distillation — using feedback from the brand’s customer service agents. We find that the usability of a model’s responses can make up for a large difference in inference cost for our case study brand, and we extrapolate our findings to the broader enterprise space.
2014
MVA: The Multimodal Virtual Assistant
Michael Johnston | John Chen | Patrick Ehlen | Hyuckchul Jung | Jay Lieske | Aarthi Reddy | Ethan Selfridge | Svetlana Stoyanchev | Brant Vasilieff | Jay Wilpon
Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL)
Michael Johnston | John Chen | Patrick Ehlen | Hyuckchul Jung | Jay Lieske | Aarthi Reddy | Ethan Selfridge | Svetlana Stoyanchev | Brant Vasilieff | Jay Wilpon
Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL)
2013
Continuously Predicting and Processing Barge-in During a Live Spoken Dialogue Task
Ethan Selfridge | Iker Arizmendi | Peter Heeman | Jason Williams
Proceedings of the SIGDIAL 2013 Conference
Ethan Selfridge | Iker Arizmendi | Peter Heeman | Jason Williams
Proceedings of the SIGDIAL 2013 Conference
2012
A Temporal Simulator for Developing Turn-Taking Methods for Spoken Dialogue Systems
Ethan O. Selfridge | Peter A. Heeman
Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Ethan O. Selfridge | Peter A. Heeman
Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Integrating Incremental Speech Recognition and POMDP-Based Dialogue Systems
Ethan O. Selfridge | Iker Arizmendi | Peter A. Heeman | Jason D. Williams
Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Ethan O. Selfridge | Iker Arizmendi | Peter A. Heeman | Jason D. Williams
Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue
2011
Proceedings of the ACL 2011 Student Session
Sasa Petrovic | Ethan Selfridge | Emily Pitler | Miles Osborne | Thamar Solorio
Proceedings of the ACL 2011 Student Session
Sasa Petrovic | Ethan Selfridge | Emily Pitler | Miles Osborne | Thamar Solorio
Proceedings of the ACL 2011 Student Session
Stability and Accuracy in Incremental Speech Recognition
Ethan Selfridge | Iker Arizmendi | Peter Heeman | Jason Williams
Proceedings of the SIGDIAL 2011 Conference
Ethan Selfridge | Iker Arizmendi | Peter Heeman | Jason Williams
Proceedings of the SIGDIAL 2011 Conference
2010
Search
Fix author
Co-authors
- Peter A. Heeman 6
- Iker Arizmendi 3
- Jason D. Williams 3
- Joseph Bradley 2
- Jadin Tredup 2
- Ilana Zimmerman 2
- Lois M. Black 1
- John Chen 1
- Gwen Christian 1
- Patrick Ehlen 1
- Pavel Fomitchov 1
- Kristen Howell 1
- Michael Johnston 1
- Hyuckchul Jung 1
- Gitit Kehat 1
- Jay Lieske 1
- Rebecca Lunsford 1
- Julianne Marzulla 1
- Miles Osborne 1
- Saša Petrović 1
- Emily Pitler 1
- Aarthi Reddy 1
- Leanne Rolston 1
- Thamar Solorio 1
- Svetlana Stoyanchev 1
- Brant Vasilieff 1
- Jay Wilpon 1
- Jan van Santen 1