Andrew Rabinovich


2025

pdf bib
DispatchQA: A Benchmark for Small Function Calling Language Models in E-Commerce Applications
Joachim Daiber | Victor Maricato | Ayan Sinha | Andrew Rabinovich
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track

We introduce DispatchQA, a benchmark to evaluate how well small language models (SLMs) translate open‐ended search queries into executable API calls via explicit function calling. Our benchmark focuses on the latency-sensitive e-commerce setting and measures SLMs’ impact on both search relevance and search latency. We provide strong, replicable baselines based on Llama 3.1 8B Instruct fine-tuned on synthetically generated data and find that fine-tuned SLMs produce search quality comparable or better than large language models such as GPT-4o while achieving up to 3× faster inference. All data, code, and training checkpoints are publicly released to spur further research on resource‐efficient query understanding.

2015

pdf bib
What’s Cookin’? Interpreting Cooking Videos using Text, Speech and Vision
Jonathan Malmaud | Jonathan Huang | Vivek Rathod | Nicholas Johnston | Andrew Rabinovich | Kevin Murphy
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies