Svitlana Vyetrenko
2026
Time-RA: Towards Time Series Reasoning for Anomaly Diagnosis with LLM Feedback
Yiyuan Yang | Zichuan Liu | Lei Song | Kai Ying | Stephen Wang | Joshua Thomas Bamford | Svitlana Vyetrenko | Jiang Bian | Qingsong Wen
Findings of the Association for Computational Linguistics: ACL 2026
Time series anomaly detection (TSAD) has traditionally focused on binary classification and often lacks the fine-grained categorization and explanatory reasoning required for transparent decision-making. To address these limitations, we propose Time-series Reasoning for Anomaly (Time-RA), a novel task that reformulates TSAD from a discriminative into a generative, reasoning-intensive paradigm. To facilitate this, we introduce RATs40K, the first real-world large-scale multimodal benchmark with ~40,000 samples across 10 domains, integrating raw time series, textual context, and visual plots with structured reasoning annotations. Extensive benchmarking shows that while supervised fine-tuning and visual representations boost diagnostic accuracy and reasoning consistency, performance varies across complex scenarios. Notably, fine-tuned models demonstrate strong "plug-and-play" transferability, outperforming traditional baselines on unseen real-world datasets. Our work establishes a foundation for interpretable, multimodal time series analysis. All code and the RATs40K dataset are fully open-sourced to facilitate future research.
2024
Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark
Elizabeth Fons | Rachneet Kaur | Soham Palande | Zhen Zeng | Tucker Balch | Manuela Veloso | Svitlana Vyetrenko
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Large Language Models (LLMs) offer the potential for automatic time series analysis and reporting, which is a critical task across many domains, including healthcare, finance, climate, and energy. In this paper, we propose a framework for rigorously evaluating the capabilities of LLMs on time series understanding, encompassing both univariate and multivariate forms. We introduce a comprehensive taxonomy of time series features, a critical framework that delineates various characteristics inherent in time series data. Leveraging this taxonomy, we have systematically designed and synthesized a diverse dataset of time series, embodying the different outlined features, each accompanied by textual descriptions. This dataset acts as a solid foundation for assessing the proficiency of LLMs in comprehending time series. Our experiments shed light on the strengths and limitations of state-of-the-art LLMs in time series understanding, revealing which features these models readily comprehend and where they falter. In addition, we uncover the sensitivity of LLMs to factors including the formatting of the data, the position of points queried within a series, and the overall time series length.