Bridging Attribution and Open-Set Detection using Graph-Augmented Instance Learning in Synthetic Speech

Mohd Mujtaba Akhtar, Girish, Farhan Sheth, Muskaan Singh


Abstract
We propose a unified framework that both attributes synthetic speech to its source and detects speech generated by synthesizers not encountered during training. This requires methods that move beyond simple detection to support both detailed forensic analysis and open-set generalization. To address this, we introduce SIGNAL, a hybrid framework that combines speech foundation models (SFMs) with graph-based modeling and open-set-aware inference. The framework integrates Graph Neural Networks (GNNs) and a k-Nearest Neighbor (KNN) classifier, allowing it to capture meaningful relationships between utterances and to recognize speech that does not belong to any known generator. It constructs a query-conditioned graph over generator class prototypes, enabling the GNN to reason over relationships among candidate generators, while the KNN branch supports open-set detection via confidence-based thresholding. We evaluate SIGNAL on the DiffSSD dataset, which offers a diverse mix of real speech and synthetic audio from both open-source and commercial diffusion-based TTS systems. To further assess generalization, we also test on the SingFake benchmark. Our results show that SIGNAL consistently improves performance on both tasks, with Mamba-based embeddings delivering especially strong results. To the best of our knowledge, this is the first study to unify graph-based learning and open-set detection for tracing synthetic speech back to its origin.
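The confidence-based thresholding mentioned for the KNN branch can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `knn_open_set_predict`, the `UNKNOWN` label, the confidence definition (similarity-weighted vote of the top-k neighbors), and the values `k=5` and `threshold=0.6` are all assumptions made for illustration, and the toy vectors stand in for SFM embeddings. The GNN branch is omitted entirely.

```python
import math
import random

UNKNOWN = -1  # hypothetical label for "generator not seen during training"


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def knn_open_set_predict(query, train_embs, train_labels, k=5, threshold=0.6):
    """Return (label, confidence); label is UNKNOWN when confidence < threshold.

    Confidence here is an assumed definition: the summed cosine similarity of
    the top-k neighbors that share the winning label, divided by k.
    """
    sims = [cosine(query, emb) for emb in train_embs]
    # Indices of the k training embeddings most similar to the query.
    nn_idx = sorted(range(len(sims)), key=lambda i: sims[i], reverse=True)[:k]
    # Similarity-weighted vote per generator label.
    scores = {}
    for i in nn_idx:
        scores[train_labels[i]] = scores.get(train_labels[i], 0.0) + sims[i]
    best = max(scores, key=scores.get)
    confidence = scores[best] / k
    # Reject low-confidence queries as coming from an unseen generator.
    return (best if confidence >= threshold else UNKNOWN), confidence


def toy_cluster(axis, n, dim=8, noise=0.05, seed=0):
    """Toy embeddings clustered around one coordinate axis (illustrative data)."""
    rng = random.Random(seed)
    return [
        [(1.0 if d == axis else 0.0) + rng.gauss(0.0, noise) for d in range(dim)]
        for _ in range(n)
    ]


# Two known generators (labels 0 and 1) and two queries.
train_embs = toy_cluster(0, 10, seed=1) + toy_cluster(1, 10, seed=2)
train_labels = [0] * 10 + [1] * 10
seen_query = toy_cluster(0, 1, seed=3)[0]    # resembles generator 0
unseen_query = toy_cluster(2, 1, seed=4)[0]  # resembles neither generator

print(knn_open_set_predict(seen_query, train_embs, train_labels))
print(knn_open_set_predict(unseen_query, train_embs, train_labels))
```

In this sketch a query close to a known cluster receives that generator's label, while a query far from every cluster falls below the threshold and is rejected. In practice the threshold would presumably be tuned on held-out data rather than fixed by hand.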
Anthology ID:
2026.eacl-long.250
Volume:
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:
EACL
Publisher:
Association for Computational Linguistics
Pages:
5404–5413
URL:
https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.250/
Cite (ACL):
Mohd Mujtaba Akhtar, Girish, Farhan Sheth, and Muskaan Singh. 2026. Bridging Attribution and Open-Set Detection using Graph-Augmented Instance Learning in Synthetic Speech. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5404–5413, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Bridging Attribution and Open-Set Detection using Graph-Augmented Instance Learning in Synthetic Speech (Akhtar et al., EACL 2026)
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.eacl-long.250.pdf