A Channel-Aware Anomaly-Guided Data Augmentation Framework for the FSR-2025 Hakka Speech Recognition Challenge
Siang-Ting Lin, Arthur Hao, Chiun-Yu Hua, Kuan-Tang Huang, Berlin Chen
Abstract
The Formosa Speech Recognition Challenge 2025 (FSR-2025) focuses on Taiwanese Hakka, a low-resource language with limited data diversity and channel coverage. To address this challenge, we propose a channel-aware, data-centric framework that leverages multilingual foundation models to mitigate mismatches between field recordings and training data. Our method integrates unsupervised anomaly detection and channel-conditioned augmentation to enhance data representativeness before ASR fine-tuning, aiming to explore the potential for improving robustness in low-resource Hakka speech recognition.- Anthology ID:
- 2025.rocling-main.60
- Volume:
- Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025)
- Month:
- November
- Year:
- 2025
- Address:
- National Taiwan University, Taipei City, Taiwan
- Editors:
- Kai-Wei Chang, Ke-Han Lu, Chih-Kai Yang, Zhi-Rui Tam, Wen-Yu Chang, Chung-Che Wang
- Venue:
- ROCLING
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 499–503
- Language:
- URL:
- https://preview.aclanthology.org/dashboard/2025.rocling-main.60/
- DOI:
- Cite (ACL):
- Siang-Ting Lin, Arthur Hao, Chiun-Yu Hua, Kuan-Tang Huang, and Berlin Chen. 2025. A Channel-Aware Anomaly-Guided Data Augmentation Framework for the FSR-2025 Hakka Speech Recognition Challenge. In Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025), pages 499–503, National Taiwan University, Taipei City, Taiwan. Association for Computational Linguistics.
- Cite (Informal):
- A Channel-Aware Anomaly-Guided Data Augmentation Framework for the FSR-2025 Hakka Speech Recognition Challenge (Lin et al., ROCLING 2025)
- PDF:
- https://preview.aclanthology.org/dashboard/2025.rocling-main.60.pdf