Siang-Ting Lin
2026
NTNU-SMIL at SemEval-2026 Task 3: Logistic-Loss Regression with Same-Language Transfer for Valence–Arousal Stance Prediction in Dimensional Stance Analysis (DimStance)
Siang-Ting Lin | Tien-Hong Lo | Yun-Ting Sun | Jhih-Rong Guo | Tung-Yen Hao | Fong-Chun Tsai | Berlin Chen
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
Siang-Ting Lin | Tien-Hong Lo | Yun-Ting Sun | Jhih-Rong Guo | Tung-Yen Hao | Fong-Chun Tsai | Berlin Chen
Proceedings of the 20th International Workshop on Semantic Evaluation (2026)
We propose NTNU-SMIL’s system for SemEval-2026 Task 3 Track B Subtask 1 Dimensional Stance Analysis (DimStance). Our approach models target-conditioned valence–arousal regression using sentence-pair encoding, dual regression heads, and a logistic-loss regression formulation. For English and Chinese, we further leverage same-language transfer from Track A and apply lightweight out-of-fold calibration with multi-seed ensembling to reduce cross-lingual scale mismatch. Post-hoc analysis shows that same-language transfer and logistic-loss regression are the main drivers of performance gains, while arousal variance collapse remains a challenge in low-resource settings such as Swahili.
2025
A Channel-Aware Anomaly-Guided Data Augmentation Framework for the FSR-2025 Hakka Speech Recognition Challenge
Siang-Ting Lin | Arthur Hao | Chiun-Yu Hua | Kuan-Tang Huang | Berlin Chen
Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025)
Siang-Ting Lin | Arthur Hao | Chiun-Yu Hua | Kuan-Tang Huang | Berlin Chen
Proceedings of the 37th Conference on Computational Linguistics and Speech Processing (ROCLING 2025)
The Formosa Speech Recognition Challenge 2025 (FSR-2025) focuses on Taiwanese Hakka, a low-resource language with limited data diversity and channel coverage. To address this challenge, we propose a channel-aware, data-centric framework that leverages multilingual foundation models to mitigate mismatches between field recordings and training data. Our method integrates unsupervised anomaly detection and channel-conditioned augmentation to enhance data representativeness before ASR fine-tuning, aiming to explore the potential for improving robustness in low-resource Hakka speech recognition.