Abstract
Automatic speech recognition (ASR) has evolved from a pipeline architecture with pronunciation dictionaries, phonetic features and language models to the end-to-end systems performing a direct translation from a raw waveform into a word sequence. With the increase in accuracy and the availability of pre-trained models, the ASR systems are now omnipresent in our daily applications. On the other hand, the models’ interpretability and their computational cost have become more challenging, particularly when dealing with less-common languages or identifying regional variations of speakers. This research proposal will follow a four-stage process: 1) Proving an overview of acoustic features and feature extraction algorithms; 2) Exploring current ASR models, tools, and performance assessment techniques; 3) Aligning features with interpretable phonetic transcripts; and 4) Designing a prototype ARPOCA to increase awareness of regional language variation and improve models feedback by developing a semi-automatic acoustic features extraction using PRAAT in conjunction with phonetic transcription.- Anthology ID:
- 2022.acl-srw.28
- Volume:
- Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop
- Month:
- May
- Year:
- 2022
- Address:
- Dublin, Ireland
- Editors:
- Samuel Louvan, Andrea Madotto, Brielen Madureira
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 366–372
- Language:
- URL:
- https://aclanthology.org/2022.acl-srw.28
- DOI:
- 10.18653/v1/2022.acl-srw.28
- Cite (ACL):
- Megan Merz and Olga Scrivner. 2022. Discourse on ASR Measurement: Introducing the ARPOCA Assessment Tool. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, pages 366–372, Dublin, Ireland. Association for Computational Linguistics.
- Cite (Informal):
- Discourse on ASR Measurement: Introducing the ARPOCA Assessment Tool (Merz & Scrivner, ACL 2022)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/2022.acl-srw.28.pdf