Software for Automatic Speech Recognition via Whisper models applied to Oral History interviews in the Portuguese language

Edgleide de Oliveira Clemente da Silva; Fernando Rezende Zagatti; Filipe Loyola Lopes; Anderson Dias Duarte; Rodrigo Bonacin; Angela Maria Alves

Software for Automatic Speech Recognition via Whisper models applied to Oral History interviews in the Portuguese language

Edgleide de Oliveira Clemente da Silva, Fernando Rezende Zagatti, Filipe Loyola Lopes, Anderson Dias Duarte, Rodrigo Bonacin, Angela Maria Alves

Abstract

This paper presents Ethos AT, a desktop software for automatic transcription that uses OpenAI Whisper models, enabling local processing and ensuring data privacy and accessibility for users who are not necessarily programming experts, such as oral history researchers. A comparative analysis of six Whisper models (small, medium, large, large-v2, large-v3, and turbo) was conducted to analyze performance in terms of transcription accuracy, error types, and processing time. Results indicate that larger models achieve higher lexical accuracy, while smaller ones provide faster execution with acceptable quality for general use; the turbo model showed an effective balance between accuracy and speed. Overall, Ethos AT offers a secure, efficient, and user-friendly solution for academic and institutional contexts.

Anthology ID:: 2026.propor-1.30
Volume:: Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1
Month:: April
Year:: 2026
Address:: Salvador, Brazil
Editors:: Marlo Souza, Iria de-Dios-Flores, Diana Santos, Larissa Freitas, Jackson Wilke da Cruz Souza, Eugénio Ribeiro
Venue:: PROPOR
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 302–310
Language:
URL:: https://preview.aclanthology.org/ingest-dnd/2026.propor-1.30/
DOI:
Bibkey:
Cite (ACL):: Edgleide de Oliveira Clemente da Silva, Fernando Rezende Zagatti, Filipe Loyola Lopes, Anderson Dias Duarte, Rodrigo Bonacin, and Angela Maria Alves. 2026. Software for Automatic Speech Recognition via Whisper models applied to Oral History interviews in the Portuguese language. In Proceedings of the 17th International Conference on Computational Processing of Portuguese (PROPOR 2026) - Vol. 1, pages 302–310, Salvador, Brazil. Association for Computational Linguistics.
Cite (Informal):: Software for Automatic Speech Recognition via Whisper models applied to Oral History interviews in the Portuguese language (Silva et al., PROPOR 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-dnd/2026.propor-1.30.pdf

PDF Cite Search Fix data