Braxen 1.0

Christina Tånnander, Jens Edlund


Abstract
With this paper, we release a Swedish pronunciation lexicon resource, Braxen 1.0, which is the result of almost 20 years of development carried out at the Swedish Agency for Accessible Media (MTM). The lexicon originated as a basic word list, but has been continuously expanded with new entries, mainly acquired from university textbooks and news text. Braxen consists of around 850 000 entries, of which around 150 000 are proper names. The lexicon is released under the CC BY 4.0 license and is accessible for public use.
Anthology ID:
2025.nodalida-1.71
Volume:
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)
Month:
March
Year:
2025
Address:
Tallinn, Estonia
Editors:
Richard Johansson, Sara Stymne
Venue:
NoDaLiDa
Publisher:
University of Tartu Library
Pages:
709–713
URL:
https://preview.aclanthology.org/fix-sig-urls/2025.nodalida-1.71/
Cite (ACL):
Christina Tånnander and Jens Edlund. 2025. Braxen 1.0. In Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025), pages 709–713, Tallinn, Estonia. University of Tartu Library.
Cite (Informal):
Braxen 1.0 (Tånnander & Edlund, NoDaLiDa 2025)
PDF:
https://preview.aclanthology.org/fix-sig-urls/2025.nodalida-1.71.pdf