Jordi Adell
2012
BUCEADOR, a multi-language search engine for digital libraries
Jordi Adell
|
Antonio Bonafonte
|
Antonio Cardenal
|
Marta R. Costa-Jussà
|
José A. R. Fonollosa
|
Asunción Moreno
|
Eva Navas
|
Eduardo R. Banga
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
This paper presents a web-based multimedia search engine built within the Buceador (www.buceador.org) research project. A proof-of-concept tool has been implemented which is able to retrieve information from a digital library made of multimedia documents in the 4 official languages in Spain (Spanish, Basque, Catalan and Galician). The retrieved documents are presented in the user language after translation and dubbing (the four previous languages + English). The paper presents the tool functionality, the architecture, the digital library and provide some information about the technology involved in the fields of automatic speech recognition, statistical machine translation, text-to-speech synthesis and information retrieval. Each technology has been adapted to the purposes of the presented tool as well as to interact with the rest of the technologies involved.
2008
Corpus and Voices for Catalan Speech Synthesis
Antonio Bonafonte
|
Jordi Adell
|
Ignasi Esquerra
|
Silvia Gallego
|
Asunción Moreno
|
Javier Pérez
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
In this paper we describe the design and production of Catalan database for building synthetic voices. Two speakers, with 10 hours per speaker, have recorded 10 hours of speech. The speaker selection and the corpus design aim to provide resources for high quality synthesis. The resources have been used to build voices for the Festival TTS. Both the original recordings and the Festival databases are freely available for research and for commertial use.