SUTAV: A Turkish Audio-Visual Database

Ibrahim Saygin Topkaya, Hakan Erdogan


Abstract
This paper contains information about the """"Sabanci University Turkish Audio-Visual (SUTAV)"""" database. The main aim of collecting SUTAV database was to obtain a large audio-visual collection of spoken words, numbers and sentences in Turkish language. The database was collected between 2006 and 2010 during """"Novel approaches in audio-visual speech recognition"""" project which is funded by The Scientific and Technological Research Council of Turkey (TUBITAK). First part of the database contains a large corpus of Turkish language and contains standart quality videos. The second part is relatively small compared to the first one and contains recordings of spoken digits in high quality videos. Although the main aim to collect SUTAV database was to obtain a database for audio-visual speech recognition applications, it also contains useful data that can be used in other kinds of multimodal research like biometric security and person verification. The paper presents information about the data collection process and the the spoken content. It also contains a sample evaluation protocol and recognition results that are obtained with a small portion of the database.
Anthology ID:
L12-1262
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2334–2337
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/483_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Ibrahim Saygin Topkaya and Hakan Erdogan. 2012. SUTAV: A Turkish Audio-Visual Database. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2334–2337, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
SUTAV: A Turkish Audio-Visual Database (Topkaya & Erdogan, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/483_Paper.pdf