Abstract
The paper introduces the Political Speech Corpus of Bulgarian. First, its current state is discussed with respect to its size, coverage, genre specification and related online services. Then the focus shifts to the annotation details: the layers of linguistic annotation are presented, and the compatibility with the CLARIN technical infrastructure is explained. Finally, some user-based scenarios are mentioned to demonstrate the corpus services and applicability.
- Anthology ID:
- L12-1569
- Volume:
- Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
- Month:
- May
- Year:
- 2012
- Address:
- Istanbul, Turkey
- Venue:
- LREC
- Publisher:
- European Language Resources Association (ELRA)
- Pages:
- 1744–1747
- URL:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/956_Paper.pdf
- Cite (ACL):
- Petya Osenova and Kiril Simov. 2012. The Political Speech Corpus of Bulgarian. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1744–1747, Istanbul, Turkey. European Language Resources Association (ELRA).
- Cite (Informal):
- The Political Speech Corpus of Bulgarian (Osenova & Simov, LREC 2012)