Svetla Peneva Koeva
2026
Bulgarian Massive Multitask Language Understanding Benchmark
Svetla Peneva Koeva | Ivelina Stoyanova | Dimiter Georgiev | Svetlozara Leseva | Valentina Stefanova | Maria Todorova | Tsvetana Ivanova Dimitrova | Hristina Kukova | Mihaela Moskova | Tinko Tinchev
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Svetla Peneva Koeva | Ivelina Stoyanova | Dimiter Georgiev | Svetlozara Leseva | Valentina Stefanova | Maria Todorova | Tsvetana Ivanova Dimitrova | Hristina Kukova | Mihaela Moskova | Tinko Tinchev
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Assessing the broad general knowledge of Large Language Models (LLMs) across multiple domains in Bulgarian remains challenging due to the limited availability of Bulgarian evaluation benchmarks. To address this gap, we introduce the Bulgarian Massive Multitask Language Understanding benchmark (MMLU-BG), designed to evaluate whether LLMs possess generalised knowledge capabilities beyond simple text prediction in Bulgarian. This paper presents the structure, the development protocol, and the size of the MMLU-BG benchmark. It is tested in comparison with the original MMLU for English across seven LLMs selected according to specific criteria. The experiments demonstrate that the MMLU-BG benchmark assesses multi-domain versatility and highlights the models’ strengths and weaknesses across different subject areas.
2025
IfGPT: A Dataset in Bulgarian for Large Language Models
Svetla Peneva Koeva | Ivelina Stoyanova | Jordan Konstantinov Kralev
Proceedings of the First Workshop on Advancing NLP for Low-Resource Languages
Svetla Peneva Koeva | Ivelina Stoyanova | Jordan Konstantinov Kralev
Proceedings of the First Workshop on Advancing NLP for Low-Resource Languages
The paper presents the large dataset IfGPT, which contains available corpora and datasets for Bulgarian, and describes methods to continuously expand it with unduplicated and unbiased Bulgarian data. The samples in the dataset are annotated with metadata that enable effective extraction of domain- and application-oriented datasets for fine-tuning or Retrieval Augmented Generation (RAG) of large language models (LLMs). The paper focuses on the description of the extended metadata of the IfGPT dataset and its management in a graph database.
2023
XL-WA: a Gold Evaluation Benchmark for Word Alignment in 14 Language Pairs
Federico Martelli | Andrei Stefan Bejgu | Cesare Campagnano | Jaka Čibej | Rute Costa | Apolonija Gantar | Jelena Kallas | Svetla Peneva Koeva | Kristina Koppel | Simon Krek | Margit Langemets | Veronika Lipp | Sanni Nimb | Sussi Olsen | Bolette Sanford Pedersen | Valeria Quochi | Ana Salgado | László Simon | Carole Tiberius | Rafael-J Ureña-Ruiz | Roberto Navigli
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
Federico Martelli | Andrei Stefan Bejgu | Cesare Campagnano | Jaka Čibej | Rute Costa | Apolonija Gantar | Jelena Kallas | Svetla Peneva Koeva | Kristina Koppel | Simon Krek | Margit Langemets | Veronika Lipp | Sanni Nimb | Sussi Olsen | Bolette Sanford Pedersen | Valeria Quochi | Ana Salgado | László Simon | Carole Tiberius | Rafael-J Ureña-Ruiz | Roberto Navigli
Proceedings of the Ninth Italian Conference on Computational Linguistics (CLiC-it 2023)
Search
Fix author
Co-authors
- Ivelina Stoyanova 2
- Andrei Stefan Bejgu 1
- Cesare Campagnano 1
- Rute Costa 1
- Tsvetana Ivanova Dimitrova 1
- Apolonija Gantar 1
- Dimiter Georgiev 1
- Jelena Kallas 1
- Kristina Koppel 1
- Jordan Konstantinov Kralev 1
- Simon Krek 1
- Hristina Kukova 1
- Margit Langemets 1
- Svetlozara Leseva 1
- Veronika Lipp 1
- Federico Martelli 1
- Mihaela Moskova 1
- Roberto Navigli 1
- Sanni Nimb 1
- Sussi Olsen 1
- Valeria Quochi 1
- Ana Salgado 1
- Bolette Sanford Pedersen 1
- László Simon 1
- Valentina Stefanova 1
- Carole Tiberius 1
- Tinko Tinchev 1
- Maria Todorova 1
- Rafael-J. Ureña-Ruiz 1
- Jaka Čibej 1