Klim Peshkov


Proceedings of TALN 2014 (Volume 4: RECITAL - Student Research Workshop)
Núria Gala | Klim Peshkov | Brigitte Bigi

Segmentation evaluation metrics, a comparison grounded on prosodic and discourse units
Klim Peshkov | Laurent Prévot
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

Knowledge of evaluation metrics and of best practices for using them has improved rapidly in recent years (Fort et al., 2012). However, the advances mostly concern the evaluation of classification-related tasks. Segmentation tasks have received less attention, even though they are crucial in a large number of linguistic studies. A range of metrics is available (F-score on boundaries, F-score on units, WindowDiff (WD), Boundary Similarity (BS)), but it is still relatively difficult to interpret these metrics on various linguistic segmentation tasks, such as prosodic and discourse segmentation. In this paper, we take real segmented datasets (introduced in Peshkov et al. (2012)) as references and deteriorate them in different ways (random addition of boundaries, random removal of boundaries, introduction of near-miss errors). This provides us with various measures on controlled datasets and with an interesting benchmark for various linguistic segmentation tasks.
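
The procedure described in the abstract (deteriorate a reference segmentation, then score the result) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a segmentation is encoded as a list of 0/1 flags, where 1 marks a boundary after the corresponding unit. All function names are hypothetical, and the window size k is a free parameter here (it is commonly set to half the average reference segment length). Only the boundary F-score and WindowDiff are sketched; Boundary Similarity and the F-score on units are left out.

```python
import random


def boundary_f1(ref, hyp):
    """F-score on exact boundary positions (1 = boundary after that unit)."""
    tp = sum(1 for r, h in zip(ref, hyp) if r == 1 and h == 1)
    if tp == 0:
        return 0.0
    precision = tp / sum(hyp)
    recall = tp / sum(ref)
    return 2 * precision * recall / (precision + recall)


def window_diff(ref, hyp, k):
    """WindowDiff: share of length-k windows whose boundary counts differ."""
    if len(ref) != len(hyp):
        raise ValueError("segmentations must have the same length")
    n_windows = len(ref) - k + 1
    errors = sum(
        1 for i in range(n_windows)
        if sum(ref[i:i + k]) != sum(hyp[i:i + k])
    )
    return errors / n_windows


def add_boundaries(ref, n, rng):
    """Insert up to n spurious boundaries at random non-boundary positions."""
    out = list(ref)
    free = [i for i, b in enumerate(out) if b == 0]
    for i in rng.sample(free, min(n, len(free))):
        out[i] = 1
    return out


def remove_boundaries(ref, n, rng):
    """Delete up to n randomly chosen boundaries."""
    out = list(ref)
    placed = [i for i, b in enumerate(out) if b == 1]
    for i in rng.sample(placed, min(n, len(placed))):
        out[i] = 0
    return out


def shift_boundaries(ref, n, rng):
    """Near-miss errors: move up to n boundaries by one position."""
    out = list(ref)
    placed = [i for i, b in enumerate(out) if b == 1]
    rng.shuffle(placed)
    moved = 0
    for i in placed:
        if moved == n:
            break
        # Only move to an adjacent slot that is currently free.
        targets = [j for j in (i - 1, i + 1)
                   if 0 <= j < len(out) and out[j] == 0]
        if targets:
            out[i] = 0
            out[rng.choice(targets)] = 1
            moved += 1
    return out


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the degradation is repeatable
    reference = [0, 0, 1, 0, 0, 0, 1, 0, 0, 1, 0, 0]  # toy data, not from the paper
    for name, degrade in [("addition", add_boundaries),
                          ("removal", remove_boundaries),
                          ("near-miss", shift_boundaries)]:
        degraded = degrade(reference, 1, rng)
        print(name, boundary_f1(reference, degraded),
              window_diff(reference, degraded, k=2))
```

Running the three degradations at increasing error counts and tabulating the scores gives the kind of controlled comparison the abstract refers to. With this encoding, a near-miss counts as both a miss and a false alarm for the exact-match F-score, whereas WindowDiff only penalizes the windows that straddle the displaced boundary.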


A Quantitative Comparative Study of Prosodic and Discourse Units, the Case of French and Taiwan Mandarin
Laurent Prévot | Shu-Chuan Tseng | Alvin Cheng-Hsien Chen | Klim Peshkov
Proceedings of the 27th Pacific Asia Conference on Language, Information, and Computation (PACLIC 27)