Abstract
Although natural language processing tools and the metrics used to evaluate them are increasingly available, a gap remains for less-resourced languages such as Polish. Projects designed to extend existing tools to diverse languages are therefore the best starting point for improving coverage of these languages. This paper presents the results of the first attempt at coreference resolution for Polish using statistical methods. It reports the conclusions drawn from adapting the Beautiful Anaphora Resolution Toolkit (BART; a system primarily designed for English) to Polish and compares its evaluation results with those of a previously implemented rule-based system. Finally, we describe our plans for future use of the tool and highlight upcoming research, such as larger-scale experiments and a comparison with other machine learning tools.
- Anthology ID:
- L12-1635
- Volume:
- Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
- Month:
- May
- Year:
- 2012
- Address:
- Istanbul, Turkey
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- Publisher:
- European Language Resources Association (ELRA)
- Pages:
- 192–195
- URL:
- http://www.lrec-conf.org/proceedings/lrec2012/pdf/1064_Paper.pdf
- Cite (ACL):
- Mateusz Kopeć and Maciej Ogrodniczuk. 2012. Creating a Coreference Resolution System for Polish. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 192–195, Istanbul, Turkey. European Language Resources Association (ELRA).
- Cite (Informal):
- Creating a Coreference Resolution System for Polish (Kopeć & Ogrodniczuk, LREC 2012)