@inproceedings{bekavac-snajder-2016-graph,
    title = "Graph-Based Induction of Word Senses in {C}roatian",
    author = "Bekavac, Marko  and
      {\v{S}}najder, Jan",
    editor = "Calzolari, Nicoletta  and
      Choukri, Khalid  and
      Declerck, Thierry  and
      Goggi, Sara  and
      Grobelnik, Marko  and
      Maegaard, Bente  and
      Mariani, Joseph  and
      Mazo, Helene  and
      Moreno, Asuncion  and
      Odijk, Jan  and
      Piperidis, Stelios",
    booktitle = "Proceedings of the Tenth International Conference on Language Resources and Evaluation ({LREC}'16)",
    month = may,
    year = "2016",
    address = "Portoro{\v{z}}, Slovenia",
    publisher = "European Language Resources Association (ELRA)",
    url = "https://preview.aclanthology.org/landing_page/L16-1481/",
    pages = "3014--3018",
    abstract = "Word sense induction (WSI) seeks to induce senses of words from unannotated corpora. In this paper, we address the WSI task for the Croatian language. We adopt the word clustering approach based on co-occurrence graphs, in which senses are taken to correspond to strongly inter-connected components of co-occurring words. We experiment with a number of graph construction techniques and clustering algorithms, and evaluate the sense inventories both as a clustering problem and extrinsically on a word sense disambiguation (WSD) task. In the cluster-based evaluation, Chinese Whispers algorithm outperformed Markov Clustering, yielding a normalized mutual information score of 64.3. In contrast, in WSD evaluation Markov Clustering performed better, yielding an accuracy of about 75{\%}. We are making available two induced sense inventories of 10,000 most frequent Croatian words: one coarse-grained and one fine-grained inventory, both obtained using the Markov Clustering algorithm."
}Markdown (Informal)
[Graph-Based Induction of Word Senses in Croatian](https://preview.aclanthology.org/landing_page/L16-1481/) (Bekavac & Šnajder, LREC 2016)
ACL
- Marko Bekavac and Jan Šnajder. 2016. Graph-Based Induction of Word Senses in Croatian. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3014–3018, Portorož, Slovenia. European Language Resources Association (ELRA).