@inproceedings{suzuki-etal-2016-correcting,
    title = "Correcting Errors in a Treebank Based on Tree Mining",
    author = "Suzuki, Kanta  and
      Kato, Yoshihide  and
      Matsubara, Shigeki",
    editor = "Calzolari, Nicoletta  and
      Choukri, Khalid  and
      Declerck, Thierry  and
      Goggi, Sara  and
      Grobelnik, Marko  and
      Maegaard, Bente  and
      Mariani, Joseph  and
      Mazo, Helene  and
      Moreno, Asuncion  and
      Odijk, Jan  and
      Piperidis, Stelios",
    booktitle = "Proceedings of the Tenth International Conference on Language Resources and Evaluation ({LREC}'16)",
    month = may,
    year = "2016",
    address = "Portoro{\v{z}}, Slovenia",
    publisher = "European Language Resources Association (ELRA)",
    url = "https://preview.aclanthology.org/landing_page/L16-1244/",
    pages = "1540--1545",
    abstract = "This paper provides a new method to correct annotation errors in a treebank. The previous error correction method constructs a pseudo parallel corpus where incorrect partial parse trees are paired with correct ones, and extracts error correction rules from the parallel corpus. By applying these rules to a treebank, the method corrects errors. However, this method does not achieve wide coverage of error correction. To achieve wide coverage, our method adopts a different approach. In our method, we consider that an infrequent pattern which can be transformed to a frequent one is an annotation error pattern. Based on a tree mining technique, our method seeks such infrequent tree patterns, and constructs error correction rules each of which consists of an infrequent pattern and a corresponding frequent pattern. We conducted an experiment using the Penn Treebank. We obtained 1,987 rules which are not constructed by the previous method, and the rules achieved good precision."
}Markdown (Informal)
[Correcting Errors in a Treebank Based on Tree Mining](https://preview.aclanthology.org/landing_page/L16-1244/) (Suzuki et al., LREC 2016)
ACL
- Kanta Suzuki, Yoshihide Kato, and Shigeki Matsubara. 2016. Correcting Errors in a Treebank Based on Tree Mining. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1540–1545, Portorož, Slovenia. European Language Resources Association (ELRA).