Robert Krovetz
2026
A Test Collection for Part-of-Speech Tagging and Word Sense Disambiguation
Robert Krovetz
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Robert Krovetz
Proceedings of the Fifteenth Language Resources and Evaluation Conference
We evaluate a focused test collection at the intersection of part-of-speech tagging and word-sense disambiguation. The collection targets words such as train, novel, and lean, where part-of-speech contrasts align with clear meaning differences. We use it to detect regressions across tagger versions, track quantitative and qualitative progress over time, and test robustness to orthographic variation. Experiments with the Stanford and TnT taggers show 68% accuracy, compared with 92% for a recent spaCy transformer model. Earlier taggers erred mainly on noun–verb distinctions; spaCy’s errors more often involve noun–adjective distinctions. Uppercase text roughly doubles error rates for all taggers. We discuss common problems and propose directions for future testing.
2011
The Web is not a PERSON, Berners-Lee is not an ORGANIZATION, and African-Americans are not LOCATIONS: An Analysis of the Performance of Named-Entity Recognition
Robert Krovetz | Paul Deane | Nitin Madnani
Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
Robert Krovetz | Paul Deane | Nitin Madnani
Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World
1997
Homonymy and Polysemy in Information Retrieval
Robert Krovetz
35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics
Robert Krovetz
35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics