@inproceedings{ilievski-etal-2016-semantic,
    title = "Semantic overfitting: what `world' do we consider when evaluating disambiguation of text?",
    author = "Ilievski, Filip  and
      Postma, Marten  and
      Vossen, Piek",
    editor = "Matsumoto, Yuji  and
      Prasad, Rashmi",
    booktitle = "Proceedings of {COLING} 2016, the 26th International Conference on Computational Linguistics: Technical Papers",
    month = dec,
    year = "2016",
    address = "Osaka, Japan",
    publisher = "The COLING 2016 Organizing Committee",
    url = "https://preview.aclanthology.org/ingest-emnlp/C16-1112/",
    pages = "1180--1191",
    abstract = "Semantic text processing faces the challenge of defining the relation between lexical expressions and the world to which they make reference within a period of time. It is unclear whether the current test sets used to evaluate disambiguation tasks are representative for the full complexity considering this time-anchored relation, resulting in semantic overfitting to a specific period and the frequent phenomena within. We conceptualize and formalize a set of metrics which evaluate this complexity of datasets. We provide evidence for their applicability on five different disambiguation tasks. To challenge semantic overfitting of disambiguation systems, we propose a time-based, metric-aware method for developing datasets in a systematic and semi-automated manner, as well as an event-based QA task."
}