@inproceedings{abe-etal-2022-sentence,
    title = "Why is sentence similarity benchmark not predictive of application-oriented task performance?",
    author = "Abe, Kaori  and
      Yokoi, Sho  and
      Kajiwara, Tomoyuki  and
      Inui, Kentaro",
    editor = "Deutsch, Daniel  and
      Udomcharoenchaikit, Can  and
      Opitz, Juri  and
      Gao, Yang  and
      Fomicheva, Marina  and
      Eger, Steffen",
    booktitle = "Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems",
    month = nov,
    year = "2022",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2022.eval4nlp-1.8/",
    doi = "10.18653/v1/2022.eval4nlp-1.8",
    pages = "70--87"
}Markdown (Informal)
[Why is sentence similarity benchmark not predictive of application-oriented task performance?](https://preview.aclanthology.org/ingest-emnlp/2022.eval4nlp-1.8/) (Abe et al., Eval4NLP 2022)
ACL