Why is sentence similarity benchmark not predictive of application-oriented task performance? - ACL Anthology

This is an internal, incomplete preview of a proposed change to the ACL Anthology. For efficiency reasons, we don't generate MODS or Endnote formats, and the preview may be incomplete in other ways, or contain mistakes. Do not treat this content as an official publication.

Why is sentence similarity benchmark not predictive of application-oriented task performance?

Kaori Abe, Sho Yokoi, Tomoyuki Kajiwara, Kentaro Inui

Anthology ID:: 2022.eval4nlp-1.8
Volume:: Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems
Month:: November
Year:: 2022
Address:: Online
Editors:: Daniel Deutsch, Can Udomcharoenchaikit, Juri Opitz, Yang Gao, Marina Fomicheva, Steffen Eger
Venue:: Eval4NLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 70–87
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2022.eval4nlp-1.8/
DOI:: 10.18653/v1/2022.eval4nlp-1.8
Bibkey:
Cite (ACL):: Kaori Abe, Sho Yokoi, Tomoyuki Kajiwara, and Kentaro Inui. 2022. Why is sentence similarity benchmark not predictive of application-oriented task performance?. In Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems, pages 70–87, Online. Association for Computational Linguistics.
Cite (Informal):: Why is sentence similarity benchmark not predictive of application-oriented task performance? (Abe et al., Eval4NLP 2022)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2022.eval4nlp-1.8.pdf
Supplementarymaterial:: 2022.eval4nlp-1.8.SupplementaryMaterial.zip

PDF Cite Search Supplementarymaterial Fix data