Investigating associative, switchable and negatable Winograd items on renewed French data sets

Xiaoou Wang, Olga Seminck, Pascal Amsili


Abstract
The Winograd Schema Challenge (WSC) consists of a set of anaphora resolution problems resolvable only by reasoning about world knowledge. This article describes the update of the existing French data set and the creation of three subsets allowing for a more robust, fine-grained evaluation protocol of WSC in French (FWSC) : an associative subset (items easily resolvable with lexical co-occurrence), a switchable subset (items where the inversion of two keywords reverses the answer) and a negatable subset (items where applying negation on its verb reverses the answer). Experiences on these data sets with CamemBERT reach SOTA performances. Our evaluation protocol showed in addition that the higher performance could be explained by the existence of associative items in FWSC. Besides, increasing the size of training corpus improves the model’s performance on switchable items while the impact of larger training corpus remains small on negatable items.
Anthology ID:
2022.jeptalnrecital-taln.13
Volume:
Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale
Month:
6
Year:
2022
Address:
Avignon, France
Venue:
JEP/TALN/RECITAL
SIG:
Publisher:
ATALA
Note:
Pages:
136–143
Language:
URL:
https://aclanthology.org/2022.jeptalnrecital-taln.13
DOI:
Bibkey:
Cite (ACL):
Xiaoou Wang, Olga Seminck, and Pascal Amsili. 2022. Investigating associative, switchable and negatable Winograd items on renewed French data sets. In Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, pages 136–143, Avignon, France. ATALA.
Cite (Informal):
Investigating associative, switchable and negatable Winograd items on renewed French data sets (Wang et al., JEP/TALN/RECITAL 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2022.jeptalnrecital-taln.13.pdf
Code
 xiaoouwang/fwsc285
Data
WSCWinoGrande