Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT

Zhiyong Wu; Yun Chen; Ben Kao; Qun Liu

doi:10.18653/v1/2020.acl-main.383

Abstract

By introducing a small set of additional parameters, a probe learns to solve specific linguistic tasks (e.g., dependency parsing) in a supervised manner using feature representations (e.g., contextualized embeddings). The effectiveness of such probing tasks is taken as evidence that the pre-trained model encodes linguistic knowledge. However, this approach of evaluating a language model is undermined by the uncertainty of the amount of knowledge that is learned by the probe itself. Complementary to those works, we propose a parameter-free probing technique for analyzing pre-trained language models (e.g., BERT). Our method does not require direct supervision from the probing tasks, nor do we introduce additional parameters to the probing process. Our experiments on BERT show that syntactic trees recovered from BERT using our method are significantly better than linguistically-uninformed baselines. We further feed the empirically induced dependency structures into a downstream sentiment classification task and find its improvement compatible with or even superior to a human-designed dependency schema.

Anthology ID: 2020.acl-main.383
Volume: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Month: July
Year: 2020
Address: Online
Editors: Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 4166–4176
URL: https://aclanthology.org/2020.acl-main.383
DOI: 10.18653/v1/2020.acl-main.383
Cite (ACL): Zhiyong Wu, Yun Chen, Ben Kao, and Qun Liu. 2020. Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 4166–4176, Online. Association for Computational Linguistics.
Cite (Informal): Perturbed Masking: Parameter-free Probing for Analyzing and Interpreting BERT (Wu et al., ACL 2020)
PDF: https://preview.aclanthology.org/nschneid-patch-4/2020.acl-main.383.pdf
Video: http://slideslive.com/38929032
Code: LividWo/Perturbed-Masking
