Context-Free Transductions with Neural Stacks

Yiding Hao; William Merrill; Dana Angluin; Robert Frank; Noah Amsel; Andrew Benz; Simon Mendelsohn

doi:10.18653/v1/W18-5433

Context-Free Transductions with Neural Stacks

Yiding Hao, William Merrill, Dana Angluin, Robert Frank, Noah Amsel, Andrew Benz, Simon Mendelsohn

Abstract

This paper analyzes the behavior of stack-augmented recurrent neural network (RNN) models. Due to the architectural similarity between stack RNNs and pushdown transducers, we train stack RNN models on a number of tasks, including string reversal, context-free language modelling, and cumulative XOR evaluation. Examining the behavior of our networks, we show that stack-augmented RNNs can discover intuitive stack-based strategies for solving our tasks. However, stack RNNs are more difficult to train than classical architectures such as LSTMs. Rather than employ stack-based strategies, more complex networks often find approximate solutions by using the stack as unstructured memory.

Anthology ID:: W18-5433
Volume:: Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP
Month:: November
Year:: 2018
Address:: Brussels, Belgium
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 306–315
Language:
URL:: https://aclanthology.org/W18-5433
DOI:: 10.18653/v1/W18-5433
Bibkey:
Cite (ACL):: Yiding Hao, William Merrill, Dana Angluin, Robert Frank, Noah Amsel, Andrew Benz, and Simon Mendelsohn. 2018. Context-Free Transductions with Neural Stacks. In Proceedings of the 2018 EMNLP Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, pages 306–315, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):: Context-Free Transductions with Neural Stacks (Hao et al., EMNLP 2018)
Copy Citation:
PDF:: https://preview.aclanthology.org/paclic-22-ingestion/W18-5433.pdf
Code: viking-sudo-rm/StackNN + additional community code

PDF Search Code