@inproceedings{bernardy-etal-2021-transformer,
    title = "Can the Transformer Learn Nested Recursion with Symbol Masking?",
    author = "Bernardy, Jean-Philippe  and
      Ek, Adam  and
      Maraev, Vladislav",
    editor = "Zong, Chengqing  and
      Xia, Fei  and
      Li, Wenjie  and
      Navigli, Roberto",
    booktitle = "Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021",
    month = aug,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2021.findings-acl.67/",
    doi = "10.18653/v1/2021.findings-acl.67",
    pages = "753--760"
}