@inproceedings{ilinykh-etal-2022-look,
    title = "Look and Answer the Question: On the Role of Vision in Embodied Question Answering",
    author = "Ilinykh, Nikolai  and
      Emampoor, Yasmeen  and
      Dobnik, Simon",
    editor = "Shaikh, Samira  and
      Ferreira, Thiago  and
      Stent, Amanda",
    booktitle = "Proceedings of the 15th International Conference on Natural Language Generation",
    month = jul,
    year = "2022",
    address = "Waterville, Maine, USA and virtual meeting",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2022.inlg-main.19/",
    doi = "10.18653/v1/2022.inlg-main.19",
    pages = "236--245",
    abstract = ""
}Markdown (Informal)
[Look and Answer the Question: On the Role of Vision in Embodied Question Answering](https://preview.aclanthology.org/ingest-emnlp/2022.inlg-main.19/) (Ilinykh et al., INLG 2022)
ACL