@inproceedings{snell-etal-2022-context,
    title = "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems",
    author = "Snell, Charlie  and
      Yang, Sherry  and
      Fu, Justin  and
      Su, Yi  and
      Levine, Sergey",
    editor = "Carpuat, Marine  and
      de Marneffe, Marie-Catherine  and
      Meza Ruiz, Ivan Vladimir",
    booktitle = "Findings of the Association for Computational Linguistics: NAACL 2022",
    month = jul,
    year = "2022",
    address = "Seattle, United States",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2022.findings-naacl.181/",
    doi = "10.18653/v1/2022.findings-naacl.181",
    pages = "2351--2366",
    abstract = "Goal-oriented dialogue systems face a trade-off between fluent language generation and task-specific control. While supervised learning with large language models is capable of producing realistic text, how to steer such responses towards completing a specific task without sacrificing language quality remains an open question. In this work, we formulate goal-oriented dialogue as a partially observed Markov decision process, interpreting the language model as a representation of both the dynamics and the policy. This view allows us to extend techniques from learning-based control, such as task relabeling, to derive a simple and effective method to finetune language models in a goal-aware way, leading to significantly improved task performance. We additionally introduce a number of training strategies that serve to better focus the model on the task at hand. We evaluate our method, Context-Aware Language Models (CALM), on a practical flight-booking task using AirDialogue. Empirically, CALM outperforms the state-of-the-art method by 7{\%} in terms of task success, matching human-level task performance."
}Markdown (Informal)
[Context-Aware Language Modeling for Goal-Oriented Dialogue Systems](https://preview.aclanthology.org/ingest-emnlp/2022.findings-naacl.181/) (Snell et al., Findings 2022)
ACL