@article{henderson-etal-2008-hybrid,
    title = "Hybrid Reinforcement/Supervised Learning of Dialogue Policies from Fixed Data Sets",
    author = "Henderson, James  and
      Lemon, Oliver  and
      Georgila, Kallirroi",
    journal = "Computational Linguistics",
    volume = "34",
    number = "4",
    year = "2008",
    url = "https://aclanthology.org/J08-4002",
    doi = "10.1162/coli.2008.07-028-R2-05-82",
    pages = "487--511",
}