@inproceedings{jwalapuram-2017-evaluating,
    title = "Evaluating Dialogs based on {G}rice{'}s Maxims",
    author = "Jwalapuram, Prathyusha",
    editor = "Kovatchev, Venelin  and
      Temnikova, Irina  and
      Gencheva, Pepa  and
      Kiprov, Yasen  and
      Nikolova, Ivelina",
    booktitle = "Proceedings of the Student Research Workshop Associated with {RANLP} 2017",
    month = sep,
    year = "2017",
    address = "Varna",
    publisher = "INCOMA Ltd.",
    url = "https://preview.aclanthology.org/iwcs-25-ingestion/R17-2003/",
    doi = "10.26615/issn.1314-9156.2017_003",
    pages = "17--24",
    abstract = "There is no agreed upon standard for the evaluation of conversational dialog systems, which are well-known to be hard to evaluate due to the difficulty in pinning down metrics that will correspond to human judgements and the subjective nature of human judgment itself. We explored the possibility of using Grice{'}s Maxims to evaluate effective communication in conversation. We collected some system generated dialogs from popular conversational chatbots across the spectrum and conducted a survey to see how the human judgements based on Gricean maxims correlate, and if such human judgments can be used as an effective evaluation metric for conversational dialog."
}Markdown (Informal)
[Evaluating Dialogs based on Grice’s Maxims](https://preview.aclanthology.org/iwcs-25-ingestion/R17-2003/) (Jwalapuram, RANLP 2017)
ACL