@inproceedings{zhao-kawahara-2018-unified,
    title = "A Unified Neural Architecture for Joint Dialog Act Segmentation and Recognition in Spoken Dialog System",
    author = "Zhao, Tianyu  and
      Kawahara, Tatsuya",
    editor = "Komatani, Kazunori  and
      Litman, Diane  and
      Yu, Kai  and
      Papangelis, Alex  and
      Cavedon, Lawrence  and
      Nakano, Mikio",
    booktitle = "Proceedings of the 19th Annual {SIG}dial Meeting on Discourse and Dialogue",
    month = jul,
    year = "2018",
    address = "Melbourne, Australia",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/iwcs-25-ingestion/W18-5021/",
    doi = "10.18653/v1/W18-5021",
    pages = "201--208",
    abstract = "In spoken dialog systems (SDSs), dialog act (DA) segmentation and recognition provide essential information for response generation. A majority of previous works assumed ground-truth segmentation of DA units, which is not available from automatic speech recognition (ASR) in SDS. We propose a unified architecture based on neural networks, which consists of a sequence tagger for segmentation and a classifier for recognition. The DA recognition model is based on hierarchical neural networks to incorporate the context of preceding sentences. We investigate sharing some layers of the two components so that they can be trained jointly and learn generalized features from both tasks. An evaluation on the Switchboard Dialog Act (SwDA) corpus shows that the jointly-trained models outperform independently-trained models, single-step models, and other reported results in DA segmentation, recognition, and joint tasks."
}Markdown (Informal)
[A Unified Neural Architecture for Joint Dialog Act Segmentation and Recognition in Spoken Dialog System](https://preview.aclanthology.org/iwcs-25-ingestion/W18-5021/) (Zhao & Kawahara, SIGDIAL 2018)
ACL