Abstract
We present a topic boundary detection method that searches for connections between sequences of utterances in multi party dialogues. The connections are established based on word identity. We compare our method to a state-of-the art automatic Topic boundary detection method that was also used on multi party dialogues. We checked various methods of preprocessing of the data, including stemming, lemmatization and stopword filtering with a text-based as well as speech-based stopword lists. Using standard evaluation methods we found that our method outperformed the state-of-the art method.- Anthology ID:
- L08-1310
- Volume:
- Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)
- Month:
- May
- Year:
- 2008
- Address:
- Marrakech, Morocco
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2008/pdf/660_paper.pdf
- DOI:
- Cite (ACL):
- Margot Mieskes and Michael Strube. 2008. Parameters for Topic Boundary Detection in Multi-Party Dialogues. In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco. European Language Resources Association (ELRA).
- Cite (Informal):
- Parameters for Topic Boundary Detection in Multi-Party Dialogues (Mieskes & Strube, LREC 2008)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2008/pdf/660_paper.pdf