Abstract
This paper contributes to the limited body of empirical research in the domain of discourse structure of information seeking queries. We describe the development of an annotation schema for coding topic development in information seeking queries and the initial observations from a pilot sample of query sessions. The main idea that we explore is the relationship between constant and variable discourse entities and their role in tracking changes in the topic progression. We argue that the topicalized entities remain stable across development of the discourse and can be identified by a simple mechanism where anaphora resolution is a precursor. We also claim that a corpus annotated in this framework can be used as training data for dialogue management and computational semantics systems.- Anthology ID:
- L16-1277
- Volume:
- Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
- Month:
- May
- Year:
- 2016
- Address:
- Portorož, Slovenia
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 1755–1761
- Language:
- URL:
- https://aclanthology.org/L16-1277
- DOI:
- Cite (ACL):
- Marta Andersson, Adnan Öztürel, and Silvia Pareti. 2016. Annotating Topic Development in Information Seeking Queries. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1755–1761, Portorož, Slovenia. European Language Resources Association (ELRA).
- Cite (Informal):
- Annotating Topic Development in Information Seeking Queries (Andersson et al., LREC 2016)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/L16-1277.pdf