Annotating Topic Development in Information Seeking Queries

Marta Andersson, Adnan Öztürel, Silvia Pareti


Abstract
This paper contributes to the limited body of empirical research in the domain of discourse structure of information seeking queries. We describe the development of an annotation schema for coding topic development in information seeking queries and the initial observations from a pilot sample of query sessions. The main idea that we explore is the relationship between constant and variable discourse entities and their role in tracking changes in the topic progression. We argue that the topicalized entities remain stable across development of the discourse and can be identified by a simple mechanism where anaphora resolution is a precursor. We also claim that a corpus annotated in this framework can be used as training data for dialogue management and computational semantics systems.
Anthology ID:
L16-1277
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1755–1761
Language:
URL:
https://aclanthology.org/L16-1277
DOI:
Bibkey:
Cite (ACL):
Marta Andersson, Adnan Öztürel, and Silvia Pareti. 2016. Annotating Topic Development in Information Seeking Queries. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1755–1761, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Annotating Topic Development in Information Seeking Queries (Andersson et al., LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/L16-1277.pdf