Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D

Scott Martens, Marco Passarotti


Abstract
This paper describes the integration of the Index Thomisticus Treebank (IT-TB) into the web-based treebank search and visualization application TueNDRA (Tuebingen aNnotated Data Retrieval & Analysis). TueNDRA was originally designed to provide access via the Internet to constituency treebanks and to tools for searching and visualizing them, as well as tabulating statistics about their contents. TueNDRA has now been extended to also provide full support for dependency treebanks with non-projective dependencies, in order to integrate the IT-TB and future treebanks with similar properties. These treebanks are queried using an adapted form of the TIGERSearch query language, which can search both hierarchical and sequential information in treebanks in a single query. As a web application, making the IT-TB accessible via TueNDRA makes the treebank and the tools to use of it available to a large community without having to distribute software and show users how to install it.
Anthology ID:
L14-1550
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
767–774
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/70_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Scott Martens and Marco Passarotti. 2014. Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 767–774, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Thomas Aquinas in the TüNDRA: Integrating the Index Thomisticus Treebank into CLARIN-D (Martens & Passarotti, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/70_Paper.pdf