Atril: an XML Visualization System for Corpus Texts

Andressa Rodrigues Gomide, Conceição Carapinha, Cornelia Plag


Abstract
This paper presents Atril, an XML visualization system for corpus texts, developed for, but not restricted to, the project Corpus de Audiências (CorAuDis), a corpus composed of transcripts of sessions of criminal proceedings recorded at the Coimbra Court. The main aim of the tool is to provide researchers with a web-based environment that allows for an easily customizable visualization of corpus texts with heavy structural annotation. Existing corpus analysis tools such as SketchEngine, TEITOK and CQPweb offer some visualization mechanisms, but, to our knowledge, none meets our project’s main needs. We require a system that is open-source, that can be easily connected to CQPweb and TEITOK, that provides a full-text view with switchable visualization templates, and that allows for the visualization of overlapping utterances. To meet those requirements, we created Atril, a module with a corpus XML file viewer, a visualization management system, and a word alignment tool.
Anthology ID:
2022.lrec-1.611
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
5692–5695
URL:
https://aclanthology.org/2022.lrec-1.611
Cite (ACL):
Andressa Rodrigues Gomide, Conceição Carapinha, and Cornelia Plag. 2022. Atril: an XML Visualization System for Corpus Texts. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 5692–5695, Marseille, France. European Language Resources Association.
Cite (Informal):
Atril: an XML Visualization System for Corpus Texts (Rodrigues Gomide et al., LREC 2022)
PDF:
https://preview.aclanthology.org/ingestion-script-update/2022.lrec-1.611.pdf