Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench

Rafal Rak, Andrew Rowley, Sophia Ananiadou


Abstract
Challenges in creating comprehensive text-processing worklows include a lack of the interoperability of individual components coming from different providers and/or a requirement imposed on the end users to know programming techniques to compose such workflows. In this paper we demonstrate Argo, a web-based system that addresses these issues in several ways. It supports the widely adopted Unstructured Information Management Architecture (UIMA), which handles the problem of interoperability; it provides a web browser-based interface for developing workflows by drawing diagrams composed of a selection of available processing components; and it provides novel user-interactive analytics such as the annotation editor which constitutes a bridge between automatic processing and manual correction. These features extend the target audience of Argo to users with a limited or no technical background. Here, we focus specifically on the construction of advanced workflows, involving multiple branching and merging points, to facilitate various comparative evalutions. Together with the use of user-collaboration capabilities supported in Argo, we demonstrate several use cases including visual inspections, comparisions of multiple processing segments or complete solutions against a reference standard, inter-annotator agreement, and shared task mass evaluations. Ultimetely, Argo emerges as a one-stop workbench for defining, processing, editing and evaluating text processing tasks.
Anthology ID:
L12-1572
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2971–2976
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/960_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Rafal Rak, Andrew Rowley, and Sophia Ananiadou. 2012. Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2971–2976, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Collaborative Development and Evaluation of Text-processing Workflows in a UIMA-supported Web-based Workbench (Rak et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/960_Paper.pdf