Abstract
From tweets to product reviews, text is ubiquitous on the web and often contains valuable information for both enterprises and consumers. However, the online text is generally noisy and incomplete, requiring users to process and analyze the data to extract insights. While there are systems effective for different stages of text analysis, users lack extensible platforms to support interactive text analysis workflows end-to-end. To facilitate integrated text analytics, we introduce LEAM, which aims at combining the strengths of spreadsheets, computational notebooks, and interactive visualizations. LEAM supports interactive analysis via GUI-based interactions and provides a declarative specification language, implemented based on a visual text algebra, to enable user-guided analysis. We evaluate LEAM through two case studies using two popular Kaggle text analytics workflows to understand the strengths and weaknesses of the system.- Anthology ID:
- 2021.dash-1.9
- Volume:
- Proceedings of the Second Workshop on Data Science with Human in the Loop: Language Advances
- Month:
- June
- Year:
- 2021
- Address:
- Online
- Editors:
- Eduard Dragut, Yunyao Li, Lucian Popa, Slobodan Vucetic
- Venue:
- DaSH
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 52–58
- Language:
- URL:
- https://aclanthology.org/2021.dash-1.9
- DOI:
- 10.18653/v1/2021.dash-1.9
- Cite (ACL):
- Peter Griggs, Cagatay Demiralp, and Sajjadur Rahman. 2021. Towards integrated, interactive, and extensible text data analytics with Leam. In Proceedings of the Second Workshop on Data Science with Human in the Loop: Language Advances, pages 52–58, Online. Association for Computational Linguistics.
- Cite (Informal):
- Towards integrated, interactive, and extensible text data analytics with Leam (Griggs et al., DaSH 2021)
- PDF:
- https://preview.aclanthology.org/naacl-24-ws-corrections/2021.dash-1.9.pdf