EduCoder: An Open-Source Annotation System for Education Transcript Data

Saad Ashraf, Jim Malamut, Vishal Kumar, Guanzhong Pan, HyunJi Nam, Mei Tan, Lucía Langlois, Liliana Carolina Santos-Deonizio, Helen Spencer Higgins, Dorottya Demszky


Abstract
We present EduCoder, an open-source web platform designed for annotating classroom conversation transcripts. Existing annotation tools do not support the team-based workflows or access to instructional context that education discourse research requires. EduCoder addresses these gaps by combining transcript text, synchronized video, and instructional materials within a single workspace. The platform supports scoping annotation to specific portions of a lesson, coordinating work across annotation teams, and optionally integrating LLM-generated annotations with structured human–LLM comparison. EduCoder is freely accessible at https://edu-coder.com.
Anthology ID:
2026.acl-demo.59
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Greg Durrett, Ping Jian
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
597–604
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-demo.59/
DOI:
Bibkey:
Cite (ACL):
Saad Ashraf, Jim Malamut, Vishal Kumar, Guanzhong Pan, HyunJi Nam, Mei Tan, Lucía Langlois, Liliana Carolina Santos-Deonizio, Helen Spencer Higgins, and Dorottya Demszky. 2026. EduCoder: An Open-Source Annotation System for Education Transcript Data. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 597–604, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
EduCoder: An Open-Source Annotation System for Education Transcript Data (Ashraf et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-demo.59.pdf