Abstract
pyMMAX2 is an API for processing MMAX2 stand-off annotation data in Python. It provides a lightweight basis for the development of code which opens up the Java- and XML-based ecosystem of MMAX2 for more recent, Python-based NLP and data science methods. While pyMMAX2 is pure Python, and most functionality is implemented from scratch, the API re-uses the complex implementation of the essential business logic for MMAX2 annotation schemes by interfacing with the original MMAX2 Java libraries. pyMMAX2 is available for download at http://github.com/nlpAThits/pyMMAX2.- Anthology ID:
- 2020.law-1.16
- Volume:
- Proceedings of the 14th Linguistic Annotation Workshop
- Month:
- December
- Year:
- 2020
- Address:
- Barcelona, Spain
- Venue:
- LAW
- SIG:
- SIGANN
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 167–173
- Language:
- URL:
- https://aclanthology.org/2020.law-1.16
- DOI:
- Cite (ACL):
- Mark-Christoph Müller. 2020. pyMMAX2: Deep Access to MMAX2 Projects from Python. In Proceedings of the 14th Linguistic Annotation Workshop, pages 167–173, Barcelona, Spain. Association for Computational Linguistics.
- Cite (Informal):
- pyMMAX2: Deep Access to MMAX2 Projects from Python (Müller, LAW 2020)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2020.law-1.16.pdf
- Code
- nlpathits/pymmax2