Formal Machine Interpretation for the Semasiographic Mixtec Codices of Precolonial and Early Colonial Mesoamerica
Christopher Driggers-Ellis, Gabriel Ayoubi, Girish.Salunke811@Gmail.Com Girish.Salunke811@Gmail.Com, Christan Grant
Abstract
The precolonial and early colonial Mixtec codices describe the history and stories of the region in a semasiographic medium that is full of symbolic representations and meant to be narrated.Recently, the community has introduced datasets of XML representations of related media, including Aztec codices and Mayan hieroglyphic script, in a step towards symbolic machine interpretation of these historic Mesoamerican artifacts.In this work, we propose formal symbolic machine interpretation of XML encodings representing facsimile images from the Mixtec Codex Zouche-Nuttal.We demonstrate the efficacy of symbolic machine interpretation from XML step-by-step, showing how our parser and interpreter process text capturing a scene from the Mixtec Codex Zouche-Nuttall.We hope our contribution and the example we provide motivate collaboration among the archaeological, historical, linguistic, and natural language processing research communities to apply machine interpretation to Mixtec codices and similar manuscripts.- Anthology ID:
- 2026.alvr-main.20
- Volume:
- Proceedings of the 4th Workshop on Advances in Language and Vision Research (ALVR)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, USA
- Editors:
- Qianqi Yan, Syrielle Montariol, Yue Fan, Jing Gu, Jiayi Pan, Manling Li, Parisa Kordjamshidi, Alane Suhr, Xin Eric Wang
- Venues:
- ALVR | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 230–238
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.alvr-main.20/
- DOI:
- Cite (ACL):
- Christopher Driggers-Ellis, Gabriel Ayoubi, Girish.Salunke811@Gmail.Com Girish.Salunke811@Gmail.Com, and Christan Grant. 2026. Formal Machine Interpretation for the Semasiographic Mixtec Codices of Precolonial and Early Colonial Mesoamerica. In Proceedings of the 4th Workshop on Advances in Language and Vision Research (ALVR), pages 230–238, San Diego, California, USA. Association for Computational Linguistics.
- Cite (Informal):
- Formal Machine Interpretation for the Semasiographic Mixtec Codices of Precolonial and Early Colonial Mesoamerica (Driggers-Ellis et al., ALVR 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.alvr-main.20.pdf