Sathurgini Uthayakumar
2026
Thiruppugazh-KG Dataset: A Manually Annotated Resource for Computational Analysis of Tamil Devotional Literature
Garthigan Kumarasamy | Jubeerathan Thevakumar | Sathurgini Uthayakumar | Disne Kajanath | Narthana Sivalingam | Uthayasanker Thayasivam
Proceedings of the Sixth Workshop on Speech, Vision, and Language Technologies for Dravidian Languages
Garthigan Kumarasamy | Jubeerathan Thevakumar | Sathurgini Uthayakumar | Disne Kajanath | Narthana Sivalingam | Uthayasanker Thayasivam
Proceedings of the Sixth Workshop on Speech, Vision, and Language Technologies for Dravidian Languages
This paper introduces Thiruppugazh-KG, a semantically annotated dataset and knowledge graph derived from the Thiruppugazh corpus, a 14th-century collection of 1,335 Tamil devotional hymns composed by Arunagirinathar. The dataset includes annotations for entities, devotional themes, mythological events, philosophical concepts, imagery, and sacred locations mentioned in each hymn. Using these annotations, we construct a Neo4j-based knowledge graph that models relationships between hymns and their associated cultural and narrative elements. Graph analytics, including PageRank, are applied to identify prominent entities and sacred locations within the corpus. The resulting resource provides a structured representation of Tamil devotional literature and supports computational analysis of cultural texts in low-resource languages.