Mapping Hymns and Organizing Concepts in the Rigveda: Quantitatively Connecting the Vedic Suktas

Venkatesh Bollineni, Igor Crk, Eren Gultepe


Abstract
Accessing and gaining insight into the Rigveda poses a non-trivial challenge due to its extremely ancient Sanskrit language, poetic structure, and large volume of text. By using NLP techniques, this study identified topics and semantic connections of hymns within the Rigveda that were corroborated by seven well-known groupings of hymns. The 1,028 suktas (hymns) from the modern English translation of the Rigveda by Jamison and Brereton were preprocessed and sukta-level embeddings were obtained using, i) a novel adaptation of LSA, presented herein, ii) SBERT, and iii) Doc2Vec embeddings. Following an UMAP dimension reduction of the vectors, the network of suktas was formed using k-nearest neighbours. Then, community detection of topics in the sukta networks was performed with the Louvain, Leiden, and label propagation methods, whose statistical significance of the formed topics were determined using an appropriate null distribution. Only the novel adaptation of LSA using the Leiden method, had detected sukta topic networks that were significant (z = 2.726, p < .01) with a modularity score of 0.944. Of the seven famous sukta groupings analyzed (e.g., creation, funeral, water, etc.) the LSA derived network was successful in all seven cases, while Doc2Vec was not significant and failed to detect the relevant suktas. SBERT detected four of the famous suktas as separate groups, but mistakenly combined three of them into a single mixed group. Also, the SBERT network was not statistically significant.
Anthology ID:
2025.nlp4dh-1.44
Volume:
Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities
Month:
May
Year:
2025
Address:
Albuquerque, USA
Editors:
Mika Hämäläinen, Emily Öhman, Yuri Bizzoni, So Miyagawa, Khalid Alnajjar
Venues:
NLP4DH | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
514–523
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.nlp4dh-1.44/
DOI:
Bibkey:
Cite (ACL):
Venkatesh Bollineni, Igor Crk, and Eren Gultepe. 2025. Mapping Hymns and Organizing Concepts in the Rigveda: Quantitatively Connecting the Vedic Suktas. In Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities, pages 514–523, Albuquerque, USA. Association for Computational Linguistics.
Cite (Informal):
Mapping Hymns and Organizing Concepts in the Rigveda: Quantitatively Connecting the Vedic Suktas (Bollineni et al., NLP4DH 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.nlp4dh-1.44.pdf