Santham: A Curated Sanskrit–Tamil Dataset with Anvaya and Segmentation for Building and Evaluating Machine Translation

Prasanna Venkatesh T S, Ketaki Mangesh Shetye, Vishnuraj Arjunasamy, Ayush Kumar Sahu, Sriram Krishnan, Parameswari Krishnamurthy


Anthology ID:
2026.iscls-1.5
Volume:
Proceedings of the 8th International Sanskrit Computational Linguistics Symposium
Month:
March
Year:
2026
Address:
IIT Roorkee, Roorkee, India
Editors:
Pavankumar Satuluri, Pawan Goyal
Venue:
ISCLS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
65–80
Language:
URL:
https://preview.aclanthology.org/ingest-iscls/2026.iscls-1.5/
DOI:
Bibkey:
Cite (ACL):
Prasanna Venkatesh T S, Ketaki Mangesh Shetye, Vishnuraj Arjunasamy, Ayush Kumar Sahu, Sriram Krishnan, and Parameswari Krishnamurthy. 2026. Santham: A Curated Sanskrit–Tamil Dataset with Anvaya and Segmentation for Building and Evaluating Machine Translation. In Proceedings of the 8th International Sanskrit Computational Linguistics Symposium, pages 65–80, IIT Roorkee, Roorkee, India. Association for Computational Linguistics.
Cite (Informal):
Santham: A Curated Sanskrit–Tamil Dataset with Anvaya and Segmentation for Building and Evaluating Machine Translation (S et al., ISCLS 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-iscls/2026.iscls-1.5.pdf