Santham: A Curated Sanskrit–Tamil Dataset with Anvaya and Segmentation for Building and Evaluating Machine Translation
Prasanna Venkatesh T S, Ketaki Mangesh Shetye, Vishnuraj Arjunasamy, Ayush Kumar Sahu, Sriram Krishnan, Parameswari Krishnamurthy
- Anthology ID:
- 2026.iscls-1.5
- Volume:
- Proceedings of the 8th International Sanskrit Computational Linguistics Symposium
- Month:
- March
- Year:
- 2026
- Address:
- IIT Roorkee, Roorkee, India
- Editors:
- Pavankumar Satuluri, Pawan Goyal
- Venue:
- ISCLS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 65–80
- Language:
- URL:
- https://preview.aclanthology.org/ingest-iscls/2026.iscls-1.5/
- DOI:
- Cite (ACL):
- Prasanna Venkatesh T S, Ketaki Mangesh Shetye, Vishnuraj Arjunasamy, Ayush Kumar Sahu, Sriram Krishnan, and Parameswari Krishnamurthy. 2026. Santham: A Curated Sanskrit–Tamil Dataset with Anvaya and Segmentation for Building and Evaluating Machine Translation. In Proceedings of the 8th International Sanskrit Computational Linguistics Symposium, pages 65–80, IIT Roorkee, Roorkee, India. Association for Computational Linguistics.
- Cite (Informal):
- Santham: A Curated Sanskrit–Tamil Dataset with Anvaya and Segmentation for Building and Evaluating Machine Translation (S et al., ISCLS 2026)
- PDF:
- https://preview.aclanthology.org/ingest-iscls/2026.iscls-1.5.pdf