Sashank Tatavolu
2025
Indian Grammatical Tradition-Inspired Universal Semantic Representation Bank (USR Bank 1.0)
Soma Paul | Sukhada Sukhada | Bidisha Bhattacharjee | Kumari Riya | Sashank Tatavolu | Kamesh R | Isma Anwar | Pratibha Rani
Proceedings of the 1st Workshop on Benchmarks, Harmonization, Annotation, and Standardization for Human-Centric AI in Indian Languages (BHASHA 2025)
In this paper, we introduce USR Bank 1.0, a multi-layered, text-level semantic representation framework designed to capture not only the predicate-argument structure of an utterance but also the speaker’s communicative intent as expressed linguistically. Built on the Universal Semantic Grammar (USG), which is grounded in Pāṇinian grammar and the Indian Grammatical Tradition (IGT), USR systematically encodes semantic, morpho-syntactic, discourse, and pragmatic information across distinct layers. In the USR generation process, initial USRs are automatically generated using a dedicated USR-builder tool and subsequently validated via a web-based interface (SAVI), ensuring high inter-annotator agreement and semantic fidelity. Our evaluation on Hindi texts demonstrates robust dependency and discourse annotation consistency and strong semantic similarity in USR-to-text generation. By distributing semantic-pragmatic information across layers and capturing the speaker’s perspective, USR provides a cognitively motivated, language-agnostic framework with promising applications in multilingual natural language processing.