Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning
Kush Juvekar, Arghya Bhattacharya, Sai Khadloya, Utkarsh Saxena
Abstract
Large language models (LLMs) are moving into legal workflows, yet we lack a jurisdiction-grounded way to gauge their basic competence in thereof. We use India’s public legal examinations as a transparent proxy. Our multi-year benchmark assembles objective screens from top national and state exams and evaluates open and frontier LLMs under real world exam conditions. To probe beyond MCQs, we also include a lawyer-graded, paired-blinded study of long-form answers from the Supreme Court’s Advocate-on-Record exam. This is, to our knowledge, the first exam-grounded, India-specific yardstick for LLM court-readiness released with datasets and protocols. Our work shows that while frontier systems consistently clear historical cutoffs and often match or exceed recent top-scorer bands on objective exams, none surpasses the human topper on long-form reasoning. Grader notes converge on three reliability failure modes—procedural/format compliance, authority/citation discipline, and forum-appropriate voice/structure. These findings delineate where LLMs can assist (checks, cross-statute consistency, statute and precedent lookups) and where human leadership remains essential: forum-specific drafting and filing, procedural and relief strategy, reconciling authorities and exceptions, and ethical, accountable judgment.- Anthology ID:
- 2025.nllp-1.26
- Volume:
- Proceedings of the Natural Legal Language Processing Workshop 2025
- Month:
- November
- Year:
- 2025
- Address:
- Suzhou, China
- Editors:
- Nikolaos Aletras, Ilias Chalkidis, Leslie Barrett, Cătălina Goanță, Daniel Preoțiuc-Pietro, Gerasimos Spanakis
- Venues:
- NLLP | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 359–369
- Language:
- URL:
- https://preview.aclanthology.org/ingest-emnlp/2025.nllp-1.26/
- DOI:
- Cite (ACL):
- Kush Juvekar, Arghya Bhattacharya, Sai Khadloya, and Utkarsh Saxena. 2025. Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning. In Proceedings of the Natural Legal Language Processing Workshop 2025, pages 359–369, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal):
- Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning (Juvekar et al., NLLP 2025)
- PDF:
- https://preview.aclanthology.org/ingest-emnlp/2025.nllp-1.26.pdf