Pub-LawBench: Public-Oriented Benchmarking for LegalAI
Qiaoyu Zheng, Zehan Ma, Yijing Zhang, Qiqi Wang, Huijia Li, Qian Liu
Abstract
Large language models (LLMs) are playing an increasingly pivotal role in LegalAI. However, existing benchmarks are primarily tailored for legal professionals, emphasizing deep reasoning and explainability. Public-facing legal applications, by contrast, demand outputs that are direct, actionable, and accessible, a need largely overlooked by current evaluation frameworks. To bridge this gap, we propose a public-oriented LegalAI benchmark grounded in legal functionalism and genre analysis. Specifically, we categorize public legal demands into two core tasks: Instant Question Answering and Legal Text Generation. We further introduce three public-oriented evaluation dimensions: legal normativity, content relevance, and format usability, which collectively assess the practical validity and user readiness of model outputs. To reflect real-world usage by lay users, we evaluate 17 LLMs on Pub-LawBench using only simple prompts and Chain-of-Thought under a vanilla inference setting, excluding complex techniques like RAG or agent-based methods inaccessible to non-experts. Experiments reveal limitations of current LLMs in delivering effective public-oriented legal assistance, highlighting the need for more user-centric model development and benchmarking. Our code and datasets are available for review at https://anonymous.4open.science/r/P-LawBench-E565/.
- Anthology ID:
- 2026.acl-long.1680
- Volume:
- Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- ACL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 36281–36303
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.acl-long.1680/
- Cite (ACL):
- Qiaoyu Zheng, Zehan Ma, Yijing Zhang, Qiqi Wang, Huijia Li, and Qian Liu. 2026. Pub-LawBench: Public-Oriented Benchmarking for LegalAI. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 36281–36303, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Pub-LawBench: Public-Oriented Benchmarking for LegalAI (Zheng et al., ACL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.acl-long.1680.pdf