Evaluating Customized vs. Generalist Transformer-based Models for Legal Contract Classification
Amrita Singh, H. Suhan Karaca, Aditya Joshi, Hye-young Paik, Jiaojiao Jiang
Abstract
Despite advances in legal NLP, no comprehensive evaluation of Transformer-based models customized for legal tasks (referred to as ’legal-specific’ models in this paper) exists for contract classification tasks. To address this gap, we present an evaluation of 13 legal-specific transformer-based models on 3 English-language contract classification tasks and compare them with 9 generalist models. The results show that legal-specific models consistently outperform generalist models, especially on tasks requiring nuanced legal understanding. They also help reduce misclassification of rare classes in imbalanced datasets. Legal-BERT and Contracts-BERT establish new SOTAs on two of the three tasks, despite having 69% fewer parameters than the best-performing generalist models. We also identify CaseLaw-BERT and LexLM as strong additional baselines for contract classification. Our results highlight the shortcomings of generalist models, emphasizing the need for domain-specific customization, particularly in the context of legal applications.- Anthology ID:
- 2026.customnlp4u-1.5
- Volume:
- Proceedings of the Second Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, USA
- Editors:
- Sheshera Mysore, Sachin Kumar, Vidhisha Balachandran, Shirley Anugrah Hayati, Faeze Brahman, Hanane Nour Moussa, Alireza Salemi
- Venues:
- CustomNLP4U | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 44–54
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.customnlp4u-1.5/
- DOI:
- Cite (ACL):
- Amrita Singh, H. Suhan Karaca, Aditya Joshi, Hye-young Paik, and Jiaojiao Jiang. 2026. Evaluating Customized vs. Generalist Transformer-based Models for Legal Contract Classification. In Proceedings of the Second Workshop on Customizable NLP: Progress and Challenges in Customizing NLP for a Domain, Application, Group, or Individual (CustomNLP4U), pages 44–54, San Diego, California, USA. Association for Computational Linguistics.
- Cite (Informal):
- Evaluating Customized vs. Generalist Transformer-based Models for Legal Contract Classification (Singh et al., CustomNLP4U 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl-workshops/2026.customnlp4u-1.5.pdf