Towards IP Intelligence: Benchmarking Large Language Models on Intellectual Property Knowledge and Practice

Qiyao Wang; Guhong Chen; Hongbo Wang; Huaren Liu; Minghui Zhu; Zhifei Qin; Li Linwei; Yilin Yue; Shiqiang Wang; Jiayan Li; Wu Yihang; Ziqiang Liu; Longze Chen; Run Luo; Liyang Fan; Jiaming Li; Lei Zhang; Kan Xu; Hamid Alinejad-Rokny; Chengming Li; Shiwen Ni; Yuan Lin; Min Yang

Towards IP Intelligence: Benchmarking Large Language Models on Intellectual Property Knowledge and Practice

Qiyao Wang, Guhong Chen, Hongbo Wang, Huaren Liu, Minghui Zhu, Zhifei Qin, Li Linwei, Yilin Yue, Shiqiang Wang, Jiayan Li, Wu Yihang, Ziqiang Liu, Longze Chen, Run Luo, Liyang Fan, Jiaming Li, Lei Zhang, Kan Xu, Hamid Alinejad-Rokny, Chengming Li, Shiwen Ni, Yuan Lin, Min Yang

Abstract

Intellectual Property (IP) is a highly specialized domain that integrates technical and legal knowledge, making it inherently complex and knowledge-intensive. Recent advancements in LLMs have demonstrated their potential to handle IP tasks, enabling more efficient analysis, understanding, and generation of IP-related content. However, existing datasets and benchmarks focus narrowly on patents or cover limited aspects of the IP field, lacking alignment with real-world scenarios. To bridge this gap, we introduce **IPBench**, the first comprehensive IP task taxonomy and a large-scale bilingual benchmark encompassing **8 IP mechanisms and 20 distinct tasks**, designed to evaluate LLMs in real-world IP practice. We benchmark **19 main LLMs**, ranging from general purpose to domain-specific, including chat-oriented and reasoning-focused models, under zero-shot, few-shot, and chain-of-thought settings. Our results show that even the top-performing model, DeepSeek-V3, achieves only 75.8% accuracy, indicating significant room for improvement. Notably, open-source IP and law-oriented models lag behind closed-source general-purpose models. To foster future research, we publicly release IPBench, and will expand it with additional tasks to better reflect real-world complexities and support model advancements in the IP domain. We provide the data, code in the supplementary materials.

Anthology ID:: 2026.findings-acl.1000
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 20007–20052
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1000/
DOI:
Bibkey:
Cite (ACL):: Qiyao Wang, Guhong Chen, Hongbo Wang, Huaren Liu, Minghui Zhu, Zhifei Qin, Li Linwei, Yilin Yue, Shiqiang Wang, Jiayan Li, Wu Yihang, Ziqiang Liu, Longze Chen, Run Luo, Liyang Fan, Jiaming Li, Lei Zhang, Kan Xu, Hamid Alinejad-Rokny, Chengming Li, Shiwen Ni, Yuan Lin, and Min Yang. 2026. Towards IP Intelligence: Benchmarking Large Language Models on Intellectual Property Knowledge and Practice. In Findings of the Association for Computational Linguistics: ACL 2026, pages 20007–20052, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Towards IP Intelligence: Benchmarking Large Language Models on Intellectual Property Knowledge and Practice (Wang et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1000.pdf
Checklist:: 2026.findings-acl.1000.checklist.pdf

PDF Cite Search Checklist Fix data