Bing Zhao
2026
PLAWBENCH: A Rubric-Based Benchmark for Evaluating LLMs in Real-World Legal Practice
Yuzhen Shi | Huanghai Liu | Yiran HU | Song Gaojie | Xu Xinran | Yubo Ma | Tianyi Tang | Li Zhang | Qingjing Chen | Feng Di | Wenbo Lv | Weiheng Wu | Kexin Yang | Sen Yang | Wei Wang | Rongyao Shi | Qiu Yuanyang | Yuemeng Qi | Zhang Jingwen | Sui Xiaoyu | Yifan Chen | Zhang Yi | An Yang | Bowen Yu | Dayiheng Liu | Junyang Lin | Weixing Shen | Bing Zhao | Charles L. A. Clarke | HU Wei
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yuzhen Shi | Huanghai Liu | Yiran HU | Song Gaojie | Xu Xinran | Yubo Ma | Tianyi Tang | Li Zhang | Qingjing Chen | Feng Di | Wenbo Lv | Weiheng Wu | Kexin Yang | Sen Yang | Wei Wang | Rongyao Shi | Qiu Yuanyang | Yuemeng Qi | Zhang Jingwen | Sui Xiaoyu | Yifan Chen | Zhang Yi | An Yang | Bowen Yu | Dayiheng Liu | Junyang Lin | Weixing Shen | Bing Zhao | Charles L. A. Clarke | HU Wei
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
As large language models (LLMs) are increasingly applied to legal domain-specific tasks, evaluating their ability to perform legal work in real-world settings has become essential. However, existing legal benchmarks rely on simplified and highly standardized tasks, failing to capture the ambiguity, complexity, and reasoning demands of real legal practice. Moreover, prior evaluations often adopt coarse, single-dimensional metrics and do not explicitly assess fine-grained legal reasoning. To address these limitations, we introduce PLawBench, a Practical Law Benchmark designed to evaluate LLMs in realistic legal practice scenarios. Grounded in real-world legal workflows, PLawBench models the core processes of legal practitioners through three task categories: public legal consultation, practical case analysis, and legal document generation. These tasks assess a model’s ability to identify legal issues and key facts, perform structured legal reasoning, and generate legally coherent documents. PLawBench comprises 850 questions across 13 practical legal scenarios, with each question accompanied by expert-designed evaluation rubrics, resulting in approximately 12,500 rubric items for fine-grained assessment. Using an LLM-based evaluator aligned with human expert judgments, we evaluate 10 state-of-the-art LLMs. Experimental results show that none achieves strong performance on PLawBench, revealing substantial limitations in the fine-grained legal reasoning capabilities of current LLMs and highlighting important directions for future evaluation and development of legal LLMs. Data is available at: https://anonymous.4open.science/r/PLawbench-B524/.
2011
A Statistical Tree Annotator and Its Applications
Xiaoqiang Luo | Bing Zhao
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
Xiaoqiang Luo | Bing Zhao
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
Learning to Transform and Select Elementary Trees for Improved Syntax-based Machine Translations
Bing Zhao | Young-Suk Lee | Xiaoqiang Luo | Liu Li
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
Bing Zhao | Young-Suk Lee | Xiaoqiang Luo | Liu Li
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies
2010
Constituent Reordering and Syntax Models for English-to-Japanese Statistical Machine Translation
Young-Suk Lee | Bing Zhao | Xiaoqian Luo
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
Young-Suk Lee | Bing Zhao | Xiaoqian Luo
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
2009
A Simplex Armijo Downhill Algorithm for Optimizing Statistical Machine Translation Decoding Parameters
Bing Zhao | Shengyuan Chen
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Bing Zhao | Shengyuan Chen
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
2008
Generalizing Local and Non-Local Word-Reordering Patterns for Syntax-Based Machine Translation
Bing Zhao | Yaser Al-onaizan
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing
Bing Zhao | Yaser Al-onaizan
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing
2007
A Log-Linear Block Transliteration Model based on Bi-Stream HMMs
Bing Zhao | Nguyen Bach | Ian Lane | Stephan Vogel
Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference
Bing Zhao | Nguyen Bach | Ian Lane | Stephan Vogel
Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference
2006
BiTAM: Bilingual Topic AdMixture Models for Word Alignment
Bing Zhao | Eric P. Xing
Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions
Bing Zhao | Eric P. Xing
Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions
The UKA/CMU statistical machine translation system for IWSLT 2006
Matthias Eck | Ian Lane | Nguyen Bach | Sanjika Hewavitharana | Muntsin Kolss | Bing Zhao | Almut Silja Hildebrand | Stephan Vogel | Alex Waibel
Proceedings of the Third International Workshop on Spoken Language Translation: Evaluation Campaign
Matthias Eck | Ian Lane | Nguyen Bach | Sanjika Hewavitharana | Muntsin Kolss | Bing Zhao | Almut Silja Hildebrand | Stephan Vogel | Alex Waibel
Proceedings of the Third International Workshop on Spoken Language Translation: Evaluation Campaign
2005
A Generalized Alignment-Free Phrase Extraction
Bing Zhao | Stephan Vogel
Proceedings of the ACL Workshop on Building and Using Parallel Texts
Bing Zhao | Stephan Vogel
Proceedings of the ACL Workshop on Building and Using Parallel Texts
Bilingual Word Spectral Clustering for Statistical Machine Translation
Bing Zhao | Eric P. Xing | Alex Waibel
Proceedings of the ACL Workshop on Building and Using Parallel Texts
Bing Zhao | Eric P. Xing | Alex Waibel
Proceedings of the ACL Workshop on Building and Using Parallel Texts
Learning a Log-Linear Model with Bilingual Phrase-Pair Features for Statistical Machine Translation
Bing Zhao | Alex Waibel
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing
Bing Zhao | Alex Waibel
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing
Inner-Outer Bracket Models for Word Alignment using Hidden Blocks
Bing Zhao | Niyu Ge | Kishore Papineni
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing
Bing Zhao | Niyu Ge | Kishore Papineni
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing
The CMU Statistical Machine Translation System for IWSLT2005
Sanjika Hewavitharana | Bing Zhao | Hildebrand | Almut Silja | Matthias Eck | Chiori Hori | Stephan Vogel | Alex Waibel
Proceedings of the Second International Workshop on Spoken Language Translation
Sanjika Hewavitharana | Bing Zhao | Hildebrand | Almut Silja | Matthias Eck | Chiori Hori | Stephan Vogel | Alex Waibel
Proceedings of the Second International Workshop on Spoken Language Translation
2004
Phrase Pair Rescoring with Term Weighting for Statistical Machine Translation
Bing Zhao | Stephan Vogel | Matthias Eck | Alex Waibel
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing
Bing Zhao | Stephan Vogel | Matthias Eck | Alex Waibel
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing
Language Model Adaptation for Statistical Machine Translation via Structured Query Models
Bing Zhao | Matthias Eck | Stephan Vogel
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics
Bing Zhao | Matthias Eck | Stephan Vogel
COLING 2004: Proceedings of the 20th International Conference on Computational Linguistics
2003
Efficient Optimization for Bilingual Sentence Alignment Based on Linear Regression
Bing Zhao | Klaus Zechner | Stephen Vogel | Alex Waibel
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond
Bing Zhao | Klaus Zechner | Stephen Vogel | Alex Waibel
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond
Word Alignment Based on Bilingual Bracketing
Bing Zhao | Stephan Vogel
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond
Bing Zhao | Stephan Vogel
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond
The CMU statistical machine translation system
Stephan Vogel | Ying Zhang | Fei Huang | Alicia Tribble | Ashish Venugopal | Bing Zhao | Alex Waibel
Proceedings of Machine Translation Summit IX: Papers
Stephan Vogel | Ying Zhang | Fei Huang | Alicia Tribble | Ashish Venugopal | Bing Zhao | Alex Waibel
Proceedings of Machine Translation Summit IX: Papers
In this paper we describe the components of our statistical machine translation system. This system combines phrase-to-phrase translations extracted from a bilingual corpus using different alignment approaches. Special methods to extract and align named entities are used. We show how a manual lexicon can be incorporated into the statistical system in an optimized way. Experiments on Chinese-to-English and Arabic-to-English translation tasks are presented.
Search
Fix author
Co-authors
- Stephan Vogel 9
- Alex Waibel 7
- Matthias Eck 4
- Nguyen Bach 2
- Sanjika Hewavitharana 2
- Ian Lane 2
- Young-Suk Lee 2
- Xiaoqiang Luo 2
- Eric Xing 2
- Yaser Al-Onaizan 1
- Qingjing Chen 1
- Shengyuan Chen 1
- Yifan Chen 1
- Charles L. A. Clarke 1
- Feng Di 1
- Song Gaojie 1
- Niyu Ge 1
- Yiran HU 1
- Hildebrand 1
- Almut Silja Hildebrand 1
- Chiori Hori 1
- Fei Huang 1
- Zhang Jingwen 1
- Muntsin Kolss 1
- Liu Li 1
- Junyang Lin 1
- Dayiheng Liu 1
- Huanghai Liu 1
- Xiaoqian Luo 1
- Wenbo Lv 1
- Yubo Ma 1
- Kishore Papineni 1
- Yuemeng Qi 1
- Weixing Shen 1
- Rongyao Shi 1
- Yuzhen Shi 1
- Almut Silja 1
- Tianyi Tang 1
- Alicia Tribble 1
- Ashish Venugopal 1
- Wei Wang 1
- HU Wei 1
- Weiheng Wu 1
- Sui Xiaoyu 1
- Xu Xinran 1
- An Yang 1
- Kexin Yang 1
- Sen Yang 1
- Zhang Yi 1
- Bowen Yu 1
- Qiu Yuanyang 1
- Klaus Zechner 1
- Li Zhang 1
- Ying Zhang 1