Pengfei Zhou
Also published as: 鹏飞 周
2025
MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification
xu Zhao Pan
|
Pengfei Zhou
|
Jiaxin Ai
|
Wangbo Zhao
|
Kai Wang
|
Xiaojiang Peng
|
Wenqi Shao
|
Hongxun Yao
|
Kaipeng Zhang
Findings of the Association for Computational Linguistics: ACL 2025
Reasoning is an essential capacity for large language models (LLMs) to address complex tasks, whereas the identification of process errors is vital for improving this ability. Recently, process-level reward models (PRMs) were proposed to provide step-wise rewards that facilitate reinforcement learning and data production during training and guide LLMs toward correct steps during inference, thereby improving reasoning accuracy. However, existing benchmarks of PRMs are text-based and focus on error detection, neglecting other scenarios like reasoning search. To address this gap, we introduce MPBench, a comprehensive, multi-task, multimodal benchmark designed to systematically assess the effectiveness of PRMs in diverse scenarios. MPBench employs three evaluation paradigms, each targeting a specific role of PRMs in the reasoning process: (1) Step Correctness, which assesses the correctness of each intermediate reasoning step; (2) Answers Aggregation, which aggregates multiple solutions and selects the best one; and (3) Reasoning Process Search, which guides the search for optimal reasoning steps during inference. Through these paradigms, MPBench makes comprehensive evaluations and provides insights into the development of multimodal PRMs.
2023
差比句结构及其缺省现象的识别补全研究(A Study on Identification and Completion of Comparative Sentence Structures with Ellipsis Phenomenon)
Pengfei Zhou (周鹏飞)
|
Weiguang Qv (曲维光)
|
Tingxin Wei (魏庭新)
|
Junsheng Zhou (周俊生)
|
Bin Li (李斌)
|
Yanhui Gu (顾彦慧)
Proceedings of the 22nd Chinese National Conference on Computational Linguistics
“差比句是用来表达两个或多个事物之间的相似或不同之处的句子结构,常用句式为“X比Y+比较结果”。差比句存在多种结构变体且大量存在省略现象,造成汉语语法研究和自然语言处理任务困难,因此实现差比句结构的识别和对其缺省结构进行补全非常有意义。本文采用序列化标注方法构建了一个差比句语料库,提出了一个能够融合字与词信息的LatticeBERT-BILSTM-CRF模型来对差比句结构自动识别,并且能对缺省单位进行自动补全,实验结果验证了方法的有效性。”
Search
Fix author
Co-authors
- Jiaxin Ai 1
- Yanhui Gu (顾彦慧) 1
- Bin Li (李斌) 1
- xu Zhao Pan 1
- Xiaojiang Peng 1
- show all...