XiaoYan Li
Also published as: Xiaoyan Li
2026
RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension
Yelin Chen | Fanjin Zhang | Suping Sun | Yunhe Pang | Yuanchun Wang | Jian Song | XiaoYan Li | Lei Hou | Shu Zhao | Jie Tang | Juanzi Li
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yelin Chen | Fanjin Zhang | Suping Sun | Yunhe Pang | Yuanchun Wang | Jian Song | XiaoYan Li | Lei Hou | Shu Zhao | Jie Tang | Juanzi Li
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Understanding research papers remains challenging for foundation models due to specialized scientific discourse and complex figures and tables, yet existing benchmarks offer limited fine-grained evaluation at scale. To address this gap, we introduce RPC-Bench, a large-scale question-answering benchmark built from review–rebuttal exchanges of high-quality computer science papers, containing 15K human-verified QA pairs. We design a fine-grained taxonomy aligned with the scientific research flow to assess models’ ability to understand and answer why, what, and how questions in scholarly contexts. We also define an elaborate LLM–human interaction annotation framework to support large-scale labeling and quality control. Following the LLM-as-a-Judge paradigm, we develop a scalable framework that evaluates models on correctness-completeness and conciseness, with high agreement to human judgment. Experiments reveal that even the strongest models (GPT-5) achieve only 68.2% correctness-completeness, dropping to 37.46% after conciseness adjustment, highlighting substantial gaps in precise academic paper understanding.
2025
Enhancing the Planning Capabilities of Large Language Models by Building External World Models
Edwin Chen | Xiaoyan Li | Colin Bellinger | Yunli Wang
Proceedings of the 2nd Workshop on Agent AI for Scenario Planning
Edwin Chen | Xiaoyan Li | Colin Bellinger | Yunli Wang
Proceedings of the 2nd Workshop on Agent AI for Scenario Planning
2021
Text Style Transfer: Leveraging a Style Classifier on Entangled Latent Representations
Xiaoyan Li | Sun Sun | Yunli Wang
Proceedings of the 6th Workshop on Representation Learning for NLP (RepL4NLP-2021)
Xiaoyan Li | Sun Sun | Yunli Wang
Proceedings of the 6th Workshop on Representation Learning for NLP (RepL4NLP-2021)
Learning a good latent representation is essential for text style transfer, which generates a new sentence by changing the attributes of a given sentence while preserving its content. Most previous works adopt disentangled latent representation learning to realize style transfer. We propose a novel text style transfer algorithm with entangled latent representation, and introduce a style classifier that can regulate the latent structure and transfer style. Moreover, our algorithm for style transfer applies to both single-attribute and multi-attribute transfer. Extensive experimental results show that our method generally outperforms state-of-the-art approaches.
2010
Exploiting Multi-Features to Detect Hedges and their Scope in Biomedical Texts
Huiwei Zhou | Xiaoyan Li | Degen Huang | Zezhong Li | Yuansheng Yang
Proceedings of the Fourteenth Conference on Computational Natural Language Learning – Shared Task
Huiwei Zhou | Xiaoyan Li | Degen Huang | Zezhong Li | Yuansheng Yang
Proceedings of the Fourteenth Conference on Computational Natural Language Learning – Shared Task