Xiaoxue Li
2026
Rethinking LLM Watermark Detection in Black-Box Settings: A Non-Intrusive Third-Party Framework
Zhuoshang Wang | Yubing Ren | Yanan Cao | Fang Fang | Xiaoxue Li | Li Guo
Findings of the Association for Computational Linguistics: ACL 2026
Zhuoshang Wang | Yubing Ren | Yanan Cao | Fang Fang | Xiaoxue Li | Li Guo
Findings of the Association for Computational Linguistics: ACL 2026
While watermarking serves as a critical mechanism for LLM provenance, existing secret-key schemes tightly couple detection with injection, requiring access to keys or provider-side scheme-specific detectors for verification. This dependency creates a fundamental barrier for real-world governance, as independent auditing becomes impossible without compromising model security or relying on the opaque claims of service providers. To resolve this dilemma, we introduce TTP-Detect, a pioneering black-box framework designed for non-intrusive, third-party watermark verification. By decoupling detection from injection, TTP-Detect reframes verification as a relative hypothesis testing problem. It employs a proxy model to amplify watermark-relevant signals and a suite of complementary relative measurements to assess the alignment of the query text with watermarked distributions. Extensive experiments across representative watermarking schemes, datasets and models demonstrate that TTP-Detect achieves superior detection performance and robustness against diverse attacks.
Reinforcement Learning on Pre-Training Data
Siheng Li | Kejiao Li | Zenan Xu | Guanhua Huang | Kun Li | Haoyuan Wu | Wujiajia | Zihao Zheng | Chenchen Zhang | Kun Shi | Xue Gong | Qi Yi | Ruibin Xiong | Tingqiang Xu | Yuhao Jiang | Jianfeng Yan | Yuyuan Zeng | Guanghui Xu | Jinbao Xue | Zhijiang xu | Zheng Fang | Shuai LI | Qibin Liu | Xiaoxue Li | Zhuoyu Li | Yangyu Tao | Fei Gao | Cheng Jiang | Bochao Wang | Kai Liu | Jianchen Zhu | Wai Lam | Bo Zhou | Di Wang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Siheng Li | Kejiao Li | Zenan Xu | Guanhua Huang | Kun Li | Haoyuan Wu | Wujiajia | Zihao Zheng | Chenchen Zhang | Kun Shi | Xue Gong | Qi Yi | Ruibin Xiong | Tingqiang Xu | Yuhao Jiang | Jianfeng Yan | Yuyuan Zeng | Guanghui Xu | Jinbao Xue | Zhijiang xu | Zheng Fang | Shuai LI | Qibin Liu | Xiaoxue Li | Zhuoyu Li | Yangyu Tao | Fei Gao | Cheng Jiang | Bochao Wang | Kai Liu | Jianchen Zhu | Wai Lam | Bo Zhou | Di Wang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Recent progress in large language models (LLMs) is largely driven by scaling training compute through either pre-training with next-token prediction (NTP) or post-training with reinforcement learning (RL). The former contributes to learning broad knowledge and skills from general data, while struggling with data inefficiency and catastrophic forgetting in continual learning settings. The latter incentivizes reasoning capabilities with strong generalization, but is constrained by limited data availability due to its reliance on human annotation. To alleviate these issues, we propose Reinforcement Learning on Pre-Training data (RLPT), which combines the advantages of learning from general data and RL. In particular, RLPT derives reward signals directly from general text data through a next-segment reasoning objective, rewarding the policy for correctly predicting next text segments conditioned on the prefix text. Experiments across multiple benchmarks and models demonstrate the effectiveness of . For example, RLPT yields substantial improvements in continual pre-training (+4.6%) and provides a strong foundation for post-training (+3.4%) on Qwen3-8B-Base.
2023
Using Deep Learning to Find the Next Unicorn: A Practical Synthesis on Optimization Target, Feature Selection, Data Split and Evaluation Strategy
Lele Cao | Vilhelm von Ehrenheim | Sebastian Krakowski | Xiaoxue Li | Alexandra Lutz
Proceedings of the Fifth Workshop on Financial Technology and Natural Language Processing and the Second Multimodal AI For Financial Forecasting
Lele Cao | Vilhelm von Ehrenheim | Sebastian Krakowski | Xiaoxue Li | Alexandra Lutz
Proceedings of the Fifth Workshop on Financial Technology and Natural Language Processing and the Second Multimodal AI For Financial Forecasting
Search
Fix author
Co-authors
- Lele Cao 1
- Yanan Cao 1
- Fang Fang 1
- Zheng Fang 1
- Fei Gao 1
- Xue Gong 1
- Li Guo 1
- Guanhua Huang 1
- Yuhao Jiang 1
- Cheng Jiang 1
- Sebastian Krakowski 1
- Shuai LI 1
- Wai Lam 1
- Siheng Li 1
- Kejiao Li 1
- Kun Li 1
- Zhuoyu Li 1
- Qibin Liu 1
- Kai Liu 1
- Alexandra Lutz 1
- Yubing Ren 1
- Kun Shi 1
- Yangyu Tao 1
- Vilhelm Von Ehrenheim 1
- Zhuoshang Wang 1
- Bochao Wang 1
- Di Wang 1
- Haoyuan Wu 1
- Wujiajia 1
- Ruibin Xiong 1
- Zenan Xu 1
- Tingqiang Xu 1
- Guanghui Xu 1
- Jinbao Xue 1
- Jianfeng Yan 1
- Qi Yi 1
- Yuyuan Zeng 1
- Chenchen Zhang 1
- Zihao Zheng 1
- Bo Zhou 1
- Jianchen Zhu 1
- Zhijiang xu 1