Synthetic Proofs with Tool-Integrated Reasoning: Contrastive Alignment for LLM Mathematics with Lean
Mark Obozov, Michael Diskin, Aleksandr Beznosikov, Alexander Gasnikov, Serguei Barannikov
Abstract
Modern mathematical reasoning benchmarks primarily focus on answer finding rather than proof verification, creating a gap in evaluating the proving capabilities of large language models (LLMs). We present a methodology for generating diverse mathematical proof tasks using formal tools. Our approach combines Lean-based synthetic problem generation with a Tool-Integrated Reasoning (TiR) framework for partial (sampling-based) proof validation, and it uses contrastive preference optimization to align the model’s proof outputs. Experiments on the Qwen-2.5 family of models demonstrate meaningful improvements in mathematical reasoning, particularly for smaller models. Our aligned models achieve up to a 57% higher success rate than baselines on the MiniF2F benchmark (across 0.5B, 1.5B, and 7B parameter models). These results highlight the potential of synthetic data and integrated validation for advancing LLM-based mathematical reasoning.- Anthology ID:
- 2025.mathnlp-main.15
- Volume:
- Proceedings of The 3rd Workshop on Mathematical Natural Language Processing (MathNLP 2025)
- Month:
- November
- Year:
- 2025
- Address:
- Suzhou, China
- Editors:
- Marco Valentino, Deborah Ferreira, Mokanarangan Thayaparan, Leonardo Ranaldi, Andre Freitas
- Venues:
- MathNLP | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 195–202
- Language:
- URL:
- https://preview.aclanthology.org/ingest-emnlp/2025.mathnlp-main.15/
- DOI:
- Cite (ACL):
- Mark Obozov, Michael Diskin, Aleksandr Beznosikov, Alexander Gasnikov, and Serguei Barannikov. 2025. Synthetic Proofs with Tool-Integrated Reasoning: Contrastive Alignment for LLM Mathematics with Lean. In Proceedings of The 3rd Workshop on Mathematical Natural Language Processing (MathNLP 2025), pages 195–202, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal):
- Synthetic Proofs with Tool-Integrated Reasoning: Contrastive Alignment for LLM Mathematics with Lean (Obozov et al., MathNLP 2025)
- PDF:
- https://preview.aclanthology.org/ingest-emnlp/2025.mathnlp-main.15.pdf