Open Problems Solved by LLMs? A Survey of Verifiable Mathematical Discovery

Ioannis Tzachristas, Georgios Tzachristas, Aifen Sui


Abstract
Recent years have produced a small but rapidly growing set of results where Large Language Models (LLMs) - usually embedded in a search-and-verification loop - advance the state of the art on problems previously regarded as "open" in the pragmatic sense of lacking a best-known construction, bound, or proof certificate. This paper surveys that emerging line of work with a Big Picture emphasis: what makes these successes possible, what should count as "solved", and what design patterns generalize? We (i) propose an evidence ladder for interpreting "LLM solved an open problem" claims, (ii) map mathematical subfields by difficulty dimensions that matter for LLM-based discovery, (iii) curate a timeline of key breakthroughs leading to verifiable discovery systems, and (iv) synthesize the techniques and frameworks - tool use, retrieval, search, and verification - that repeatedly appear in successful case studies. We give particular attention to formal-methods backends common in security and verification contexts, including Linear Temporal Logic (LTL) and Satisfiability Modulo Theories (SMT) solvers, as scalable middle-layer verifiers between lightweight tests and proof assistants. We close with an evaluation and reproducibility checklist aimed at making the next wave of claims easier to trust, reproduce, and build upon, while separating peer-reviewed or certificate-backed results from fast-moving community reports that are useful signals but not yet stable evidence.
Anthology ID:
2026.bigpicture-main.2
Volume:
Proceedings of The Big Picture v2: Crafting a Research Narrative
Month:
July
Year:
2026
Address:
San Diego, CA, USA
Editors:
Yanai Elazar, Allyson Ettinger, Nora Kassner, Sebastian Ruder
Venues:
BigPicture | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
10–21
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bigpicture-main.2/
DOI:
Bibkey:
Cite (ACL):
Ioannis Tzachristas, Georgios Tzachristas, and Aifen Sui. 2026. Open Problems Solved by LLMs? A Survey of Verifiable Mathematical Discovery. In Proceedings of The Big Picture v2: Crafting a Research Narrative, pages 10–21, San Diego, CA, USA. Association for Computational Linguistics.
Cite (Informal):
Open Problems Solved by LLMs? A Survey of Verifiable Mathematical Discovery (Tzachristas et al., BigPicture 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.bigpicture-main.2.pdf