@inproceedings{tzachristas-etal-2026-open,
title = "Open Problems Solved by {LLM}s? A Survey of Verifiable Mathematical Discovery",
author = "Tzachristas, Ioannis and
Tzachristas, Georgios and
Sui, Aifen",
editor = "Elazar, Yanai and
Ettinger, Allyson and
Kassner, Nora and
Ruder, Sebastian",
booktitle = "Proceedings of The Big Picture v2: Crafting a Research Narrative",
month = jul,
year = "2026",
address = "San Diego, CA, USA",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/ingest-acl-workshops/2026.bigpicture-main.2/",
pages = "10--21",
ISBN = "979-8-89176-416-3",
abstract = "Recent years have produced a small but rapidly growing set of results where Large Language Models (LLMs) - usually embedded in a search-and-verification loop - advance the state of the art on problems previously regarded as ``open'' in the pragmatic sense of lacking a best-known construction, bound, or proof certificate. This paper surveys that emerging line of work with a Big Picture emphasis: what makes these successes possible, what should count as ``solved'', and what design patterns generalize? We (i) propose an evidence ladder for interpreting ``LLM solved an open problem'' claims, (ii) map mathematical subfields by difficulty dimensions that matter for LLM-based discovery, (iii) curate a timeline of key breakthroughs leading to verifiable discovery systems, and (iv) synthesize the techniques and frameworks - tool use, retrieval, search, and verification - that repeatedly appear in successful case studies. We give particular attention to formal-methods backends common in security and verification contexts, including Linear Temporal Logic (LTL) and Satisfiability Modulo Theories (SMT) solvers, as scalable middle-layer verifiers between lightweight tests and proof assistants. We close with an evaluation and reproducibility checklist aimed at making the next wave of claims easier to trust, reproduce, and build upon, while separating peer-reviewed or certificate-backed results from fast-moving community reports that are useful signals but not yet stable evidence."
}Markdown (Informal)
[Open Problems Solved by LLMs? A Survey of Verifiable Mathematical Discovery](https://preview.aclanthology.org/ingest-acl-workshops/2026.bigpicture-main.2/) (Tzachristas et al., BigPicture 2026)
ACL