Research #openai #discrete geometry #reasoning models #mathematics

Erdős Unit Distance Conjecture Disproved: Inside the OpenAI Proof

OpenAI's reasoning model disproved an 80-year-old geometry conjecture — verified by a nine-mathematician team including a Fields Medalist.

Creeta

May 28, 2026

Erdős Unit Distance Conjecture Disproved: Inside the OpenAI Proof

The Erdős Unit Distance Problem: What Was Actually Conjectured

The unit distance problem — formally written as U(n) — asks a precise combinatorial question: given n points placed arbitrarily in the Euclidean plane, what is the maximum number of pairs of those points at exactly distance 1 apart? Paul Erdős posed the question in 1946 , and for roughly 80 years the best-known constructions were square-grid and lattice point arrangements . These yield approximately n^(1+o(1)) unit-distance pairs — an exponent that grows subpolynomially, meaning the improvement over n alone is slower than any fixed power of n. The conjecture, never formally proved, held that no arrangement could achieve a genuinely polynomial improvement over these grid baselines. Improving U(n) by a fixed exponent δ > 0 was widely considered out of reach.

Quick Answer: U(n) measures the maximum unit-distance pairs for n points in the Euclidean plane. Since 1946, lattice constructions set the bar at roughly n^(1+o(1)) pairs. In May 2026, an OpenAI reasoning model produced a construction achieving n^(1+0.014) — a genuine polynomial improvement, independently verified by nine mathematicians including Fields Medalist Tim Gowers.

To be precise about the notation: the o(1) in n^(1+o(1)) means the extra exponent shrinks toward zero as n grows. In practice, the best lattice constructions add only logarithmic or near-logarithmic density bonuses. The conjecture was that this limitation was fundamental — that U(n) could not grow as n^(1+c) for any fixed constant c > 0. That is the claim that has now been disproved.

The problem held a central position in combinatorial and discrete geometry because it connects multiple fields: the count of integer representations as sums of two squares (number theory), unit-distance graphs (graph theory), and extremal point-set questions (combinatorics). Results in one domain frequently implied bounds in others, giving the conjecture structural significance beyond the specific question it asks.

The difficulty was also simultaneously asymptotic and constructive. A genuine polynomial improvement required an infinite family of configurations — not a single clever arrangement for a fixed n — and required proving the count held for all sufficiently large n. Lattice constructions were effective precisely because they exploit algebraic regularity in integer arithmetic. Beating them required a different algebraic foundation, not just a refinement of grid geometry. For 80 years, no one found one.

What the Model Produced: The n^(1+0.014) Construction

On May 20, 2026, OpenAI announced that an internal general-purpose reasoning model had produced an infinite family of point configurations achieving n^(1+δ) unit-distance pairs, where δ = 0.014 . This is a true polynomial improvement — not a logarithmic refinement, not an approximation artifact — over every previously known construction. At one million points, configurations built via this method contain tens of thousands more unit-distance pairs than the best grid arrangement of comparable size .

The δ = 0.014 exponent may appear modest, but its significance is categorical, not quantitative. Princeton mathematician Will Sawin, one of the nine verifying mathematicians, confirmed that the improvement is a genuine polynomial gain rather than a finite-sample artifact or rounding error. The distinction is critical: a polynomial improvement means the gap between the new construction and lattice arrangements grows without bound as n increases — it cannot be explained away by numerical imprecision.

"δ = 0.014 is a real polynomial improvement — not a rounding artifact," — Will Sawin, Mathematician, Princeton University, as characterized in the verification analysis (source: dasroot.net)

Construction Type	Unit-Distance Pair Count (asymptotic)	Exponent δ	Algebraic Basis
Square-grid / Lattice (prior best)	≈ n^1+o(1)	δ → 0 (subpolynomial)	Integer arithmetic, rational grid points
OpenAI model construction (2026)	≈ n^1.014	δ = 0.014 (fixed polynomial)	Algebraic integers via class field towers

The construction is significant for what it is not: it is not an exhaustive search, not a verified-by-enumeration result, and not a specialization of any existing discrete geometry framework. The model synthesized techniques from Golod-Shafarevich group theory, infinite class field towers, and analytic number theory results by Ellenberg–Venkatesh and Hajir–Maire–Ramakrishna . None of these prior results had been connected to the unit-distance problem before this construction appeared. The model did not simply refine existing lattice approaches — it introduced algebraic machinery from an entirely different domain.

The Algebraic Engine: Golod-Shafarevich, Class Field Towers, and the Cross-Domain Synthesis

The construction's core mathematical engine draws on three bodies of prior work, none of which originated in discrete geometry. Understanding why these tools solve the unit-distance problem requires following the chain of reasoning that connects abstract algebra to point configurations in the Euclidean plane — and recognizing that this chain was not previously drawn in the published literature.

Golod-Shafarevich theory (1964) is a result in group theory and algebraic number theory that guarantees the existence of infinite towers of algebraic number fields with controlled discriminants . A number field's discriminant measures how densely its algebraic integers are packed relative to the ordinary integers — small discriminants mean the field's integers are algebraically close-packed in a precise technical sense. The Golod-Shafarevich theorem guarantees that such towers exist and can be constructed explicitly given appropriate conditions on the base field. This was not a new result when the model used it, but it had no prior application to combinatorial geometry.

Infinite class field towers are the mechanism that translates Golod-Shafarevich into a point configuration. A class field tower is a sequence of algebraic number fields — each a specific type of extension of the previous one — with the property that the Galois group at each level has controlled structure. When the tower is infinite (guaranteed by Golod-Shafarevich under the right base-field conditions), the algebraic integers at successive levels have discriminants that grow slowly relative to the degree of the field. This slow growth is structurally important: it means the integers at high levels of the tower are densely packed, with many pairwise differences landing close to unit norm.

The translation into a unit-distance configuration is then direct: take algebraic integers from a high level of the tower and embed them as points in the Euclidean plane via the real and imaginary parts of their complex conjugates. The discriminant control guarantees that a polynomial fraction of all pairwise differences have absolute value near 1. With selection and rescaling, this produces the n^(1+0.014) count. The algebraic density of the tower, not the geometric regularity of a grid, is the source of the improvement.

Ellenberg–Venkatesh and Hajir–Maire–Ramakrishna are analytic number theory and algebraic geometry results that provide the quantitative bounds on tower discriminants needed to extract a concrete exponent δ from the construction . Without these bounds, the existence of the tower can be proved but a specific value of δ cannot be computed. The Ellenberg–Venkatesh and Hajir–Maire–Ramakrishna results turn an existential guarantee into a quantitative one — and specifically, they are tight enough to yield δ = 0.014 rather than a weaker or conditional bound.

The cross-domain synthesis is the central contribution. Golod-Shafarevich was known to specialists in algebraic K-theory and infinite group theory. Ellenberg–Venkatesh and Hajir–Maire–Ramakrishna belong to the literature on number fields with small root discriminants. The unit-distance problem belongs to combinatorial and discrete geometry. There was no established connection between these bodies of work — no survey suggesting class field towers as a tool for extremal point-set geometry, no prior paper treating Golod-Shafarevich as relevant to U(n). The model identified the connection and assembled the proof components from separate literatures.

To be precise about what "synthesis" means here: each individual ingredient — Golod-Shafarevich, class field towers, the Ellenberg–Venkatesh bounds — existed in the literature. What did not exist was the recognition that they composed into a solution to U(n). That composition is the mathematical novelty. It does not require deriving new lemmas from first principles; it requires identifying that the right pieces were already available in different fields and connecting them correctly. That is a different kind of difficulty from incremental extension of known tools, and it is the kind of difficulty that is difficult to structure as a retrieval or search problem.

Verification: The Nine-Mathematician Companion Paper

Mathematical correctness of the construction was certified by a nine-mathematician team in a 19-page companion paper titled "Remarks on the disproof of the unit distance conjecture" . The panel includes Tim Gowers, who received the Fields Medal in 1998 , alongside Noga Alon, Will Sawin (Princeton), Thomas Bloom, Daniel Litt, Arul Shankar, Jacob Tsimerman, Victor Wang, and Melanie Matchett Wood. The coverage across combinatorics, number theory, and algebraic geometry directly matches the fields the construction draws on.

The companion paper "certified the mathematical correctness of the construction and provided broader context on the significance of the result within discrete geometry," per the OpenAI announcement describing the team's scope. (source: OpenAI)

Verification Team Member	Primary Field	Affiliation
Tim Gowers	Combinatorics / Functional Analysis	Collège de France / Cambridge (Fields Medal 1998 )
Noga Alon	Combinatorics / Graph Theory	Princeton University
Will Sawin	Algebraic Geometry / Number Theory	Princeton University
Melanie Matchett Wood	Algebraic Number Theory	Harvard University
Jacob Tsimerman	Number Theory	University of Toronto
Arul Shankar	Number Theory	University of Toronto
Daniel Litt	Algebraic Geometry	University of Toronto
Thomas Bloom	Analytic Number Theory	University of Oxford
Victor Wang	Analytic Number Theory	Princeton University

The primary research paper was authored by Lijie Chen, a researcher with OpenAI affiliation, with the model's contribution described as central to the discovery — not as a postprocessing or verification step performed after a human identified the construction. This authorship model — human name on the primary paper, AI contribution described narratively in the announcement — reflects the absence of any established convention for crediting AI-originated mathematical results.

It is important to specify what the verification team covered. They certified that the construction is mathematically correct and achieves the claimed n^(1+0.014) unit-distance pair count . What was not in scope — because it was not disclosed — is the reasoning process that produced the construction: the model's chain-of-thought, the number of inference attempts, and the degree of human framing involved. The mathematical output was independently verified. The generative process was not.

What OpenAI Has Not Disclosed

The OpenAI announcement describes the result as coming from "a new general-purpose reasoning model" . That description leaves substantial gaps for researchers and developers trying to understand what actually happened and whether it is reproducible.

OpenAI's blog describes the system as having "produced new mathematical results" — notably not claiming human-equivalent mathematical understanding or general reasoning capability. The phrasing is deliberate and minimal. (source: OpenAI)

Model identity: No model name, version, or release date has been disclosed as of May 2026 . It is explicitly described as not a math-specific system and not a scaffolded proof-search engine. Whether it belongs to the o-series family, a pre-release variant, or a different architecture is unspecified — which makes it difficult to contextualize against what developers currently have access to via API.

Chain-of-thought: The internal reasoning trace used to arrive at the construction was not shared with the verification team and has not been published. External researchers cannot analyze the reasoning process, identify whether the model followed a recognizable mathematical strategy, or determine whether it would generalize to structurally similar problems. Mathematical correctness of the output is verified. The process is opaque.

Scaffolding: The degree of human involvement in problem framing, prompt iteration, and result selection is unspecified. Did the model receive the problem statement cold? Was it decomposed or rephrased by researchers before submission? Were outputs from multiple inference runs compared and filtered? These questions bear directly on the "AI-originated" characterization, and none have been answered publicly.

Reproducibility gap: For mathematical results, correctness and reproducibility are normally separate standards. The construction has been verified as correct. But external researchers cannot reproduce the generative process, audit the reasoning chain, or test whether the same model would produce the same or an equivalent construction under independent prompting. Technical community discussion on Lobste.rs has flagged this specifically as a bottleneck for treating this result as a precedent for AI mathematical capability .

AI-Originated vs AI-Assisted Mathematics: Why the Distinction Matters

Prior milestones in AI and mathematics operated in a structurally different mode. AlphaProof searched for Lean 4 proofs of olympiad problems — well-posed problems with known difficulty levels, in a human-defined formal system. Lean and Coq proof assistants help humans verify and fill steps in proofs whose direction humans have already established. GitHub Copilot for proof writing accelerates code generation but does not propose mathematical direction. These are AI-assisted workflows: the human provides the conjecture, the approach, and the framing; the AI fills steps or checks correctness.

The unit-distance result is characterized as AI-originated: the model proposed the core construction — specifically, the connection between class field towers and the U(n) bound — autonomously . If this characterization holds under scrutiny (and the scaffolding questions from the previous section are directly relevant here), it represents a qualitative shift from AI as proof-checker to AI as conjecture-solver. The model was not given a candidate construction to verify — it produced the candidate.

The distinction has practical implications beyond credit attribution. AI-assisted tools fit naturally into existing research workflows: use them as a fast literature reviewer or a step-validator, with the human still driving the mathematical direction. AI-originated results require rethinking where in the research pipeline the AI sits. If the model is producing the core ideas, not just filling steps, the relevant operational question changes from "how do we verify this output?" to "how do we identify which problems are worth setting up for the model?"

Academic authorship and credit are not equipped for this. The publication model used here — human author on the primary paper, AI contribution described narratively in the announcement — is a pragmatic workaround, not a framework. It does not address reproducibility (who do you contact if you cannot verify a step?), priority (what is the date of "discovery" when an AI produced the output during an inference run?), or replication (how does another group repeat the experiment?). These conventions were designed for human collaboration and will require explicit revision as AI-originated outputs become more common.

Benchmark coverage is a concrete and immediate gap. AIME, MATH, MiniF2F, and IMO shortlist problems test structured problem-solving: well-posed questions, known answers, evaluation against those answers. Open-ended conjecture work — "here is a class of problems, find something new" — is not measured by any current public benchmark. If you are evaluating a reasoning model for research or engineering tooling, your existing benchmark suite will not predict whether the model can do what happened here . That is a gap worth building around, not ignoring.

Technical skeptics have noted that the construction assembles known components — Golod-Shafarevich, Ellenberg-Venkatesh, class field towers — rather than deriving mathematics from first principles. This is a fair observation. But cross-domain synthesis is how most major mathematical advances work: the Wiles proof of Fermat's Last Theorem assembled elliptic curves, modular forms, and Galois representations that each existed independently. The question is whether the model identified the synthesis, or whether human guidance pointed it there. That question remains open because the scaffolding is undisclosed.

For Developers: What to Watch in Reasoning Model Capabilities

For teams building with or evaluating reasoning models, the unit-distance result carries several practical signals — independent of the marketing context around it.

Domain-crossing in long chain-of-thought: The construction required connecting group theory (Golod-Shafarevich, 1964 ), algebraic number theory (class field towers, Ellenberg–Venkatesh bounds), and combinatorial geometry (U(n) bounds). Structured search and retrieval-augmented approaches typically operate within a single domain's vocabulary and link structure. Extended chain-of-thought reasoning appears capable of traversing domain boundaries in ways that retrieval cannot. If you are framing problems for reasoning models, how broadly or narrowly you scope the problem domain in the prompt may matter more than model selection alone .

Opacity as a practical bottleneck: Verified output without an auditable reasoning process limits what you can build on top of a result. In software terms: if you cannot inspect the derivation, you cannot write tests against the reasoning steps, identify which parts of the approach transfer to adjacent problems, or debug failures when the model produces an incorrect output. The verification team certified the output; they did not certify the process. For AI-for-science or AI-for-engineering workflows, this gap has operational consequences.

What to watch over the next 6–12 months:

Model disclosure: Whether OpenAI names the model and makes it available via API. If this is an o-series variant, the capability may be partially accessible today. If it is a distinct architecture, the timeline is unknown.
Formal verification integration: Whether Lean or Coq verification becomes a reproducibility layer on top of model-generated mathematical outputs. This would address chain-of-thought opacity by providing a checkable certificate independent of the generative process — a certificate anyone can verify without access to the model.
Replication attempts: Whether independent researchers can reproduce an equivalent construction using currently available models. This would bound how much of the capability depends on the specific undisclosed system versus general reasoning model capacity at the frontier.
Benchmark evolution: Whether open-ended synthesis tasks appear in public evaluation suites. If your current evaluation covers only competition math, it will not predict open-ended conjecture capability .

The result is a single data point, not a trend line. One construction in one subfield of combinatorics does not establish that reasoning models can systematically resolve open conjectures. But it is a data point that sits outside the capability class your current benchmarks measure, produced by a system whose identity has not been disclosed. Both facts are worth tracking.

Frequently Asked Questions

What is the Erdős unit distance conjecture?

The Erdős unit distance conjecture concerns U(n): the maximum number of pairs at exactly unit distance among n points placed arbitrarily in the Euclidean plane. For nearly 80 years, the best known constructions were square-grid and lattice arrangements, yielding roughly n^(1+o(1)) unit-distance pairs — meaning the exponent above n grows subpolynomially, essentially the rate of n times a very slowly growing function . The conjecture held that no arrangement could achieve a genuine polynomial improvement — that no fixed c > 0 exists such that U(n) grows as n^(1+c) for all large n. This was never formally proved, but it resisted attack since Erdős posed the problem in 1946 . The difficulty was both asymptotic and constructive: disproving it required an infinite family of configurations — not one clever finite arrangement — that beats the lattice baseline for all large n.

Which OpenAI model solved the Erdős unit distance problem?

OpenAI has not disclosed the model name, version, or training details as of May 2026 . It is described only as "a new general-purpose reasoning model" — not a math-specific system, not a model fine-tuned for theorem proving or proof search, and not a system specifically targeted at the unit-distance problem. Whether it belongs to the o-series family or represents a distinct architecture has not been stated publicly. The model identity remains undisclosed.

Was the mathematical proof independently verified?

Yes. A nine-mathematician team authored a 19-page companion paper titled "Remarks on the disproof of the unit distance conjecture" certifying the mathematical correctness of the construction . The team includes Fields Medalist Tim Gowers (1998) , Princeton's Will Sawin and Noga Alon, and six other specialists in combinatorics, number theory, and algebraic geometry — the exact fields the construction draws on. Their verification covered the construction and its mathematical properties: specifically, whether it achieves n^(1+0.014) unit-distance pairs as claimed. The reasoning process that produced the construction was not reviewed, as it was not disclosed to the verification team.

What is Golod-Shafarevich theory and why does it appear in a geometry proof?

Golod-Shafarevich is a 1964 result in group theory and algebraic number theory that guarantees the existence of infinite towers of algebraic number fields with controlled discriminants . A discriminant measures how densely packed the algebraic integers in a number field are relative to the ordinary integers. In the unit-distance construction, Golod-Shafarevich provides the structural guarantee that an infinite tower of fields with small discriminants exists. Algebraic integers drawn from high levels of such a tower, when embedded as points in the Euclidean plane via their complex conjugates, have many pairwise differences landing near unit norm. The construction exploits this algebraic density to achieve the polynomial improvement in unit-distance pair count. There was no prior published connection between Golod-Shafarevich theory and discrete geometry — the cross-domain synthesis is what the model contributed.

Does this mean AI can now solve any major open mathematical conjecture?

No. This is one result in one subfield, and several important caveats apply. The degree of human scaffolding in problem setup — how the problem was framed, how many prompting iterations occurred, how outputs were selected — has not been disclosed, which makes it difficult to characterize precisely how autonomous the discovery was . The reasoning trace is not public. The model is not accessible to outside researchers. The construction assembles existing mathematical tools rather than deriving new mathematics from first principles. Whether the same approach transfers to other open problems in combinatorics, number theory, or other subfields is entirely unknown. This result is a notable data point about reasoning model capability in open-ended synthesis; it is not evidence that automated conjecture resolution is a general, repeatable capability.

What Comes Next: Open Questions and the Broader Signal

The unit-distance result is significant for what it demonstrates about general-purpose reasoning at the frontier, and equally significant for the questions it leaves open. A model without domain-specific fine-tuning identified a connection between 1964 group theory and a 1946 combinatorial geometry problem , assembled the relevant analytic number theory machinery, and produced a construction that cleared expert review by nine specialists covering the exact fields it drew on. That is a concrete, externally-verified observation — independent of how it is characterized in press coverage.

For practitioners, the actionable signal is narrow and specific. First, the cross-domain synthesis capability visible here is not captured by current public benchmarks — if you evaluate models for research or engineering tooling using structured competition math, those scores will not predict open-ended synthesis performance. Second, the opacity of the reasoning trace is a real operational bottleneck for building on model-generated results; formal verification integration (Lean, Coq) is the most plausible near-term mitigation, and whether OpenAI pursues this as a reproducibility layer is worth watching. Third, model disclosure and API access will determine whether this capability class is exploitable by teams outside OpenAI or whether it remains a closed demonstration.

The broader question — whether AI-originated mathematical results will become routine and what that means for how knowledge is produced, credited, and peer-reviewed — is genuinely open. The publication and verification model used here is a pragmatic workaround for a situation the academic system was not designed for. The fact that a nine-mathematician team including a Fields Medalist co-authored a companion paper certifying an AI-generated construction, while OpenAI's blog uses cautious language about "producing results," is itself informative about where the frontier currently sits: consequential enough to require serious verification, uncertain enough to require careful framing. Tracking the next few results in this space — and specifically, whether scaffolding details are disclosed — will tell you more than this single data point can.

Last updated: 2026-05-28. Article reflects information publicly available as of the OpenAI announcement; model identity, reasoning trace, and scaffolding details remain undisclosed as of this date. Review OpenAI's announcement and the dasroot.net technical summary for primary sources.