What has AI actually proved in mathematics recently?

Highlights from 2025–2026 include IMO gold-level performance from DeepMind and OpenAI systems, Google DeepMind’s Aletheia producing publishable Ph.D.-level results in arithmetic geometry, OpenAI’s general model disproving a combinatorial geometry conjecture related to the planar unit distance problem, and Math Inc.’s Gauss agent formalizing Maryna Viazovska’s sphere-packing work in Lean in days rather than years.

What is “Big Mathematics”?

Terence Tao’s term for large-scale, decentralized collaboration where humans handle creative strategy and AI handles technical grunt work—often mediated by formal proof assistants like Lean that verify every logical step so trust does not depend on reputation alone.

Should math students use AI?

Researchers warn that skipping the struggle weakens intuition—the same muscle Venkatesh and Fraser describe as central to mathematics. Useful guidance emerging from the community treats AI like a calculator for parts of a problem, not a shortcut past understanding. Institutions are drafting publication and research norms accordingly.

What are the three futures for AI in math?

IEEE summarizes them as AI as tool (human understanding first), AI as partner (shared discovery), and AI as oracle (answers matter most, and humans may interpret results they did not derive). Most working mathematicians lean toward tool or partner; the oracle future is what worries students at venues like the Heidelberg Laureate Forum.

How do proof assistants like Lean fit in?

Lean, Isabelle, and Rocq check proofs step-by-step. Historically, translating informal proofs into that machine-readable form took enormous human effort. LLMs are now automating much of that formalization pipeline, which Terence Tao argues unlocks safer mass collaboration—including with anonymous contributors and AI agents whose work can be verified, not merely trusted.
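
As a rough illustration of what "checking step-by-step" means, here is a minimal Lean 4 sketch that uses only the core library (no Mathlib). The theorem name `add_comm_example` is made up for this post; `Nat.add_comm` is the core lemma it leans on. Lean accepts the file only if each proof term really has the type of the statement it claims to prove.

```lean
-- Minimal Lean 4 sketch, core library only (no Mathlib).
-- `add_comm_example` is a hypothetical name chosen for illustration.

-- The statement: addition on natural numbers is commutative.
-- The proof: an appeal to the core lemma `Nat.add_comm`.
theorem add_comm_example (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b

-- A concrete fact checked by computation: both sides reduce to `4`.
example : 2 + 2 = 4 := rfl
```

If the proof were swapped for something that does not match the statement (say, `Nat.mul_comm a b`), Lean would reject the file with a type error rather than accept it on the author's authority. That mechanical rejection is the property the trust argument depends on.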

Does “Lean compiles” mean an AI proof is trustworthy?

It means the proof is logically valid *for the statement Lean checked*—not that humans chose the right theorem statement, imported the right definitions, or produced something maintainable. A vacuous or mis-specified formalization can compile while proving the wrong thing, similar to a test suite that passes with trivial assertions. You still need expert oversight.
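
A small Lean 4 sketch can make the "compiles but proves the wrong thing" failure mode concrete. Everything here is invented for illustration (the names `looks_impressive`, `BadPrime`, and `four_is_bad_prime` are hypothetical and come from no real formalization); it uses only the Lean core library.

```lean
-- Failure mode 1: a vacuous hypothesis.
-- No natural number satisfies `n < 0`, so the theorem holds for an
-- empty set of cases. It compiles, yet says nothing about any number.
theorem looks_impressive (n : Nat) (h : n < 0) : n = 42 :=
  absurd h (Nat.not_lt_zero n)

-- Failure mode 2: a mis-specified definition.
-- This hypothetical "prime" predicate forgot the divisibility condition.
def BadPrime (p : Nat) : Prop := 2 ≤ p

-- Lean happily verifies that 4 is "prime" under the wrong definition.
theorem four_is_bad_prime : BadPrime 4 := by
  unfold BadPrime
  decide
```

Both theorems pass the checker, and neither tells you anything you wanted to know. Catching this requires a person to read the statements and definitions, not just the green checkmark.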

What is the Mathlib vs “mathslop” debate?

After Gauss autoformalized Viazovska’s sphere-packing theorems, formal-math practitioners pushed back: Mathlib is a human-curated library with reusable APIs, whereas a ~200,000-line machine-generated blob may verify in Lean but lacks an intelligible interface, so nobody would merge it into shared infrastructure. Critics warn of two layers: maintainable formal math, and orphaned verified results that humans never read.

Will AI replace mathematicians?

Not entirely, according to most researchers IEEE interviewed—but the job is changing fast. Some tasks (formalization, case checking, searching solution spaces) are already fair game for AI. The open debate is whether humans stay in the loop for intuition and understanding, collaborate as partners, or become interpreters of opaque machine proofs.

TL;DR — the questions people actually ask

Will AI replace mathematicians? Not wholesale—but it is already competitive on some abstract reasoning tasks, and the kind of work valued is shifting.
What changed in 2025–2026? IMO gold, Aletheia’s research-level output, OpenAI’s autonomous disproof of a major geometry conjecture, and LLMs accelerating Lean formalization.
What is Terence Tao’s “Big Mathematics”? Humans plus machines at scale, with formal verification as the trust layer.
Should students use AI for homework? The community’s worry: yes, they will—and they may skip building the intuition that makes someone a mathematician, not just an answer-fetcher.
Does “Lean compiles” settle everything? No—you still need humans to specify the right theorem and interpret whether the artifact is worth keeping.
Where is the primary source? Benjamin Skuse’s feature in IEEE Spectrum (June 25, 2026)—plus the skeptical thread that followed on Hacker News when the piece hit the front page.

Why this IEEE piece landed like a bombshell

Benjamin Skuse opens with a confession many applied-math Ph.D.s will recognize: re-read your thesis a decade later and you realize a modern LLM-assisted workflow might finish it in days. The pure mathematicians at the next desk over looked idle for years, and some never published. Only later did he understand that they were not performing intelligence or masochism; they were pursuing the slow, private joy of the moment a hard idea clicks.

That joy is what IEEE asks whether AI threatens. Jeremy Avigad at Carnegie Mellon calls it neither purely aesthetic nor athletic, but the feeling when “you’ve been thinking long and hard about something complex, difficult, and then, all of a sudden, it just comes together.”

The article is not a product launch. It is a status report from the Heidelberg Laureate Forum (September 2025), where young researchers heard elders describe futures in which superhuman AI forms conjectures, searches spaces, proves theorems, verifies proofs, and generalizes—without humans in the loop. Yang-Hui He’s line stuck: human mathematicians could become “priests to oracles.” Students in the hall, Skuse reports, looked devastated.

If you ship AI agents for a living, this is the same existential question wearing a chalkboard costume: When the machine can do the hard part, what is left for people—and does “left” still matter?

| | AI as tool | AI as partner | AI as oracle |
| --- | --- | --- | --- |
| Role of AI | Assistant | Collaborator | Autonomous researcher |
| What matters most | Human understanding | Shared discovery | Answers |
| Human fear level | Low | Mixed | High |

Will AI replace mathematicians? What IEEE’s “Big Mathematics” debate means for proofs, Lean, and your career

Why this IEEE piece landed like a bombshell

What AI has actually done in mathematics (not hype)

Competition math at IMO gold level

Aletheia and Ph.D.-level research output

OpenAI and combinatorial geometry

Formalization: Gauss, Viazovska, and Lean

The Mathlib vs “verified blob” fight

What skeptics on Hacker News got right (without the hype)

“Lean compiles or it doesn’t” — half true

You still need Terence Tao—or someone close

Mochizuki reminds us verification culture ≠ instant certainty

AI is still mostly at the “you missed a trick” stage

Why not all three futures at once?

Access and review queues get worse before they get better

Three futures: tool, partner, oracle

Terence Tao on formal verification as the trust layer

Risks IEEE highlights (and why developers should care)

Access and elitism

Motivation and intellectual atrophy

Publication and norms

So is AI “sucking the soul out of math”?

What to watch next

Bottom line