#025

ChatGPT cracked a 60-year Erdős problem for a 23-year-old, and OpenAI is paying $25K for NDAs

A 23-year-old with no math degree used ChatGPT to crack a 60-year Erdős problem that a leading expert had spent 7 years on. Karpathy’s wiki gets a multi-agent office. OpenAI buys NDAs.

A 23-year-old with no math training gave ChatGPT a 58-year-old conjecture on an idle Monday. GPT-5.4 Pro cracked Erdős Problem #1196 in 80 minutes, single attempt. Fields Medalist Terence Tao confirmed the proof and said the math world had a collective “mental block.”

The AI used a technique from a related field that no human expert had tried on this problem. Jared Lichtman, the world’s top specialist on primitive sets, had spent 7 years on this exact conjecture.

In today’s indie hacker news:

  • 23-year-old + ChatGPT cracks 60-year Erdős problem in 80 minutes
  • Devs ship 28-year-old projects after Claude Code finishes them
  • Karpathy’s LLM wiki gets a multi-agent office with shared Git memory
  • OpenAI pays $25K for a GPT-5.5 jailbreak you can’t talk about
  • Xiaomi MiMo V2.5 Pro matches frontier at $1/M tokens
  • Colorado exempts open source from age-verification bill

TOP STORIES

VIBE MATH WORKS

🧠 23-year-old with no math degree used ChatGPT to crack a 60-year Erdős problem

The story: Liam Price, 23, gave Erdős Problem #1196 to GPT-5.4 Pro “on an idle Monday afternoon.” He didn’t know what the problem was. The AI returned a complete proof in 80 minutes using a Markov chain approach with von Mangoldt weights, a technique from a related area of math that no human had tried on this problem. Terence Tao and Stanford’s Jared Lichtman refined and shortened the proof. Three days later, a stronger solution appeared from Nat Sothanaphan collaborating with GPT-5.4 Thinking.

The details:

  • Erdős Problem #1196: open since 1968, unsolved for 58 years. It concerns primitive sets, meaning sets of integers in which no element divides any other
  • GPT-5.4 Pro solved it in 80 minutes, single session, single attempt
  • Lichtman, world’s top primitive-sets expert, had spent 7 years on this exact problem
  • Tao: “The humans that looked at it collectively made a slight wrong turn at move one”
  • Lichtman called it “a Book Proof,” Erdős’s term for an ideally elegant proof. First AI result at that level
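
If “primitive set” is new to you, the definition is easy to check in code. The sketch below only illustrates the definition from the list above; it has nothing to do with the proof itself, and the function name is ours.

```python
from itertools import combinations


def is_primitive(numbers):
    """Return True if no element of `numbers` divides a different element.

    Illustrative helper only: it checks the definition of a primitive set
    and says nothing about the conjecture or its proof.
    """
    values = sorted(set(numbers))
    if any(v < 1 for v in values):
        raise ValueError("primitive sets are sets of positive integers")
    # After sorting, only the smaller number of a pair can divide the larger.
    return all(b % a != 0 for a, b in combinations(values, 2))


if __name__ == "__main__":
    print(is_primitive([2, 3, 5, 7]))   # distinct primes never divide each other
    print(is_primitive([6, 10, 15]))    # no element divides another
    print(is_primitive([2, 3, 6]))      # 2 divides 6, so not primitive
```

The pairwise check is quadratic in the size of the set, which is fine for getting a feel for the definition on small examples.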

Why builders care: Price had no credentials, no advisor, no institutional access. Just a ChatGPT Pro subscription and a habit of pasting hard problems. The AI had no “collective mental block.” It hadn’t internalized the standard approach everyone else started with. Same energy as an indie hacker entering an industry without knowing what “can’t be done.”


28 YEARS LATE, STILL SHIPPED

🔧 AI coding tools are finishing the side projects developers abandoned years ago

The story: Matthew Brunelle built “Sub-standard”, a YouTube Music to OpenSubsonic API shim, in a single evening using Claude Code with Opus 4.6. The project had sat stalled for 1.5 months. His argument: AI is fair game for utility projects, but learning projects you should still do yourself. The HN thread became a confessional.

The details:

  • Brunelle: stalled 1.5 months, shipped in one evening with Claude Code + Opus 4.6
  • HN commenter hypercube33 finished a Quake 2 mod started in 1998. 28 years from start to ship
  • HN commenter jedberg revived a weather visualization app he’d failed to finish for 12 years
  • HN commenter saadn92 maintains ~120 personal tools. Volume impossible without AI
  • 224 points, 121 comments in 12 hours. High signal for a personal blog post

Why builders care: Every developer has a drawer full of dead projects. The constraint shifted from “can I write this code” to “can I define what I want clearly enough.” That’s a skill indie hackers already have.


YOUR AGENTS HAVE A SHARED BRAIN NOW

📝 Karpathy’s LLM wiki idea gets a multi-agent office with shared Git memory

The story: wuphf (named after The Office’s fictional startup) is an open-source multi-agent orchestrator where Claude, Codex, and OpenClaw agents share a persistent wiki backed by a local Git repo. It builds on Karpathy’s LLM wiki gist, which hit 5,000+ stars and 16M+ views on X. Agents write to private notebooks and promote durable insights to a shared team wiki, which is Git-tracked under an “archivist” identity.

The details:

  • 97% prompt cache hit rate with fresh-context-per-turn architecture
  • 87K tokens/turn flat vs 484K for accumulated-context approaches. 7x cheaper
  • 464 GitHub stars, 227 HN points, 105 comments. Install: npx wuphf
  • 193 releases shipped. Go backend, TypeScript frontend, MIT licensed
  • HN caveat: “Everyone is writing. Nobody is reading.” AI-generated wikis can compound hallucinations
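
To sanity-check the token numbers above, here is a rough back-of-the-envelope sketch. The two token counts and the 97% cache hit rate come from the list; the cached-token price fraction is a hypothetical parameter we made up for illustration, not a figure from wuphf or any provider. On raw tokens alone the ratio works out to roughly 5.6x, so the quoted 7x presumably reflects factors the source does not spell out.

```python
# Figures quoted in the story.
FLAT_TOKENS_PER_TURN = 87_000
ACCUMULATED_TOKENS_PER_TURN = 484_000
CACHE_HIT_RATE = 0.97


def raw_token_ratio(accumulated, flat):
    """How many times more tokens the accumulated-context approach sends."""
    return accumulated / flat


def effective_input_tokens(tokens, cache_hit_rate, cached_price_fraction):
    """Full-price-equivalent tokens when cached tokens are billed at a discount.

    `cached_price_fraction` is an assumption: the price of a cached token
    relative to an uncached one (0.1 would mean a 90% discount).
    """
    cached = tokens * cache_hit_rate
    uncached = tokens - cached
    return uncached + cached * cached_price_fraction


if __name__ == "__main__":
    ratio = raw_token_ratio(ACCUMULATED_TOKENS_PER_TURN, FLAT_TOKENS_PER_TURN)
    print(f"raw token ratio: {ratio:.1f}x")
    # Hypothetical 90% discount on cached tokens, purely for illustration.
    billed = effective_input_tokens(FLAT_TOKENS_PER_TURN, CACHE_HIT_RATE, 0.1)
    print(f"full-price-equivalent tokens per turn: {billed:,.0f}")
```

Swap in your own provider’s cached-token pricing before drawing conclusions; the real saving depends heavily on that number.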

Why builders care: Every time you start a new Claude Code session, the agent has amnesia about your codebase. wuphf makes agents responsible for maintaining shared knowledge across sessions. The git substrate means you can audit your agents’ knowledge evolution. The open question: does AI-maintained documentation stay useful, or become its own kind of noise?


$25K BUYS YOUR SILENCE

🔒 OpenAI pays $25K to jailbreak GPT-5.5’s bio guardrails. Winner signs an NDA.

The story: OpenAI launched a bio bug bounty offering $25,000 for a single universal jailbreak prompt that defeats all five bio-safety questions in GPT-5.5. One prompt, five questions, no prior context. Codex Desktop only, not ChatGPT. Applications are invite-first with mandatory NDA covering all findings.

The details:

  • $25K single winner. Same prize as the GPT-5 bio bounty despite OpenAI’s $300B+ valuation
  • 5 bio-safety questions, not publicly disclosed. Participants see them after vetting
  • Testing window: April 28 to July 27, 2026. Applications close June 22
  • Codex Desktop only. OpenAI is stress-testing its agentic surface, not chat
  • Anthropic’s approach: no cash bounty, 3,000+ collective red-team hours, no universal jailbreak found

Why builders care: If you’re building on GPT-5.5, this signals OpenAI views its own bio guardrails as the weakest link worth stress-testing. The NDA means no public disclosure of what works. Builders adding safety layers on GPT-5.5 are flying blind even after the program closes.


📈 Xiaomi MiMo V2.5 Pro matches frontier at $1/M tokens - 1T total params, 42B active MoE. SWE-bench Pro 57.2%, above Claude Opus 4.6’s 53.4%. $1/M input on OpenRouter vs Claude’s $5/M. 379 upvotes on r/LocalLLaMA. Caveat: Xiaomi’s token-efficiency benchmark (ClawEval) is self-hosted.

⚖️ Colorado exempts open source from age-verification bill - System76 CEO Carl Richell lobbied Sen. Matt Ball, and Colorado’s SB26-051 now exempts open-source OSes, apps, code repos, and containerized platforms. Senate passed 28-0, House vote expected April 28. Template for other states.


DRAMA

FOUNDER ENLIGHTENMENT ZONE

😤 “I’m done with SaaS” burnout thread hits 106 comments on r/SaaS

Technical founder spent ~1 year building SaaS solo, realized building is the easy part and distribution is the actual job. Didn’t quit. Pivoted to CTO/co-founder of 3 startups with partners handling sales. 60 upvotes, 106 comments (1.77x comment-to-upvote ratio). Top reply: “There is a specific kind of burnout that comes from screaming into the void of the microSaaS market.”

Why builders care: Distribution beats product. If you can’t sell, find someone who can. The thread is a support group disguised as a rant.


FIRST DOLLAR

FIRST CUSTOMER IS A YC COMPANY

💵 Solo builder lands first sale of an AI blog tool to a YC company

An r/SaaS builder shipped WithNextSEO, a tool that analyzes a company’s website and generates a blog post in ~30 seconds. First sale went to a YC company. The builder learned to code 5 years ago and built it gradually after work. 30 upvotes, 27 comments of genuine advice. Community consensus: “First sale is the hardest. Now figure out why they bought and get 10 more.”


STACK OF THE DAY

🛠️ claude-mem-viz - Go-based terminal UI that lets you browse, edit, prune, and audit Claude Code’s automatic memory files at ~/.claude/. Fuzzy search across all project memories. Warnings for orphan files, dangling links, and stale entries. Installable via Homebrew or go install. Entirely offline. Free, open source.

Not sponsored. We just feature tools builders would actually use.


BOOKMARKED TODAY

🔍 AI Visibility Monitor - Open-source tool that tracks whether GPT or Claude cite your website. Useful for content creators watching AI-generated citations.

📦 PyPI watchdog - Automated testing layer that checks if PyPI packages actually install and function, not just whether they exist.

🔗 Peeroxide - Full wire-compatible Rust implementation of the Hyperswarm P2P networking stack. For builders working on distributed systems who want native performance.


Curated by AI, built by a human.