#049

Karpathy shuttered his AI school for Anthropic, Gemini Flash is 5x pricier, GCP kicked Railway

Karpathy paused Eureka Labs to use Claude to train Claude at Anthropic. Gemini 3.5 Flash hiked 5x to $1.50/M. Google Cloud cut Railway off mid-outage.

Listen to this edition

Andrej Karpathy just paused his AI-education startup Eureka Labs after 22 months and joined Anthropic’s pre-training team. His brief on day one: use Claude to train Claude on Nick Joseph’s team, the most compute-hungry part of building a frontier model.

Karpathy is the second-most-followed AI researcher on the internet and he coined the term “vibe coding.” If he can’t out-research a $43B ARR lab from a solo startup, neither can you, and that has consequences for how you pick your next wedge.

In today’s indie hacker news:

  • 🧠 Karpathy walks from his solo startup straight to Anthropic
  • 💸 Gemini Flash 3.5 quietly takes a Pro-adjacent price tag
  • ⏳ Google’s CLI tool of 12 months gets a four-week kill notice
  • 🚂 GCP cuts Railway’s account mid-outage, no appeals
  • 🛠️ PrismoDev catches a file loaded 286 times into your agent

TOP STORIES

CLAUDE TRAINS CLAUDE

🧠 Karpathy paused Eureka Labs after 22 months to use Claude to train Claude at Anthropic

Karpathy paused Eureka Labs after 22 months to use Claude to train Claude at Anthropic

The story: Karpathy posted May 19 that he’s joining Anthropic this week and pausing Eureka Labs, the AI-education company he founded in July 2024 after leaving OpenAI a second time. He’s reporting to Nick Joseph on the pre-training team to build a recursive loop where the current Claude accelerates research on the next Claude. Fortune walked the arc: OpenAI founding member in 2015, Tesla AI director through 2022, back to OpenAI in 2023, out again to start Eureka in 2024, and now Anthropic at the most compute-hungry layer of the stack.

The details:

  • Karpathy’s own words on X: “I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D.”
  • His manager Nick Joseph is ex-OpenAI, lasted 9 months there before defecting to Anthropic. The pre-training team is the one that touches every Sonnet, Haiku, and Opus update before you do
  • Eureka Labs shipped LLM-101n, a hands-on course Karpathy taught while live-coding GPT-2 from scratch. It will be paused, not killed. He says he plans to “resume my work on it in time”
  • Anthropic’s $43B ARR in April is the gravity well: 80x growth from $9B at the end of 2025. Talks of a $30B raise at a $900B-$1T valuation are open
  • The HN thread hit 1,223 points and 508 comments in 13 hours. Top comment read the move as confirmation that “solo AI is dead, the labs eat everything”

Why builders care: The wedge that survives the recursive self-improvement era is the stuff a lab can’t ship from a model card. Pick one of three: a customer relationship the model never sees, a workflow that wraps the model in your domain, or a private dataset your users keep feeding you. Whatever you do, stop competing on raw model capability.



FLASH ATE A FRONTIER PRICE TAG

💸 Gemini 3.5 Flash launched at 5x the input price of 2.5 Flash and Google wants it as your default

Gemini 3.5 Flash launched at 5x the input price of 2.5 Flash and Google wants it as your default

The story: Google launched 3.5 Flash at I/O 2026 with a 1M-token context window and pricing that jumps to $1.50 per million input tokens and $9 output. Sundar Pichai opened the keynote with “Welcome to the agentic Gemini era” and closed it with the same line. Simon Willison ran his benchmark and watched the bill go from $172 on 2.5 Flash to $1,552 on 3.5 Flash, a 9x end-to-end jump on the same workload.

The details:

  • Pricing now sits at 75% of Gemini Pro on both input and output. Claude Haiku 4.5 is $1/$5 and GPT-5-mini is $0.25/$2. Flash is the most expensive small/fast model on the board
  • Cached input drops to $0.15 per million, a 90% discount. If your prompts and tool schemas are stable, caching is the only knob that keeps the new Flash inside its old budget
  • It beats Gemini 3.1 Pro on Terminal-Bench 2.1 (76.2%) and MCP Atlas (83.6%) but loses on Humanity’s Last Exam and 128k MRCR v2. Pichai’s pitch: pay Pro-adjacent rates, get Pro-beating agentic results
  • Google demoed Antigravity 2.0 building a working OS in 12 hours using 93 parallel sub-agents, 15,000 model requests, and 2.6B tokens for under $1K in credits. The agent boom now has a cost ceiling Google is happy to charge against
  • HN’s top comment from GodelNumbering laid out the trajectory: 2.5 Flash $0.30/$2.50, 3.0 Flash Preview $0.50/$3, 3.5 Flash $1.50/$9

Why builders care: If you wrote your unit economics on 2.5 Flash, redo the math today. Implement context caching before you ship anything new on 3.5, or move the cost-sensitive paths to Haiku 4.5 or GPT-5-mini and keep Flash for the workloads where its speed claim earns its rate.


12 MONTHS IN, 4 WEEKS OUT

⏳ Gemini CLI stops serving requests June 18 and Google force-migrates everyone to Antigravity CLI

Gemini CLI stops serving requests June 18 and Google force-migrates everyone to Antigravity CLI

The story: Google posted on May 19 that Gemini CLI and Gemini Code Assist IDE extensions stop serving Pro, Ultra, and free individual users on June 18, 2026. The tool launched on June 25, 2025, so the lifespan clocks in at exactly 12 months. Antigravity CLI is the Go-rewritten replacement with async multi-agent orchestration, and it’s already shipping alongside Antigravity 2.0 desktop, SDK, and IDE under one umbrella.

The details:

  • Programmatic and Agent Client Protocol support in Antigravity CLI is unconfirmed. If you wired Gemini into a cron, a CI step, or a headless agent loop, you don’t yet know whether the new binary exposes the same interface
  • Both tools write global rules to the same ~/.gemini/GEMINI.md path, so if you install both they fight over context. Issue 16058 documents the migration also relocating brain entries and overwriting mcp_config.json symlinks, breaking live MCP server connections
  • Quotas shift from daily to weekly windows. Gemini CLI ran 60 req/min and 1,000 req/day on free. Antigravity CLI moves everyone to weekly caps, and Pro subscribers already got locked out for 7-day stretches earlier this year on the same model
  • Standard, Enterprise, and Google Cloud customers keep their access. The kill switch only fires on free, Pro, and Ultra individual tiers
  • GitHub orgs cannot install new Gemini Code Assist for GitHub instances after June 18. Existing installs stop working in the weeks after

Why builders care: The classic Google rebrand-and-retire move on a tool builders only just adopted. Audit every script that calls gemini this week, test Antigravity CLI in non-interactive mode before the cutoff, and budget for a quota model that hands you a week’s worth of calls and dares you to ration them.


YOUR PAAS HAS A KILL SWITCH

🚂 Google Cloud blocked Railway’s account mid-outage and 10M deploys/month went dark for 4.5 hours

Google Cloud blocked Railway's account mid-outage and 10M deploys/month went dark for 4.5 hours

The story: Railway’s status page logged the incident at 22:29 UTC on May 19. Co-founder Jake Cooper posted at 23:37 UTC: “It appears Google Cloud has blocked our account, and so some services are unavailable. We’ve escalated this to Google.” The dashboard, API, and a chunk of customer workloads went down for 4 hours 36 minutes. Three HN threads ran in parallel and hit 275 points in under four hours.

The details:

  • Railway declared its Metal migration “complete” in June 2025 with “all workloads running 100% on Railway Metal.” This outage proves the control plane, dashboard, and API still ride on GCP
  • Recovery timeline: 01:23 UTC compute back, 01:41 UTC “non-enterprise deploys remain paused, enterprise deploys are unaffected,” 03:05 UTC most workloads restored. The trigger for the block was never disclosed by Google
  • The platform runs 10M+ deployments and 1T+ requests per month, and raised $100M led by TQ Ventures in January to challenge AWS. None of that mattered while the GCP trust-and-safety bot held the key
  • Precedent: GCP wiped UniSuper’s Australian pension fund in 2024, and SSLMate’s account has been suspended three times. There is no public appeals process
  • HN user brokenodo posted the cleanest playbook: “Switched to Render. DNS cutover in 1 minute, full migration in 30 minutes”

Why builders care: Every PaaS on top of one cloud has a kill switch you don’t own. Keep a Dockerfile and IaC that boots on a second target (Render, Fly, a VPS), pre-stage the DNS records, and run the failover drill once a quarter so the muscle memory is there when the bot pulls your plug.


🪙 Token-waste tooling is the new picks-and-shovels wave - Three Show HNs in 48 hours, all aimed at the same wound. PrismoDev’s README cites a real session where one file got loaded into context 286 times. Logbox pipes dev-server output into SQLite and exposes it over MCP so agents stop pasting terminal noise. Forge’s guardrails push a self-hosted 8B model to 86.5% on a 26-scenario agentic eval. Three days after levelsio went viral complaining about Claude Code slowness, the tooling response is here.

🛡️ Agent governance is the new YC pitch deck slide - Three more Show HNs this week wrap a control plane around AI agents. SafeRun ships sub-50ms p95 inline action checks plus replay debugging. Capframe mints macaroon-style ed25519 capability tokens for every tool call. AgentWing flips the wedge and parallelizes agent subtasks via a director layer. Indie hackers are selling the seatbelts for the agent boom they shipped six months ago.

🔓 GitHub itself is investigating unauthorized access to its own repos - GitHub posted to X on May 19-20 that it is “investigating unauthorized access to GitHub’s internal repositories” and claims no customer impact so far. Same week, Krebs reported a Nightwing contractor pushed a public repo named “Private-CISA” exposing three privileged AWS GovCloud accounts and CISA’s internal artifactory. Credentials stayed valid for 48 hours after takedown. The “private repo equals safe” assumption broke twice in one news cycle.


FIRST DOLLAR

SAME-DAY SHIP, SAME-DAY STRIPE

💵 $216 in one day by shipping the feature a prospect emailed about

pablo-was-here from r/SaaS hit hot rank 4 with the kind of story most builders never write up. A company emailed Motion Software asking for three licenses under one billing account, a feature he didn’t have. He built Workspaces that day, replied as a human (no template), and woke up to a $216 Stripe ping for three annual licenses paid upfront. Pair this with Papermakin1 crossing their first $300 day the same week. Two builders, two thresholds, no growth hack involved.


STACK OF THE DAY

🛠️ PrismoDev - MIT-licensed local CLI that diagnoses token waste in your Claude Code, Cursor, and Codex sessions in real time. Install with npx getprismo doctor, point it at a repo, and it tells you which files got reloaded a hundred times into context and which lockfiles are quietly eating your budget. Tiny project (7 stars on day one) but the kind of pick-and-shovel tool that pays for itself the first time you stop a runaway agent loop.

Not sponsored. We just feature tools builders would actually use.


BOOKMARKED TODAY

🇦🇹 Mistral acquires Emmi AI - Austria-based Physics AI startup, 30+ researchers building neural surrogates for CFD, particle flow, and aerospace simulation. Mistral folds the team into a new Linz office aimed at “the leading AI stack for Industrial Engineering.” Deal value not disclosed.

🖼️ OpenAI adopts Google’s SynthID watermark - ChatGPT, Codex, and the OpenAI API now stamp every generated image with DeepMind’s invisible SynthID plus C2PA metadata. A public verification tool is in preview at openai.com.

🧽 remove-ai-watermarks shipped the same week - MIT Python CLI by wiltodelta that strips SynthID, C2PA, EXIF, and XMP from generated images. 335 stars, v0.4.1 on May 17. Provenance arms race in real time.