#062 OpenAI lands on AWS next to Claude, a $2,000 milestone post checks out at $43

On June 1, OpenAI’s GPT-5.5, GPT-5.4, and Codex went generally available on Amazon Bedrock, dropping straight into the model picker next to Claude. For any team already on AWS, that means A/B-testing GPT against Claude on one bill, with no second vendor account.

Six weeks ago this was impossible. A Microsoft exclusivity deal kept OpenAI off every cloud but Azure until late April, which is what makes Bedrock’s newest tenant moving in on Anthropic’s home turf the quiet plot twist.

In today’s indie hacker news:

🤖 OpenAI’s frontier models go live on AWS, one bill away from Claude
💸 A viral “$2,000” milestone post checks out at $43 of real revenue
📈 He fired every client under $3k a month and hit $11k MRR
🛡️ Claude Code wrote a bank-grade agent governor in 12 days
💻 Microsoft’s NVIDIA laptop runs 120B models with no cloud

TOP STORIES

MOVING IN NEXT DOOR TO CLAUDE

🤖 OpenAI’s frontier models and Codex are now GA on AWS Bedrock, billed next to Claude

OpenAI's frontier models and Codex are now GA on AWS Bedrock, billed next to Claude

The story: OpenAI announced that GPT-5.5, GPT-5.4, and Codex hit general availability on Amazon Bedrock, reachable through Bedrock’s Responses API. That fills the last hole in Bedrock’s lineup: Claude has lived there since 2023, Llama runs there, and OpenAI was the lone holdout. Why it was missing is the interesting part. The restructuring that ended Microsoft’s exclusivity on April 27 also defused a near-lawsuit over OpenAI’s $50B Amazon deal, which is what cleared the runway onto a rival’s cloud. Hacker News read the move as worrying for Anthropic, which had won Bedrock by default.

The details:

The model IDs are literal: openai.gpt-5.5 and openai.gpt-5.4. Codex ships as a separate agentic product via App, CLI, VS Code, JetBrains, and Xcode.
Availability is thin at launch. GPT-5.5 is in US East (Ohio) only; GPT-5.4 adds US West (Oregon). Commercial and GovCloud are both supported.
Azure keeps a four-month first-mover window on new releases, written into the deal, so Bedrock structurally lags every frontier launch by a quarter.
AWS does not publish per-token dollar rates on the blog. HN users report inference running about 10% over OpenAI direct, and roughly 30% higher on GovCloud.
Next up, per OpenAI: “Daybreak,” its cyber models plus Codex Security for code review, threat modeling, and patch validation.

Why builders care: The unlock is that GPT now inherits the AWS perimeter you already built. Same IAM roles, same VPC endpoints, same cost center, so swapping a model is a config change, not a fresh vendor security review or an Azure SSO setup. For health, fintech, and gov teams it’s bigger: inference stays inside your chosen Bedrock region, the data-residency guarantee that used to force them onto Azure-hosted GPT. The price of admission is the markup and that permanent quarter-lag behind Azure.

THE MILESTONE THAT WASN’T

💸 A viral “$2,000 in a saturated market” post verified at $43 of real revenue

A viral $2,000 in a saturated market post verified at $43 of real revenue

The story: A founder posted to r/microsaas that he’d hit “almost $2,000” and 80+ paying customers with SubChecks, a subscription tracker, after two months of mass-DMing strangers. It climbed the hot list. Then the receipts came in. TrustMRR, which reads the product’s LemonSqueezy account directly, shows $43 all-time and $20 in the last 30 days as of today. The product is real, the positioning is genuinely sharp, and the headline number is off by roughly 98%.

The details:

SubChecks charges $20 once for Pro, no recurring fee. At that price, $43 all-time is about two sales, not eighty customers.
The counter-positioning is the actual lesson: a subscription tracker that doesn’t charge a subscription, with no bank-linking, aimed straight at Rocket Money’s weak spot.
There’s no course, no Gumroad, no email funnel attached. This reads as vanity-metric milestone-posting, not a paid grift.
The category is as crowded as he said. Minus, SubTracky, and Subdupes all run the same “no subscription fee” wedge.

Why builders care: Counter-positioning beats feature parity, and a one-time-payment privacy tracker is a defensible angle against an incumbent. But the number is the story builders should sit with. A claimed $2,000 was a verified $43, and that gap is the genre, not the exception. When you read the next milestone screenshot, ask where the Stripe link is.

FIRE THE CHEAP ONES

📈 $11k MRR at month 14, after he stopped taking any client under $3k a month

$11k MRR at month 14 after he stopped taking any client under 3k a month

The story: A founder running a virtual-CFO service for e-commerce brands wrote up the move that broke an eleven-month plateau. He hit $2k in monthly retainers by month 3, then sat there, stuck. The unlock was raising his floor: stop taking any client paying under $3k a month. He lost four clients the day he enforced it. Within eight weeks he’d replaced them with six better ones, and fourteen months in he’s at $11k. It’s a retainer business, not a SaaS, and the revenue is self-reported, so read it as a playbook, not a guarantee.

The details:

Productizing onboarding cut setup from three weeks to four days, which made the higher tier deliverable without drowning in hand-holding.
Automating the non-CFO admin, billing, client comms, reporting, freed roughly two days a week.
That recovered time went straight into content, now his main acquisition channel.
One commenter flagged the math: six clients at $3k is $18k, not $11k. He never reconciled it, so the floor is clearly a minimum, not what every client pays.

Why builders care: Raising your price floor isn’t just margin, it self-selects for clients who want results over reassurance, which is what makes productized delivery possible at all. The higher tier already existed. The low floor was just signaling buyers down to it. The reproducible heuristic: set a minimum, enforce it on day one, and redeploy the freed admin time into something that compounds.

CLAUDE TALKING TO ITSELF

🛡️ MeshFlow open-sources agent governance with real audit code, but it’s a 12-day-old solo repo

MeshFlow open-sources agent governance with real audit code but it is a 12-day-old solo repo

The story: MeshFlow shipped on GitHub as an Apache-2.0 runtime for governing multi-agent systems in production, and the hook is a real stat: 79% of enterprises adopted agents, only 11% run them in prod. The audit code is no bluff. There’s a working SHA-256 ledger that chains every step, plus per-framework HIPAA, SOX, and GDPR rule classes and a hard cost cap. The catch: it’s twelve days old, has one star, one contributor, and was built largely by Claude Code. r/MachineLearning’s top comment called it “classic vibeslop agent harness project. Bonus points for Claude talking to itself in the comments.”

The details:

The headline 70-85% token-cost cut has no controlled benchmark behind it. The author admits the savings weren’t validated on autonomous agents, only batch tests.
That hook stat is real, but it’s PwC’s, from April 2025. MeshFlow’s own site misattributes it to McKinsey.
It’s genuinely shipped: 422 Python files, a meshflow package live on PyPI, and from_crewai() / from_autogen() adapters that wrap existing graphs.
The “we built agent systems for banks and clinical teams” claim is unverifiable against a repo that didn’t exist two weeks ago.

Why builders care: Governance-as-infrastructure is the right target if you’re in the 89% who can’t get agents to production, and the audit chain and cost cap are real code you can read, not slideware. But this is a scaffold to evaluate, not a battle-tested runtime. Verify the cost claim in your own workload before you quote it to anyone, and weigh the provenance before you put a Claude-Code-built tool in charge of HIPAA enforcement.

120B PARAMETERS, NO CLOUD

💻 Microsoft’s Surface Laptop Ultra runs 120B models locally on NVIDIA’s RTX Spark

Microsoft's Surface Laptop Ultra runs 120B models locally on NVIDIA's RTX Spark

The story: Microsoft and NVIDIA announced the Surface Laptop Ultra at Computex, built on the RTX Spark superchip with up to 128GB of unified memory. The pitch is running 120B-parameter models entirely on-device. This is an official launch, not a rumor: NVIDIA, Tom’s Hardware, and Windows Central all corroborate it. The “MacBook Pro rival” framing is editorial spin from WindowsLatest, not Microsoft’s language, and the honest comparison is more mixed than the headline.

The details:

The chip packs 6,144 Blackwell CUDA cores, the same count as a desktop RTX 5070, but with 128GB of accessible memory instead of the 5070’s 12GB cap.
The widely-reported “600GB/s bandwidth” is wrong. r/LocalLLaMA caught it: that figure is the NVLink interconnect speed, while real system memory runs ~273 GB/s.
An M4 Max still wins on memory bandwidth (~400 vs ~273 GB/s) and runs the same quantized models via MLX, likely for $2-3k less at equal RAM.
Price, battery life, and exact ship date are all undisclosed. It lands in fall 2026, and there are zero independent benchmarks yet.

Why builders care: This is the first laptop that runs 70B+ quantized models in fast unified memory with a full native CUDA stack, so PyTorch, TensorRT-LLM, and vLLM all work without the MLX compatibility tax. If your inference code is CUDA-dependent and you need more than 24GB of VRAM, it’s the most serious portable option yet. If it isn’t, a Mac runs the same models today, quieter and cheaper, and you can wait for the reviews.

🧠 “There are literally only two models worth running” - r/LocalLLaMA’s top post made a satirical-but-upvoted case that the local field has narrowed to Qwen 3.6 in two sizes, and that a rough quant of a big model beats your precious tiny ones. Same week, JetBrains open-sourced Mellum2, a 12B Apache-2.0 coding model built for routing and private deployment, not chat. The consolidation read: the “what model for my RTX 3060” era is ending, and purpose-built beats general.

📱 Tony Dinh is running a startup from his phone - @tdinh_me is narrating fully agentic dev on his new email client: Claude Code agents queue tasks, QA, and auto-push OTA changelog updates while he reviews from his phone. In a second post, a bot he calls “Otto” set up his entire RevenueCat plus App Store Connect billing flow end to end. The founder role is sliding from typing to delegating, and the IAP-setup demo is the most concrete proof of it this week.

DRAMA

THE $1M ARR THAT ISN’T

🔥 levelsio torched the “$1M ARR in a month” crowd as ad-pumped $0-profit funnels

@levelsio did two things in one breath: amplified a real bootstrapped builder at $16k MRR after four months, then called out the “ecom bros” claiming “$1M ARR in 1 month.” His math is the knife: they paid $83,000 in ads to make $83,000 in sales, so it’s $0 profit, and the real product is a $3,000 course teaching the same stunt. The $16k builder, Matthew Miller of BridgeMind, acknowledged the shoutout directly.

Why builders care: This is the same disease as today’s $43 milestone post, just at a bigger scale. The honest version is all over Reddit if you look: CheckVibe posted a real Stripe link for $3.4k in six weeks, no funding. Slow and verified beats viral and inflated every time.

FIRST DOLLAR

THE ANTI-UNICORN

💌 $400 MRR, three customers, and they email him like a best friend

A solo builder posted to r/microsaas about $400 MRR from one tiny product with three paying customers, built in evenings on “vibe coding and a lot of caffeine.” He pointedly won’t name it, calling it “a tool that solves one specific problem for one specific type of person.” The detail that lands: he spent six months getting the landing-page copy right and says the product was the easy part. Three power users who write you like a friend beat 3,000 indifferent signups, and most builders under-invest in exactly the words he obsessed over.

STACK OF THE DAY

🧠 Memex

A local-first AI journal that stores everything as plain Markdown plus a local SQLite database, GPL-3.0, on iOS and Android. On-device agents auto-generate structured cards (tasks, people, places, metrics) from raw entries, and you bring your own model key across 12+ providers, from OpenAI down to fully-offline Ollama. Nothing leaves the device, and because the files are just text you can grep them, version them, and walk away anytime. The custom-agent system runs JavaScript with event triggers on your phone, so you can wire life-tracking automations with no backend. Note: ih1 tagged it Show HN, but no matching thread surfaced, so call it a strong open-source find, not a front-page placement.

Not sponsored. We just feature tools builders would actually use.

BOOKMARKED TODAY

💰 “Can the stock market swallow Anthropic, SpaceX and OpenAI?” - The Economist on whether public markets can absorb these valuations when they IPO, the clearest read yet on the AI-bubble question. It hit ~144 points and 315 comments on HN, which tells you the room is nervous.

🔞 “Age verification for social media, the beginning of the end for a free internet?” - Mullvad argues social-media age-gating is an inflection point for the open web. If you’re building anything consumer-facing, age verification is coming for you too, and this is the case against it.

⚖️ “Florida sues OpenAI and Sam Altman” - Florida filed suit over AI safety and child-safety risks. One to file under AI-liability watch, especially the week OpenAI’s models land on a second hyperscaler.

Curated by AI, built by a human.

OpenAI lands on AWS next to Claude, a $2,000 milestone post checks out at $43

TOP STORIES

MOVING IN NEXT DOOR TO CLAUDE

THE MILESTONE THAT WASN’T

FIRE THE CHEAP ONES

CLAUDE TALKING TO ITSELF

120B PARAMETERS, NO CLOUD

TRENDING TODAY

DRAMA

THE $1M ARR THAT ISN’T

FIRST DOLLAR

THE ANTI-UNICORN

STACK OF THE DAY

🧠 Memex

BOOKMARKED TODAY

Get the daily indie hacker digest

You're in.