Codex beats Claude Code, Stripe launched the AI bank & SpaceX targets a $7.5T valuation | GMP EP11

Cursor — a third-party tool with no privileged access — extracts more performance from Anthropic's own Opus 4.7 than Anthropic's own Claude Code does.

May 1, 2026 | 1:24:33 | Difficulty: Intermediate

Claims & Sources

4 / 20 cited (20%)

Factual claims made this episode, and whether a source was named.

Cursor, a third-party tool with no privileged access, extracts more performance from Anthropic's Opus 4.7 model than Anthropic's own Claude Code harness does.

Ben Broch (no source cited)

GPT 5.5 performs better inside Cursor than inside OpenAI's own Codex environment.

Rik (no source cited)

Cursor was founded in 2022.

Ben Broch (no source cited)

Elon Musk placed a bid valuing Cursor at $60 billion.

Ben Broch (no source cited)

Grok 4.3 is priced at $1.25 per million input tokens and $2.50 per million output tokens, compared to Anthropic's $5/$25 and OpenAI's $5/$30.

Rik (no source cited)

Grok 4.3 claimed the number-one position on Bridgebench with the lowest hallucination rate of any available model.

Rik (source: Bridgebench leaderboard, tweet cited on episode)

GPT 5.5 became the most expensive AI model per token, overtaking Opus 4.7.

Rik (no source cited)

Despite higher per-token pricing, GPT 5.5 uses fewer tokens to solve the same problem as Opus 4.7, making it cheaper in effective cost.

Rik (no source cited)

GitHub Copilot announced a shift from flat-rate subscriptions to usage-based billing effective June 1, 2026.

Rik (no source cited)

The latest DeepSeek model is 90 to 95% cheaper than GPT 5.5 and comparable Claude models.

Rik (no source cited)

Global data-centre compute capacity currently sits at approximately 0.1 terawatts.

Ben Broch (source: Claude, queried live during the episode)

Elon Musk's SpaceX pay package requires hitting a $7.5 trillion valuation, deploying 100 terawatts of compute in space, and establishing a permanent Mars colony of at least 1 million people.

Rik (no source cited)

The Human Genome Project took 13 years to sequence the human genome and only achieved 92% completion.

Ben Broch (no source cited)

The US Department of War named seven AI partners: SpaceX, OpenAI, Google, NVIDIA, Reflection, Microsoft, and AWS — excluding Anthropic.

Ben Broch (source: US Department of War press release, May 1, 2026)

Over 1.3 million Department of War personnel have used the AI platform, generating tens of millions of prompts and deploying hundreds of thousands of agents in only five months.

Ben Broch (source: US Department of War press release, May 1, 2026)

Google invested $40 billion into Anthropic.

Ben Broch (no source cited)

OpenAI is reportedly building an AI-first phone in collaboration with Qualcomm, targeting a 2028 launch.

Rik (no source cited)

The 19-year-old creator of CalAI launched a stealth startup that reached $177,000 MRR within one month of launch.

Rik (no source cited)

Neuralink shares were available for investment at a $43 billion implied valuation on a crypto-based pre-IPO platform.

Rik (no source cited)

Tim Cook stepped down as Apple CEO, with John Ternus set to take over effective September 2026.

Ben Broch (no source cited)
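
The pricing claims above (a higher per-token price that still works out cheaper per task) come down to simple arithmetic. The sketch below uses the per-million-token prices quoted on the episode, none of which had a cited source. The token counts are hypothetical, chosen only to illustrate the effect, and matching the $5/$30 price to GPT 5.5 and the $5/$25 price to Opus 4.7 is an assumption, since the episode attributes those prices only to OpenAI and Anthropic.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """List price in US dollars per million tokens."""
    input_per_m: float
    output_per_m: float


# Prices as quoted on the episode; no source was cited for them.
PRICES = {
    "Grok 4.3": Pricing(1.25, 2.50),
    "Anthropic": Pricing(5.00, 25.00),
    "OpenAI": Pricing(5.00, 30.00),
}


def task_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one task: tokens used times price per token."""
    total = input_tokens * pricing.input_per_m + output_tokens * pricing.output_per_m
    return total / 1_000_000


# HYPOTHETICAL token counts for the same task. The episode gives no figures;
# these only illustrate a model that needs fewer output tokens.
anthropic_cost = task_cost(PRICES["Anthropic"], input_tokens=200_000, output_tokens=60_000)
openai_cost = task_cost(PRICES["OpenAI"], input_tokens=200_000, output_tokens=40_000)

print(f"Anthropic pricing: ${anthropic_cost:.2f}")
print(f"OpenAI pricing:    ${openai_cost:.2f}")
```

With these made-up counts, the task costs $2.20 at the $5/$30 price against $2.50 at the $5/$25 price. More generally, with equal input tokens, the pricier model comes out ahead once it emits fewer than five-sixths (25/30) as many output tokens.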

TL;DR

Three AI builders break down the biggest week in AI yet: Cursor's third-party harness outperforms Anthropic's own Claude Code on Opus 4.7, Codex launches as an everything-app with HeyGen video and browser baked in, Grok 4.3 drops at one-quarter competitor pricing, and GitHub Copilot kills flat-rate subscriptions. Stripe Treasury turns Stripe into an AI-native bank, SpaceX approves Elon's $7.5T pay package with a Mars colony clause, and the Department of War picks seven AI partners — pointedly excluding Anthropic. Key takeaway: the abstraction layer (Cursor, Phantom) captures more value than the base-layer infrastructure (model labs, blockchains).

#AI coding harnesses #Cursor vs Claude Code #model pricing wars #abstraction layer value accrual #SpaceX Mars colony #Stripe Treasury MCP #Department of War AI contracts #Anthropic exclusion #Grok 4.3 benchmark #GitHub Copilot usage billing #vibe coding #whole genome sequencing #agentic workflows #AI phone OpenAI #compute constraints #Cursor #Claude Code #Codex #Grok 4.3 #Stripe Treasury #SpaceX #Elon Musk #Anthropic #OpenAI #harnesses #abstraction layer #GitHub Copilot #usage-based billing #Department of War #AI agents #Mythos #genome sequencing #MCP #AI valuations

Three builders discuss the biggest AI news of the week: Cursor outperforming Claude Code on Anthropic's own models, Codex becoming an everything-app with HeyGen integration, Grok 4.3 dropping at a fraction of competitors' prices, GitHub Copilot moving to usage-based billing, Stripe Treasury launching with MCP support, SpaceX's $7.5T pay package for Elon, and the Department of War naming seven AI partners while notably excluding Anthropic.

Glossary
Harness
In AI development, the software layer (prompts, memory management, tooling) that wraps a base model to extract optimal performance for a specific use case — distinct from the model itself.
MCP (Model Context Protocol)
An open standard launched by Anthropic that lets AI models connect to external tools and data sources in a standardised way.
Agentic workflow
An AI-driven process where one or more models autonomously plan and execute multi-step tasks, often calling external tools, without continuous human input.
IDE (Integrated Development Environment)
Software that bundles a code editor, debugger, and build tools into one interface; in AI coding contexts, products like Cursor and Replit act as AI-enhanced IDEs.
Context window
The maximum amount of text (tokens) a language model can process in a single interaction; a larger window allows the model to 'remember' more of a conversation or codebase.
Vibe coding
Casual, natural-language-driven software development where a user describes what they want and an AI agent writes the code, lowering the barrier to entry.
MRR (Monthly Recurring Revenue)
A normalised measure of predictable monthly revenue from subscriptions or recurring contracts; a key SaaS health metric.
TAM (Total Addressable Market)
The total revenue opportunity available for a product or service if it achieved 100% market share.
Inference
The process of running a trained AI model to generate outputs; inference costs are the compute expenses incurred each time a model responds to a query.
Supervoting shares
A class of stock that carries more votes per share than ordinary shares, allowing a holder to retain outsized control of a company.
Whole genome sequencing
A lab process that reads the complete DNA sequence of an organism's genome, providing far more detail than standard genetic tests.
Terawatt
One trillion watts of power; used here as a measure of global data-centre compute capacity — the world currently sits at roughly 0.1 terawatts.
Hallucination (AI)
When an AI model confidently generates factually incorrect or fabricated information, presenting it as true.
Agnostic (provider-agnostic)
Describing a tool or workflow that is not tied to a single vendor's models or services, allowing easy switching between providers.
Abundance
As used by Elon Musk, a civilisational goal of having more than enough energy, compute, and resources for all humanity, driven by space expansion and AI.
OAuth
An open standard for access delegation that lets users authorise third-party apps to access their accounts without sharing passwords; referenced here in the context of OpenClaw losing OAuth access.
Abstraction layer
A software tier that hides underlying complexity, letting users or developers interact with a simplified interface rather than raw infrastructure — analogous here to wallets in crypto sitting above blockchains.
CI/CD pipeline
Continuous Integration / Continuous Deployment: an automated software workflow that builds, tests, and ships code changes with minimal human intervention.
Phantom (crypto wallet)
A popular non-custodial wallet for the Solana blockchain, used here as an analogy for the high-value abstraction layer that captures fees from end-users.
Compute-constrained
A state where demand for AI model usage outstrips available GPU/compute supply, forcing providers to impose rate limits or raise prices.
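
To make the "Harness" and "Context window" entries above more concrete, here is a minimal toy sketch of a harness: a wrapper that adds a system prompt, trims conversation memory to fit a token budget, and routes tool calls. Everything in it is an assumption made for illustration. The `Harness` class, the `stub_model` stand-in, the whitespace token counter, and the `TOOL name argument` reply convention are all hypothetical, and none of it reflects how Cursor, Claude Code, or Codex actually work.

```python
from typing import Callable, Dict, List, Optional


def count_tokens(text: str) -> int:
    """Crude stand-in for a tokenizer: one token per whitespace-separated word."""
    return len(text.split())


class Harness:
    """Hypothetical wrapper: system prompt + trimmed memory + tool routing."""

    def __init__(self, model: Callable[[str], str], system_prompt: str,
                 context_window: int,
                 tools: Optional[Dict[str, Callable[[str], str]]] = None) -> None:
        self.model = model
        self.system_prompt = system_prompt
        self.context_window = context_window
        self.tools = tools or {}
        self.history: List[str] = []

    def _build_prompt(self, user_message: str) -> str:
        # Budget left for memory after the fixed parts of the prompt.
        budget = (self.context_window
                  - count_tokens(self.system_prompt)
                  - count_tokens(user_message))
        kept: List[str] = []
        # Keep the most recent turns that still fit; drop older ones.
        for turn in reversed(self.history):
            cost = count_tokens(turn)
            if cost > budget:
                break
            kept.append(turn)
            budget -= cost
        return "\n".join([self.system_prompt, *reversed(kept), user_message])

    def ask(self, user_message: str) -> str:
        reply = self.model(self._build_prompt(user_message))
        # Assumed convention: "TOOL <name> <argument>" requests a tool call.
        if reply.startswith("TOOL "):
            _, name, argument = reply.split(" ", 2)
            reply = self.tools[name](argument)
        self.history += [f"user: {user_message}", f"assistant: {reply}"]
        return reply


def stub_model(prompt: str) -> str:
    """Stand-in for a real model so the sketch runs offline."""
    last_line = prompt.splitlines()[-1]
    if last_line.startswith("shout "):
        return "TOOL upper " + last_line[len("shout "):]
    return f"seen {count_tokens(prompt)} tokens"


harness = Harness(stub_model, "You are a coding assistant.",
                  context_window=12, tools={"upper": str.upper})
print(harness.ask("shout hello"))
print(harness.ask("what next"))
print(harness.ask("and then"))
```

The same stub model sits behind every call. What changes between calls is what the harness chooses to put in front of it, which is the sense in which two harnesses can get different results out of one underlying model.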