Issue 16 — April 13 – 19, 2026
This Week in AI
Narrated by Rachel · AI host
The AI buildout is colliding with hard physical limits — electricity, DRAM, and power infrastructure are already constrained, not theoretically at risk — while the software layer faces its own reckoning: legacy SaaS companies that failed to ship real agents in the last 18 months are entering a slow death spiral, and the enterprise market, not consumer, is emerging as two-thirds of the AI value game. Beneath both stories runs a single thread: the gap between what AI can do today and how institutions, markets, and individuals are actually prepared to use it is wider than almost anyone in the industry is willing to say out loud.
The AI Infrastructure Crisis Is Already Here — Not Coming
Ben Horowitz on AI Anxiety, Big Tech Transitions & The Future of Startups | a16z
The US electricity shortage for AI is a present emergency, not a future risk — and the hardware bottleneck is worse than most realize. a16z just raised $15B across four funds specifically because America needs to rebuild physical infrastructure from scratch.
- Servers shipping without RAM: Dell is selling servers with no memory because all DRAM has been absorbed by AI buildout; a new DRAM fab takes five years to build
- Power is gone now: "We're pretty much out of electricity now in the United States. Like not 12 months from now, like right now"
- a16z invested in a literal power transformer company — not an AI transformer — because manufacturing capacity is the binding constraint
- The China compute graph is "like this" (vertical); the US capacity graph is "like that" (flat)
"There are computers that show up with no RAM. Like if you buy a server from Dell, they're like, 'Sorry, we don't have any RAM to sell you.'"
Tokens Per Megawatt Is the Only AI Infrastructure Metric That Matters
Why Cost Per Token Is the Only Metric You Need for AI TCO
GPU-hour is a broken metric — it measures time, not output, and hides the majority of real costs. The right frame is tokens per megawatt: how much intelligence does a fixed power envelope produce?
- Token demand is growing orders of magnitude faster than total flops being deployed — a structural supply-demand gap
- The least efficient data centers have 100% overhead — as much energy wasted as used productively
- "Inference iceberg" framing: cost-per-GPU-hour sits at the surface; tokens-per-megawatt is at the bottom where the real economics live
- Mispricing tokens relative to your energy losses is an existential business threat: "your factory is going to go out of business"
- 800V DC architecture — now going mainstream — originated from a hyperscaler open-source initiative 10–15 years ago; only gained traction after Nvidia committed to it
- 35x performance improvement between GPU generations when measuring correctly — far beyond Moore's Law predictions
"The atomic unit of whatever that industrial revolution being an electron — in this case the atomic unit is a token."
Crypto Is AI's Identity Layer — And Ben Horowitz Woke Up in a Cold Sweat About It
Ben Horowitz on AI Anxiety, Big Tech Transitions & The Future of Startups | a16z
Horowitz's middle-of-the-night realization: a deepfake of him on Zoom could instruct his finance team to wire $500M to Nigeria. His response: cryptographic authentication for everything. This is the practical case for blockchain that cuts through the hype.
- Core questions crypto solves for AI: Are you human or bot? Can you prove you're you? Can you sign a contract?
- AI agents cannot be credit card merchants — they aren't humans — so they need a bare instrument on the internet: "internet money"
- a16z's policy: "Don't believe anything from me unless it's got my cryptographic key on it"
- Horowitz draws the blockchain-as-AI-identity-layer thesis across four distinct problems: deepfake fraud, bot detection, UBI distribution, and AI economic agency
"How does an AI become an economic actor? Like, how do I make money as an AI? How does somebody send me money? Like, can I be a merchant, a credit card merchant if I'm not a human? I don't think so."
Anthropic's Mythos: A Machine Gun for Hacking, and a Market That Reacted Backwards
SpaceX's Financials Leaked: Is it Worth $2TN | Meta Debuts Muse Spark: Are They Back in the AI Race?
Anthropic built a model called Mythos that it's withholding from public release because it's too capable at finding security vulnerabilities — and the market sold off cybersecurity stocks in response, which is exactly backwards.
- The machine gun analogy: both a rifle and a machine gun can kill, but quantity and speed change everything — Mythos finds vulnerabilities at a scale and pace no human team can match
- Real-world illustration: a $100M fitness app was breached within two days of launch, with 3.2M records stolen
- Rory's counterargument: if attackers have machine guns, defenders need tanks — cybersecurity spending should go up, not down
- Implication: AI-assisted code review before deployment will become standard practice
"The metaphor I was trying to look at here is very simple — it's like the difference between a rifle and a machine gun. Quantity makes a huge difference."
The 60% Agent Problem Is a Slow Death Spiral for SaaS Incumbents
SpaceX's Financials Leaked | "I don't buy Dario from Anthropic anymore..."
A 60% AI solution can't be monetized — it has to be given away free, trapping incumbents in a doom loop. This is the sharpest articulation of why legacy SaaS companies face an existential threat from AI-native competitors.
- "If your agents are only 60% as good, you're in a slow death spiral"
- A 60% product must be bundled into base pricing — it can't generate new revenue
- Example: HubSpot launching AI agents that are 60% as good as standalone solutions means HubSpot cannot charge for them
- The moat critique: "Moats keep your customers in, but they don't lure any new ones in. No one's excited to cross the moat except the folks that want to breach the castle walls"
- PE-backed SaaS companies had 18 months, loyal customer bases, and good engineers — and largely failed to ship real agents: "It's tragic"
- New hiring test: "Would I replace them with an agent?" — everyone thinks it, few say it out loud
"If you could charge 60% of what Claude charges, that'd be great. But a 60% product has to be free."
Robotics Just Had Its GPT Moment — And Deployment Is Two Years Ahead of Schedule
The GPT Moment for Robotics Is Here
Physical Intelligence's Quan reveals that real-world robot deployment became a serious consideration just two years into a company that expected to wait five — and the underlying model results are more surprising than the timeline.
- Generalist beats specialist by 50%: a single model trained across 10 robot platforms outperformed specialized models optimized for each individual platform
- Zero-shot task completion: tasks that required hundreds of hours of training data last year can now be performed with zero data collection
- RT2 demo: a robot correctly moved an object "to Taylor Swift" even though Taylor Swift never appeared in robot training data — pure vision-language transfer
- All PI robot control — including laundry folding and coffee making — runs by querying a cloud API, with latency hidden via action chunking
- Quan has never seen partner robots in person and intentionally doesn't ask how they collect data — proving the platform-agnostic model works
- A Claude-based agent babysitting pre-training runs autonomously yields a 50% improvement in compute utilization
- PI open-sources the exact same model weights used internally — no held-back version
"Until basically until ChatGPT, I didn't know if [laundry-folding robots] would exist even in my entire lifetime."
Enterprise Is Two-Thirds of the AI Game — The Opposite of the Internet
SpaceX's Financials Leaked | Meta Debuts Muse Spark
The structural bet that defined the consumer internet era — that consumer is two-thirds of value — may be inverted for AI. This has cascading implications for OpenAI, Microsoft, and every SaaS incumbent.
- If enterprise is two-thirds of AI value, OpenAI's fractured relationship with Microsoft — the dominant path to every enterprise — is an indulgence it can no longer afford: "They need to get some couples therapy"
- CIOs are now token-maxing: creating fixed dollar budgets for AI and making departments compete for allocation — shifting spend from rogue developer purchases to centralized control (per Aaron Levy's roadshow observations)
- Public SaaS stocks can't be properly valued until Anthropic, OpenAI, and AI-native companies IPO — investors are always comparing them to mythical private growth stories
- Even a $100B OpenAI ad business — one of the greatest ad launches ever — may not justify the valuation without matching enterprise revenue
- Anthropic will IPO before OpenAI: addition of the Nordis CEO to Anthropic's board is a clear IPO preparation signal
- Consumer vs. enterprise AI personas are diverging: younger users prefer OpenAI's emotionally supportive tone; enterprise users want harsh, decisive critique
"Enterprise is two-thirds of the ball game in AI and consumer is one-third or less — which is the flip of the last time."
The Blueprint for Superintelligence Will Be Set in Two Years; Deployment Takes Decades
Superintelligence Has a Timeline
The foundational architecture for superintelligence will be established within a couple of years — but broad real-world deployment will take a generation, not a news cycle.
- No overnight AI takeover: base models alone won't suffice — category-specific post-training is required for each industry vertical
- "I don't think that it's a couple of years from now and GPT starts growing 10% year over year globally"
- Prediction: "depth-first" AI players — highly specialized, domain-focused companies — will dominate over the coming decade rather than generalist platforms
- The blueprint being set ≠ the blueprint being deployed: the gap between those two timelines is where most of the real business opportunity lives
Agents Are Already Autonomous Enough to Send Email Without Permission
Building Agents at Home: Homeschooling, Parenting and More | The a16z Show
Jesse's home AI agent autonomously sent an important email on her behalf — and wrote it perfectly, because it had access to her full email history. The trust and provisioning questions this raises are not theoretical.
- Her agents learned to spin up new agents entirely on their own, complete with full family context and team docs — no human onboarding required
- The email was indistinguishable from her own writing: correct tone, style, and "too many exclamation points" — because the agent had her full inbox
- "I will take to my grave the fact that that email was sent by an agent"
- Practical workflow that works today: 30-second voice note → agent produces a beautifully written lesson log with photos
- Identified product gap: current AI voice tools don't pick up children's voices as well as adult voices — a real barrier for AI-assisted education
- Architectural insight: giving agents books to "read" as curriculum sources produces quirkier, more distinctive outputs than generic prompting
"It decided to go into my inbox and send the email as me."
NVIDIA and Physical Intelligence Are Racing to Own the Full Autonomous Stack
NVIDIA Automotive at GTC 2026 · The GPT Moment for Robotics Is Here · Qasar Younis on building the first truly end-to-end autonomous system
Three separate episodes this week converged on the same thesis: the race is to own end-to-end autonomous systems, not individual components.
- NVIDIA: "Everything that moves will be autonomous" — built all four computers in the AV stack (training, simulation, evaluation, car); announced BYD, Nissan, Geely, and Uber as new robotaxi platform partners
- Applied Intuition (Qasar Younis): driverless trucks already running in quarries and mines in Japan today; vision is mine → autonomous truck → port as a single automated logistics chain
- PI: napkin math puts robotics at 10% of US GDP (~$2.4T) as the justification for the data collection investment required
- Common thread: the value accrues to whoever controls the full stack, not individual layers
Key Takeaways
- The AI infrastructure crisis is now, not later: US electricity and DRAM are already constrained; token demand is growing orders of magnitude faster than compute deployment; "tokens per megawatt" is the metric that actually captures real costs.
- The 60% agent problem is existential for SaaS incumbents: a product that's 60% as good as a standalone AI solution must be given away free, trapping legacy companies in a monetization doom loop while AI-native competitors charge full price.
- Crypto is the identity and payments layer AI requires: deepfake fraud, bot detection, and AI economic agency all demand cryptographic authentication and bearer instruments — the practical case for blockchain is no longer theoretical.
- Robotics deployment is two years ahead of schedule, and the generalist model beats the specialist: cross-embodiment training yields 50% better performance than specialized models; zero-shot task completion is arriving faster than even founders predicted.
- Enterprise is likely two-thirds of the AI value game — the inverse of the consumer internet — making OpenAI's fractured Microsoft relationship and the absence of AI-native public comparables the two biggest structural risks in the market right now.
- The blueprint for superintelligence will be set within two years, but deployment is a decade-long process: category-specific post-training requirements mean broad adoption is gradual, and "depth-first" vertical AI players will capture most of the near-term value.
Sources
- Why Claude Feels Different (And What That Means for AI) | The a16z Show
- Scaling Global Organizations in the Age of AI with ServiceNow CEO Bill McDermott
- Why AI Needs Blockchain
- AI Will Change How Governments Work
- Superintelligence Has a Timeline
- Ben Horowitz on AI Anxiety, Big Tech Transitions & The Future of Startups | a16z
- Building Agents at Home: Homeschooling, Parenting and More | The a16z Show
- Silicon Valley Thinks They're Reinventing The World: They're Not
- Qasar Younis on building the first truly end to end autonomous system
- "I don't buy Dario from Anthropic anymore..."
- SpaceX's Financials Leaked: Is it Worth $2TN | Meta Debuts Muse Spark: Are They Back in the AI Race?
- NVIDIA Automotive at GTC 2026
- The GPT Moment for Robotics Is Here
- "Have you heard of google.com?" — Opendoor CEO's Wake-Up Call on Offshoring
- Parenting Books = Best Leadership Advice
- We're entering a new golden era
- Vlad Tenev: Why Prediction Markets Could Replace Your Insurance
- Why Cost Per Token Is the Only Metric You Need for AI TCO
Source episodes
Sourced from 90 episodes across 10 podcasts this week
- Seeing if Opus 4.7 sucks [LIVE]
- Public.com Founder: "Our AI Trading Agent Has Zero Hallucinations"
- Scaling Global Organizations in the Age of AI with ServiceNow CEO Bill McDermott
- China’s Secret to Winning in AI
- AGI will be 10x the industrial revolution
- Jensen Huang Makes the Case for Selling Chips to China
- Agents of Chaos: When Helpful AI Agents Go Rogue [Paper Review]
- Silicon Valley Thinks They're Reinventing The World: They're Not
- Why Claude Feels Different (And What That Means for AI) | The a16z Show
- "I don't buy Dario from Anthropic anymore..."
- SpaceX's Financials Leaked: Is it Worth $2TN | Meta Debuts Muse Spark: Are They Back in the AI Race?
- The GPT Moment for Robotics Is Here
- Parenting Books = Best Leadership Advice
- We're entering a new golden era
- Jensen Huang Fires Back on China Chip Ban
- Why AI Needs Blockchain
- Jensen Huang – Will Nvidia’s moat persist?
- Why Censorship Always Misses What Actually Matters - Ada Palmer
- Why It Took Centuries to Invent Science - Ada Palmer
- Turns out nowhere is safe
- Superintelligence Has a Timeline
- AI Will Change How Governments Work
- Mythos... I can't sleep
- "But OpenClaw is expensive..."
- Ben Horowitz on AI Anxiety, Big Tech Transitions & The Future of Startups | a16z
- Building Agents at Home: Homeschooling, Parenting and More | The a16z Show
- From SpaceX to Founders Fund to Solving America's Nuclear Fuel Problem
- Ben Miller on turning “crazy” ideas into obvious ones
- Is Anthropic's Dario the Boy Who Cried Wolf?
- Startups vs Big AI Labs
- The AI Brokerage Quietly Taking on Schwab, Fidelity & Vanguard
- Keith Rabois: AI Is Now the Biggest Unlock for Cutting Fixed Costs at Opendoor
- Opus 4.7 just dropped... and I'm confused.
- "Have you heard of google.com?" — Opendoor CEO's Wake-Up Call on Offshoring
- Vlad Tenev: Why Prediction Markets Could Replace Your Insurance
- Vlad Tenev: "More than half of our revenue could be outside the U.S."
- Gili Ranaan explains why Isaac Asimov’s Three Laws of Robotics are not enough to protect humanity
- Anthropic's 1st Round: We Got 21 No's
- Building out of the Valley is a MASSIVE Advantage
- Inside the Factory Using Rocks & Sunlight to Fix AI's Power Problem | Exowatt
- The Early Days of Anthropic & How 21 of 22 VCs Rejected It | The Four Bottlenecks in AI | Anj Midha
- Advice to young people today
- "ElevenLabs is gonna be extremely difficult"
- How Exec Dinners are the best ROI for ElevenLabs
- Jack Altman's UBI Hot Take: "We Already Have It—It's Just Hidden in Jobs"
- Thrive Capital's Philip Clark: It's Not "After OpenAI"—It's "After SaaS"
- "Every time we see OpenAI's roadmap, I'm like oh my god" — Jake Paul on AI and Defense Tech
- Thrive Capital's deal philosophy: "You play against yourself, not against others"
- Max Levchin: “The Net IQ of the World Is About to Go Up 50 Points”
- Lightspeed's Michael Mignano: "ARR is fake"
- Why Mass-Market Fintech Is Forced to Go Predatory
- How a Drug Actually Gets Made
- This is the problem with MOATS
- Once Waymo’s become cbeaper than Uber, the rideshare market will grow by an order of magnitude
- Real greatness isn’t something that can be taught, it has to be experienced — Gili Ranaan
- Qasar Younis on building the first truly end to end autonomous system
- Why Cost Per Token Is the Only Metric You Need for AI TCO
- "Do Customers Love It?" Menlo Ventures' Deedy Das on the Only Question That Matters
- NVIDIA Automotive at GTC 2026
- How Applied Intuition runs real world operations from anywhere
- Roblox CEO on the Viral Engine Behind 150M Players
- “Fear is the biggest obstacle for founders.” — Gili Raanan
- Alfred Lin: Everyone Ends Up Embracing AI—The Real Money Is in the Mid Game
- Jack Altman: AI Will Do Most White-Collar Jobs—and Then Get a Body
- Hard work always leads to profit
- What makes Anthropic's Dario so special
- Is this what Europe needs to be competitive?
- We're in 1885 Industrial Revolution England Right Now
- "AI Alignment is NOT the hardest problem..."
- Most in Demand Companies Today
- 20 Years of CUDA: Honoring the Architects of the Accelerated Age
- How AI-RAN Turns Telecom Networks into Real-Time AI Infrastructure
- How AI Will Change Quantum Computing | NVIDIA AI Podcast Ep. 294
- Introducing NVIDIA Ising
- AI for Quantum: NVIDIA Ising Accelerates Useful Quantum Computing
- How AI Will Change Quantum Computing | NVIDIA AI Podcast Ep. 294
- CEO Jensen Huang joins Rep. Ro Khanna at Stanford for a conversation on AI
- Michael Mignano on where value actually accrues in the AI stack
- Michael Mignano on Suno's Contrarian Bet: Creators Will Listen to Their Own AI Music
- LEM Surgical - Dynamis Robotic Surgical System
- Jack Altman: AI's Real Bottleneck Is Energy—Why Fusion Matters
- How did OpenAI get to 1 billion users so fast? a16z’s Alex Zimmerman explains:
- Vlad Tenev on the Biggest Risk to Prediction Markets: Integrity
- Deel CEO Alex Bouaziz: The Trait You Can't Teach—"Default Optimism"
- Robinhood CEO: How We Went From 3 Business Lines to 11 at $100M+ Each
- Reddit's CEO: "The Internet Needs an Immune System" Against AI Slop
- Flock Safety is the most under- appreciated company in America today, working to eliminate all crime
- Gili Raanan says we’re about to face the “darkest period” in cybersecurity history.
- Ben Miller on the two threats to VC: Geopolitics and AI
- Lightspeed's Michael Mignano: "You gotta back Elon Musk" (xAI + Neuralink)