WEBVTT
NOTE The Rundown — nextbig.dev daily audio edition, 2026-07-01

1
00:00:04.500 --> 00:00:11.100
<v Oday>Anthropic just cut the price of a capable agent this week, and it did it to its own flagship.

2
00:00:11.100 --> 00:00:23.380
<v Shannon>It's Wednesday, July first. Here's the rundown: what Sonnet 5 actually changes, the silicon cutting costs from the other side, and where the money goes once the model stops being scarce.

3
00:00:23.560 --> 00:00:39.720
<v Oday>Sonnet 5 shipped, and the tell is what Anthropic took out. It drops the temperature, top-p, and top-k sampling knobs. How the model decodes is fixed now, not a dial you turn.

4
00:00:39.720 --> 00:00:57.720
<v Shannon>Around that quiet change is a loud one. Sonnet 5 runs the agentic coding and tool use that needed the flagship six months ago, at mid-tier pricing, with a million-token window. Every team running agents just watched that work get cheaper without touching their code.

5
00:00:57.720 --> 00:01:13.320
<v Oday>And it's the whole week landing on the invoice. Nine days ago an open Chinese model caught Claude at a tenth of the cost. Yesterday Anthropic pulled its flagship back from an export freeze.

6
00:01:13.320 --> 00:01:31.080
<v Shannon>Today it undercuts that same flagship with a model most teams will find good enough, and Google cut too, shipped its cheapest image model the same morning. Capable AI is getting cheaper from every direction, and the labs are doing the cutting to themselves before an open model does it for them.

7
00:01:31.080 --> 00:01:43.080
<v Oday>There's a hardware half to this. Etched, a startup whose chip runs transformers and nothing else, raised at five billion dollars on a billion in booked inference orders.

8
00:01:43.080 --> 00:01:57.880
<v Shannon>Which is a gamble that the architecture holds still long enough to bake into silicon. Grant the risk. Then look at the billion already booked. Enough serious buyers think inference has stopped changing shape that they're prepaying for chips that assume it.

9
00:02:00.140 --> 00:02:05.540
<v Oday>So what does Anthropic do about all this? Look at what it shipped next to Sonnet 5.

10
00:02:05.540 --> 00:02:18.780
<v Shannon>Claude Science. A standalone product for computational biology and drug discovery, sold as a tool for a job, not raw model access. That's a different business than renting the smartest tokens by the million.

11
00:02:18.780 --> 00:02:29.420
<v Oday>As the price of a token slides toward the open-model floor, a finished result a lab will pay for is the ground that keeps your margin off that floor.

12
00:02:29.420 --> 00:02:45.580
<v Shannon>For anyone building: re-price your agent traffic against Sonnet 5 before you assume you need the flagship. On most coding and tool-use work, the cheaper model clears the bar now, and the flagship only earns its premium on the genuinely hard reasoning.

13
00:02:46.960 --> 00:03:02.480
<v Oday>To the tape. We're watching Nvidia, Etched, Alphabet, and Anthropic. The inference-ASIC wave got its first hard number this week, and it's the first real pressure on the part of Nvidia's business that isn't training.

14
00:03:02.480 --> 00:03:23.040
<v Shannon>Highest conviction is a long on Anthropic, of all things. Sonnet 5, Claude Science, and Fable 5's return read as one strategy: cut the token price, move upmarket into products. The falsifier is if that price cut just compresses their margin and the outcome products don't pick up the slack.

15
00:03:23.040 --> 00:03:28.640
<v Oday>The tape is the desk's scorecard, not advice.

16
00:03:31.880 --> 00:03:33.880
<v Oday>Quick break — two from the desk.

17
00:03:33.880 --> 00:03:48.520
<v Shannon>One we know well: vote dot direct. If you're on an H O A or a board, it runs your elections digitally — secure, verifiable, no paper, no clipboard in the lobby. Point your council to vote dot direct.

18
00:03:48.520 --> 00:04:01.640
<v Oday>And if this is your ten minutes of A I for the day, get the written edition too. The full wire, free, every morning — leave your email at nextbig dot dev.

19
00:04:06.070 --> 00:04:25.030
<v Oday>Our call: within four months, at least one widely used agent framework or coding tool moves its default model down from a flagship to a mid-tier model in the Sonnet 5 class, because the cheaper one is now good enough for most of the work.

20
00:04:25.030 --> 00:04:37.670
<v Shannon>What proves us wrong: if by November first no major agent tool has changed its default, and they're all still pointing new users at the flagship. These defaults are public, so it settles cleanly.

21
00:04:37.670 --> 00:04:49.510
<v Oday>A team spending four figures a month on flagship agent calls can move most of that traffic to Sonnet 5 this week and read the savings on next month's bill. That's the rundown.