WEBVTT
NOTE The Rundown — nextbig.dev daily audio edition, 2026-06-18

1
00:00:04.500 --> 00:00:13.460
<v Oday>GLM-5.2 is the top open-weights model in the world this morning, and it bought that crown one expensive token at a time.

2
00:00:13.460 --> 00:00:18.700
<v Shannon>It's Thursday, June 18, 2026. Here's the rundown.

3
00:00:18.700 --> 00:00:31.460
<v Shannon>A new open-weights leader with an invoice problem, the DOJ deciding xAI's turbines are a national priority, and a code-graph MCP claiming ten-x fewer tokens. Then The Call.

4
00:00:31.640 --> 00:00:47.640
<v Oday>Z.ai's GLM-5.2 scored fifty-one on the Artificial Analysis Intelligence Index, eleven points above GLM-5.1 at the same size. Seven hundred fifty-three billion total parameters, forty billion active, shipped under MIT.

5
00:00:47.640 --> 00:00:57.000
<v Oday>It clears MiniMax-M3 and DeepSeek V4 Pro, both at forty-four, and sits second on the WebDev arena behind only Claude Fable 5.

6
00:00:57.000 --> 00:01:08.040
<v Shannon>And before anyone screenshots the leaderboard, look at how it got there. Same parameter class as 5.1. They didn't build a bigger model. They told it to think longer.

7
00:01:08.040 --> 00:01:19.640
<v Oday>The numbers back you up. GLM-5.2 burns forty-three thousand output tokens per task, thirty-seven thousand of it reasoning. GLM-5.1 spent twenty-six thousand.

8
00:01:19.640 --> 00:01:28.760
<v Shannon>That's the whole eleven-point jump. Test-time compute is the cheap way to climb a benchmark and the expensive way to run a product.

9
00:01:28.760 --> 00:01:40.840
<v Oday>Run the bill. Pricing holds at GLM-5.1 levels, four dollars forty per million output tokens. At forty-three thousand tokens a task, that's about nineteen cents per task in output alone.

10
00:01:40.840 --> 00:01:50.200
<v Shannon>A model scoring forty-four at twenty-four thousand tokens answers for under half that. So the rank and the invoice point in opposite directions.

11
00:01:50.200 --> 00:01:52.440
<v Oday>So when do you reach for it?

12
00:01:52.440 --> 00:02:01.560
<v Shannon>When hard reasoning is the constraint and volume is modest. Pick it for the answer you can't get cheaper. Don't pick it for a loop running millions of times a day.

13
00:02:01.560 --> 00:02:10.600
<v Shannon>At that scale you cap the reasoning budget or you stay on a leaner open model, because the marginal correct answer is not worth eighty percent more tokens.

14
00:02:10.600 --> 00:02:21.080
<v Oday>Fifty-one is fourth overall. Claude Fable 5 is at sixty, Opus 4.8 at fifty-six, GPT-5.5 at high reasoning at fifty-five.

15
00:02:21.080 --> 00:02:32.840
<v Shannon>And that's the honest read. The open top is now one long reasoning trace from the closed frontier, not a generation behind. The gap is compute you pay for on every call.

16
00:02:32.870 --> 00:02:35.190
<v Oday>So the thing to watch isn't the score.

17
00:02:35.190 --> 00:02:45.350
<v Shannon>It's token efficiency. A fifty-one that costs forty-three thousand tokens is a worse deal for most serving stacks than a forty-seven at twenty-five thousand.

18
00:02:45.350 --> 00:02:54.950
<v Shannon>Whoever ships the first open model that holds this intelligence while halving the reasoning spend wins production. The screenshot is free. The serving is not.

19
00:02:55.260 --> 00:03:02.860
<v Oday>The Justice Department is now framing xAI's unpermitted gas turbines as a matter of national, economic, and energy security.

20
00:03:02.860 --> 00:03:11.180
<v Shannon>Which is the legal cover for keeping them running. The federal government would rather fight the air-quality case than slow the compute.

21
00:03:11.180 --> 00:03:20.460
<v Oday>The subtext for every operator is the same. Training and inference now lean on behind-the-meter generation faster than permitting can move.

22
00:03:20.460 --> 00:03:35.500
<v Shannon>Power is the binding constraint, not silicon, and the rules are bending around that fact in real time. If you're planning capacity for next year, plan for generation you control, because the grid interconnect queue won't save you.

23
00:03:35.500 --> 00:03:42.860
<v Oday>Separately, Washington declined to blacklist DeepSeek even while flagging more than a hundred firms as security risks.

24
00:03:42.860 --> 00:03:56.300
<v Shannon>So DeepSeek weights stay legally usable in the US for now. But the model is one policy memo from off-limits. Don't architect a stack around a single Chinese provider you can't swap out in a week.

25
00:03:56.480 --> 00:04:12.160
<v Oday>There's a new MCP server called codebase-memory-mcp. It indexes a repo into a persistent SQLite graph of functions, calls, and routes, so an agent traverses the graph instead of reading files.

26
00:04:12.160 --> 00:04:25.440
<v Shannon>The landing page says ninety-nine percent fewer tokens, a hundred-twenty-x best case. The preprint is more honest. Ten-x, with eighty-three percent answer quality across thirty-one repos.

27
00:04:25.440 --> 00:04:33.040
<v Oday>It full-indexes the Linux kernel, twenty-eight million lines, in three minutes and answers in under a millisecond.

28
00:04:33.040 --> 00:04:48.800
<v Shannon>That part's real, and it matters. Move structural lookups off file reads and your context window stops getting eaten by grep. The catch is type resolution covers about a dozen languages well, the rest fall back to text.

29
00:04:48.800 --> 00:04:55.520
<v Oday>Also making the rounds again, a gist arguing you should stop using JWTs for sessions.

30
00:04:55.520 --> 00:05:09.600
<v Shannon>Old argument, fresh casualties. Tokens you can't revoke, an algorithm field that's a footgun. If you reached for JWTs because a tutorial said so, default to server-side sessions instead.

31
00:05:09.600 --> 00:05:19.840
<v Oday>And the HTTP QUERY method is now RFC ten-thousand-eight. A safe, idempotent way to send a body with query semantics.

32
00:05:19.840 --> 00:05:30.080
<v Shannon>It fixes the GET-with-body mess. Framework support trickles in over the next year. Useful the moment your search API outgrows the URL.

33
00:05:30.900 --> 00:05:41.060
<v Oday>A WordPress VIP survey of two thousand people found sixty percent of US consumers are put off by brands touting AI, and eighty-six percent still want original sources.

34
00:05:41.060 --> 00:05:50.500
<v Shannon>Here's the line buried in the same report. One product lifted sales of its top tiers just by removing the word AI and keeping the feature.

35
00:05:50.500 --> 00:05:53.620
<v Oday>So the advice writes itself.

36
00:05:53.620 --> 00:06:05.300
<v Shannon>Ship the capability, drop the label. And pair it with the other number today, only sixteen percent of Americans expect AI to help society. That's one trust signal, not two.

37
00:06:05.300 --> 00:06:12.980
<v Oday>Charity Majors made the engineering case in the same beat. Agents amplify whatever rigor a team already has.

38
00:06:12.980 --> 00:06:27.540
<v Shannon>Which means sloppy testing and weak observability get worse once code generation speeds up. Invest in review, CI, and rollback before you scale agent throughput. Velocity without guardrails is just faster incidents.

39
00:06:27.720 --> 00:06:38.080
<v Oday>Epic open-sourced Lore under MIT, a content-addressable version control system in Rust built for code plus multi-gigabyte binary assets.

40
00:06:38.080 --> 00:06:53.960
<v Shannon>This is the tech that ran inside Unreal. Merkle-chained revisions, content dedup, chunked uploads so editing a few kilobytes of a huge file re-uploads only that. It's aimed straight at Perforce's per-seat cost.

41
00:06:53.960 --> 00:06:59.800
<v Oday>Caveat, it's pre-1.0, and the desktop client most people touch ships as a closed binary.

42
00:06:59.800 --> 00:07:08.920
<v Shannon>So open-source the engine, keep the client proprietary. Worth tracking for small studios, not worth migrating onto this week.

43
00:07:08.920 --> 00:07:15.320
<v Oday>And Wolfram Language hit version fifteen with an AI assistant folded into the language itself.

44
00:07:15.320 --> 00:07:24.520
<v Shannon>The pairing is the interesting part. A symbolic engine that can check the model's arithmetic. That's exactly the gap most agent stacks paper over.

45
00:07:26.380 --> 00:07:34.220
<v Oday>Adam, a YC W25 company, open-sourced CADAM, an AI system for parametric CAD generation.

46
00:07:34.220 --> 00:07:43.980
<v Shannon>Hard domain. Wrong geometry is unforgiving and the training data is scarce. If they make it useful, the moat is real, because nobody else has the data.

47
00:07:43.980 --> 00:07:52.060
<v Oday>And Lago, the open-source metering layer under usage-based pricing. Consumption tracking, subscriptions, revenue analytics.

48
00:07:52.060 --> 00:08:05.180
<v Shannon>This is the plumbing every AI product needs once it bills per token instead of per seat. With reasoning spend swinging the way GLM-5.2 just showed, accurate metering is the difference between margin and a surprise.

49
00:08:05.360 --> 00:08:14.960
<v Oday>GrapheneOS has been ported to Android 17, with official releases close. Meanwhile, Volkswagen's app now refuses to run on GrapheneOS.

50
00:08:14.960 --> 00:08:29.520
<v Shannon>That's integrity attestation locking out hardened devices. If you lean on Play Integrity, know you're quietly excluding privacy-conscious users. The blocklist is a product decision, not a security necessity.

51
00:08:29.520 --> 00:08:36.080
<v Oday>And a cautionary one, a service holding user images behind a surprise five-dollar paywall.

52
00:08:36.080 --> 00:08:44.160
<v Shannon>Own your storage or assume the host monetizes your lock-in. Budget for egress and an exit before you wire anything to a free tier.

53
00:08:47.400 --> 00:08:49.480
<v Oday>Quick break — two from the desk.

54
00:08:49.480 --> 00:09:04.200
<v Shannon>One we know well: vote dot direct. If you're on an H O A or a board, it runs your elections digitally — secure, verifiable, no paper, no clipboard in the lobby. Point your council to vote dot direct.

55
00:09:04.200 --> 00:09:15.960
<v Oday>And if this is your ten minutes of A I for the day, get the written edition too. The full wire, free, every morning — leave your email at nextbig dot dev.

56
00:09:19.990 --> 00:09:25.670
<v Oday>MicroUI is a tiny immediate-mode UI library in ANSI C, zero dependencies.

57
00:09:25.670 --> 00:09:32.470
<v Shannon>Eclipse Zenoh unifies pub-sub, geo-distributed storage, and queries in one stack.

58
00:09:32.470 --> 00:09:39.030
<v Oday>Stop Killing Games failed to secure EU law despite one-point-three million signatures.

59
00:09:39.030 --> 00:09:44.230
<v Shannon>A French physicist and media star lost his doctorate after a plagiarism probe.

60
00:09:44.230 --> 00:09:48.310
<v Oday>And Bubbles.town is a Hacker News built for independent blogs.

61
00:09:49.290 --> 00:10:00.970
<v Oday>Our call: an open-weights model clears fifty-five on the Intelligence Index, matching GPT-5.5 at high reasoning, by September thirtieth, erasing the gap at the very top.

62
00:10:00.970 --> 00:10:14.490
<v Shannon>Proven wrong if no open model hits fifty-five on v4.1 or its successor by then. The gap from fifty-one to fifty-five is a longer reasoning trace, not a bigger model. Settles September thirtieth.
