WEBVTT
NOTE The Rundown — nextbig.dev daily audio edition, 2026-06-07

1
00:00:00.000 --> 00:00:08.944
<v Alex>Hey everyone, welcome to Builder's Briefing for June 7th, 2026. I'm Alex, joined as always by Sam. We've got a packed one today — a new open-source AI memory system that just topped the benchmarks, Google paying SpaceX almost a billion dollars a month for compute, and some really practical dev tool advice that could save you a lot of headaches.

2
00:00:08.944 --> 00:00:13.856
<v Sam>Yeah, there's a clear theme running through today's stories — compute is expensive, AI companies still aren't profitable, and the smart move is building lean and flexible. Let's get into it.

3
00:00:13.856 --> 00:00:22.206
<v Alex>Alright, so the big story today is MemPalace. It just dropped as the top-benchmarked open-source AI memory system, and it's completely free. If you've been duct-taping together vector stores and summarization chains and ad-hoc context windows to give your agents persistent memory — this is the project you need to look at.

4
00:00:22.206 --> 00:00:31.796
<v Sam>This is huge for anyone building agents that need to actually remember things across sessions. It handles storage, retrieval, relevance scoring — basically all the plumbing that separates a demo from something you'd actually ship. And the fact that it's open-source means you can self-host, audit the retrieval logic, customize scoring for your domain. No vendor lock-in.

5
00:00:31.796 --> 00:00:37.820
<v Alex>Right, and what's wild is the signal this sends. Six months ago, startups were raising entire funding rounds on "AI memory" as a product category. Now the best-benchmarked version is free on GitHub with over twenty-two hundred stars.

6
00:00:37.820 --> 00:00:45.989
<v Sam>The memory layer is commoditizing fast. The differentiation is moving up the stack — it's not about whether your agent has memory anymore, it's about what it does with that memory. If you're building anything memory-dependent, adopt open-source as your baseline and focus your energy on the actual application logic.

7
00:00:45.989 --> 00:00:53.046
<v Alex>Staying in the AI world, there's a really practical piece from a developer who published their methodology for teaching AI coding agents test-driven development. Basically constraining the agent to follow TDD loops — write the test, watch it fail, implement, watch it pass.

8
00:00:53.046 --> 00:01:00.258
<v Sam>That's interesting because most people just let coding agents freewheel, and then wonder why the output is unpredictable. Defining these kinds of behavioral boundaries is where the real craft of agent engineering is heading. Link in the briefing if you want the full methodology.
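
NOTE
Editor's sketch, not the methodology from the linked piece: one way the loop described above (write the test, watch it fail, implement, watch it pass) could be enforced as a gate around an agent. The names tdd_cycle, run_tests and implement are hypothetical placeholders for whatever test runner and agent step you actually use.
```python
from typing import Callable
# Hypothetical harness: run_tests returns True when the suite passes;
# implement stands in for the agent's code-editing step.
def tdd_cycle(run_tests: Callable[[], bool], implement: Callable[[], None]) -> str:
    # Red step: the new test must fail before any implementation exists.
    if run_tests():
        return "rejected: new test passed before any implementation (red step missing)"
    implement()
    # Green step: the same test must pass after the implementation.
    if not run_tests():
        return "rejected: test still failing after implementation"
    return "accepted"
```
The point of the gate is only that the agent's change is accepted when a failing run is followed by a passing one; everything else about the workflow is left open.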

9
00:01:00.258 --> 00:01:05.402
<v Alex>Also in AI news — England and Wales have told police to stop using AI for drafting court statements. A real-world example of AI-generated content hitting hard regulatory walls in high-stakes domains.

10
00:01:05.402 --> 00:01:10.650
<v Sam>If you're building AI writing tools for legal, compliance, government — anything with stakes — expect this pattern to repeat everywhere. Design for human-in-the-loop from day one. Don't bolt it on later.

11
00:01:10.650 --> 00:01:14.993
<v Alex>Okay, let's talk infrastructure, because this next number is staggering. Google is paying SpaceX nine hundred and twenty million dollars per month for compute capacity.

12
00:01:14.993 --> 00:01:20.861
<v Sam>Per month! That's over eleven billion a year. That tells you that demand for AI training and inference infrastructure has outstripped what even the hyperscalers can build themselves. Google can't build data centers fast enough.

13
00:01:20.861 --> 00:01:27.427
<v Alex>The practical implication for builders is pretty clear — compute costs aren't coming down anytime soon. Optimize your inference, use smaller models where you can, and keep an eye on whether capacity constraints start affecting your cloud provider's SLAs.

14
00:01:27.427 --> 00:01:34.691
<v Sam>And on the flip side of that, Nvidia is proposing what they're calling a beast-mode CPU architecture for Windows PCs. If that ships, local inference on desktop hardware could get a lot more viable. Relevant if you're building desktop AI apps or thinking about on-device deployment.

15
00:01:34.691 --> 00:01:39.060
<v Alex>Quick detour to a really practical dev tools story. There's a great deep-dive making the rounds on why random UUIDs as primary keys absolutely murder SQLite performance.

16
00:01:39.060 --> 00:01:45.756
<v Sam>Oh, this is one of those things that bites people hard and they don't understand why. Random UUIDs cause B-tree fragmentation, tons of page splits, slower inserts — it's a mess. And more builders are using SQLite now, especially for local-first and edge apps.

17
00:01:45.756 --> 00:01:49.194
<v Alex>The fix is simple — use ULIDs, UUID v7, or just integer primary keys. Small change, massive performance impact. Link in the briefing.
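
NOTE
Editor's sketch of the fix mentioned above, assuming Python's built-in sqlite3 module. The helper uuid7_sketch is hypothetical: it hand-builds an ID in the UUID v7 layout (48-bit millisecond timestamp, then random bits) and is not taken from the linked deep-dive. Table and column names are made up for illustration.
```python
import os
import sqlite3
import time
import uuid
def uuid7_sketch() -> uuid.UUID:
    """Hypothetical helper: 48-bit Unix-ms timestamp followed by random bits."""
    ts_ms = time.time_ns() // 1_000_000
    raw = bytearray(ts_ms.to_bytes(6, "big") + os.urandom(10))
    raw[6] = (raw[6] & 0x0F) | 0x70  # set the version nibble to 7
    raw[8] = (raw[8] & 0x3F) | 0x80  # set the RFC variant bits to 10
    return uuid.UUID(bytes=bytes(raw))
def demo() -> list[bytes]:
    conn = sqlite3.connect(":memory:")
    # Option 1: INTEGER PRIMARY KEY, which is an alias for SQLite's rowid.
    conn.execute("CREATE TABLE events_int (id INTEGER PRIMARY KEY, body TEXT)")
    # Option 2: a 16-byte time-ordered key, so new rows sort after older ones.
    conn.execute("CREATE TABLE events_v7 (id BLOB PRIMARY KEY, body TEXT) WITHOUT ROWID")
    for i in range(3):
        conn.execute("INSERT INTO events_int (body) VALUES (?)", (f"event {i}",))
        conn.execute("INSERT INTO events_v7 VALUES (?, ?)", (uuid7_sketch().bytes, f"event {i}"))
        time.sleep(0.002)  # keep the millisecond timestamps distinct in this tiny demo
    return [row[0] for row in conn.execute("SELECT id FROM events_v7 ORDER BY id")]
```
Because the timestamp occupies the leading bytes, keys created later compare greater than earlier ones, which is what keeps inserts near the end of the B-tree instead of scattering them across pages.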

18
00:01:49.194 --> 00:01:52.580
<v Sam>Honestly, that's the kind of thing you should grep your codebase for today. Five-minute fix that could save you real pain at scale.

19
00:01:52.580 --> 00:01:56.329
<v Alex>Over in startups and funding: the S&P 500 index committee has rejected SpaceX and won't waive its profitability rules for OpenAI or Anthropic.

20
00:01:56.329 --> 00:02:01.757
<v Sam>This is telling. The biggest AI companies in the world still can't meet basic profitability requirements for index inclusion. It confirms what a lot of us suspect — unit economics in AI haven't been solved yet.

21
00:02:01.757 --> 00:02:06.695
<v Alex>And if you're building on top of these players' APIs, that should make you a little nervous. API pricing could shift as they chase profitability. Architect for provider flexibility.

22
00:02:06.695 --> 00:02:13.571
<v Sam>Which actually connects back to a quick hit — there's an open-source Go tool called go-stock for local-first AI stock analysis that supports DeepSeek, OpenAI, Ollama, and more. It's a nice reference architecture for how to stay backend-agnostic across LLM providers.
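
NOTE
Editor's sketch of what "backend-agnostic" can look like in practice. This is not go-stock's code (that project is written in Go); it is a minimal Python illustration, and every name in it (ProviderRegistry, fake_local, fake_hosted) is hypothetical. Real provider SDK calls would live inside the adapter functions.
```python
from dataclasses import dataclass
from typing import Callable, Dict
# An adapter is any function that takes a prompt and returns a completion.
Completer = Callable[[str], str]
@dataclass
class ProviderRegistry:
    providers: Dict[str, Completer]
    default: str
    def complete(self, prompt: str, provider: str = "") -> str:
        # Fall back to the configured default when no provider is named.
        name = provider or self.default
        if name not in self.providers:
            raise KeyError(f"unknown provider: {name}")
        return self.providers[name](prompt)
# Stand-in adapters; a real one would wrap a local model or a hosted API.
def fake_local(prompt: str) -> str:
    return f"[local] {prompt}"
def fake_hosted(prompt: str) -> str:
    return f"[hosted] {prompt}"
registry = ProviderRegistry({"local": fake_local, "hosted": fake_hosted}, default="local")
```
Application code calls registry.complete(...) and never imports a provider SDK directly, so switching backends when pricing changes becomes a one-line configuration change.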

23
00:02:13.571 --> 00:02:20.396
<v Alex>A couple more quick hits — OpenAI's Whisper repo is trending again, still the go-to for self-hosted speech recognition. And someone ported Pokémon Emerald to WebAssembly and it's hitting a hundred thousand FPS, which is just a delightful WASM performance showcase.

24
00:02:20.396 --> 00:02:22.438
<v Sam>A hundred thousand FPS on a Game Boy Advance game. That's absurd and I love it.

25
00:02:22.438 --> 00:02:27.686
<v Alex>So here's the takeaway for today. The AI memory layer is going open-source and commoditizing — if you're building agents, stop rolling your own memory and adopt something like MemPalace as your baseline.

26
00:02:27.686 --> 00:02:33.735
<v Sam>And the Google-SpaceX deal and the S&P rejections are pointing at the same thing — inference costs are high, the big players aren't profitable, and pricing is going to shift. Build lean, build flexible, optimize aggressively for cost.

27
00:02:33.735 --> 00:02:37.044
<v Alex>That's the briefing for June 7th. All the links are in the show notes. We'll be back tomorrow — until then, ship something good.

28
00:02:37.044 --> 00:02:38.000
<v Sam>See you all tomorrow. Happy building.
