nextbig.dev
Vancouver, B.C. · Intelligence on AI and the machines that run it
nextbig.dev
The Briefing · Sunday, June 28, 2026

Wall Street goes hunting for the next Nvidia and lands on the people who make memory

Wall Street is pricing Micron like the next Nvidia and Lenovo says the memory shortage is permanent. Plus China retakes the supercomputer crown, AI coding agents tricked into installing malware, and the open-model field widens.

6 min read
The Rundown No. 127 · Audio Edition · 6 min All episodesRSSMP3
0:00 / 6:05
VTT
The Big Story
Wall Street goes hunting for the next Nvidia and lands on the people who make memory

Wall Street spent two years treating one company as the entire AI trade. This week it went looking for the next one and stopped at the memory makers. Micron, which sells DRAM and the high-bandwidth memory stacked onto every AI accelerator, is now the name investors reach for when they want Nvidia-shaped returns at a lower entry price. The pitch is simple. Every accelerator that ships needs memory around it, there are far fewer memory suppliers than there are buyers, and the buyers are the most cash-rich companies on earth.

The supply side agrees. At ISC 2026 this week, a Lenovo executive said the memory and storage shortage the industry started calling RAMageddon is not a spike but the new baseline. His words were that it will never be like it was last year. That is a hardware vendor telling its own customers to stop waiting for prices to fall back.

It is not only a datacenter problem. Memory has quietly become one of the most expensive parts of building any computer, and the global DRAM shortage now reaches all the way down to someone speccing a single desktop. The same constraint that sets a hyperscaler's buildout cost decides whether a small shop can afford 32 gigabytes this quarter.

The reason memory became the choke point is structural. Training and serving large models is bound by how much fast memory sits next to the compute as much as by the compute itself. Weights, context, and KV caches all live in memory, and high-bandwidth memory is hard to fabricate and concentrated among a handful of suppliers. When accelerator demand went vertical, the scarce input was never going to be only the logic die. It was going to be the thing bolted next to it.

For anyone planning hardware or forecasting infrastructure cost, the cheap-memory assumption is the one to retire this quarter. Treat memory price and availability as a first-order line in the budget, not a rounding error. Lock supply where you can, size context windows and batch jobs with the memory bill in view, and read every accelerator roadmap as a memory roadmap too. The compute story just became a memory story, and the people who sell it already know it.

@techcrunch Read source
Compute & Silicon

China retakes the supercomputer crown with LineShine, its first TOP500 number one since 2018

Despite trade restrictions, China has reclaimed the world's fastest supercomputer for the first time since 2018. A system called LineShine pushed the US El Capitan out of the top spot on the TOP500 ranking. The win arrives while export controls are meant to be slowing exactly this, which suggests the restrictions are reshaping where the frontier gets built more than whether it gets built.

China's Loongson ships a homegrown 16-core server CPU on its own architecture

Loongson unveiled the 3C3000 on June 26, a 16-core server processor built on its in-house LoongArch architecture. It is a 40-watt chip with DDR4 ECC and 32 PCIe lanes aimed at cheap general-purpose file, database, and web servers. It is not a frontier part, and that is the point: a domestic stack for the unglamorous server middle, built so it does not depend on anyone else's export policy.

Agents & Security

Mozilla's 0din team shows AI coding agents can be talked into installing malware from 'clean' repos

Researchers at Mozilla's 0din group demonstrated that a coding agent like Claude Code can be exploited through its own helpfulness, walked into installing malware from a GitHub repository that looks clean. The attack does not break the model so much as use its eagerness to be useful. If you let an agent run setup steps from a repo, that repo is now part of your threat model.

A request to let OpenAI Codex exclude sensitive files is still open, and builders noticed

An issue asking for a way to stop OpenAI's Codex from reading sensitive files remains open, and it climbed Hacker News to 132 points. It is the same worry as the 0din attack from the other direction: when an agent has broad read access to your tree, what it can see becomes a question you answer before you run it, not after.

Models & the Wider World

The open-model field keeps widening: Zyphra, Cohere, and Poolside add to the pile

Nathan Lambert's latest open-artifacts roundup makes a quiet point. A year ago open releases came from a short list of labs, and now Zyphra, Cohere, Poolside and a growing crowd are shipping a wide range of open models. The frontier headlines go to the closed labs, but the usable, buildable middle is getting deeper and more competitive every month.

Prosecutors entered ChatGPT logs as evidence in the Palisades fire trial

In the arson case tied to one of LA's deadliest wildfires, prosecutors used a defendant's ChatGPT logs alongside location data to build their case. Whatever the verdict, the precedent is the story: chatbot history is now discoverable evidence, and what people type into an assistant carries the same weight as any other record.

The Takeaway

Two kinds of scarcity ran through the day. Memory got expensive enough that Wall Street started pricing Micron like Nvidia and Lenovo called the shortage permanent. Compute got contested enough that China retook the supercomputer crown and shipped a server CPU on its own architecture. The thread for builders is the same in both: the inputs you used to take for granted, cheap memory, available chips, and the belief that a clean-looking repo is clean, are now things to price, secure, and verify before you build on them.

The Call C-20260628

Within six months, the AI-hardware story stops being a GPU story alone. At least one major memory maker (Micron or SK Hynix) reports a quarter in which data-center memory revenue grows faster year over year than Nvidia's data-center segment, and memory supply shows up as a named constraint in at least one hyperscaler's capex guidance.

The case

Lenovo is telling customers the shortage is permanent, Wall Street is already repricing Micron as the next Nvidia, and high-bandwidth memory is co-scarce with the accelerators it ships beside while having far fewer suppliers. When the tightest-supply input is also a high-margin one, the money and the language tend to follow it.

What proves us wrong

If, by December 28, 2026, no major memory maker has reported data-center memory revenue growing faster year over year than Nvidia's data-center segment, and no hyperscaler has named memory supply as a capex constraint in its guidance, the call is wrong.

Settles by December 28, 2026
The Tape T-20260628
▲ Long MU Micron Technology medium conviction

The AI-hardware trade is rotating from the logic die to the memory beside it. Micron sells the DRAM and high-bandwidth memory every accelerator needs, and supply is tight enough that Lenovo is now calling the shortage permanent.

Wall Street is openly hunting for the next Nvidia and naming Micron. HBM is co-scarce with accelerators and has few suppliers, so pricing power sits with the makers while demand stays vertical.

Wrong if A DRAM or HBM supply glut, or an AI capex pullback, that sends memory prices and Micron's data-center revenue growth down over the next two quarters. Settles 6 months
▲ Long 000660.KS SK Hynix medium conviction

Same memory-cycle thesis at the HBM leader. If fast memory is the scarce layer of the AI stack, the dominant high-bandwidth-memory supplier captures the largest share of the repricing.

Vendor commentary frames the DRAM and storage shortage as structural, not a spike. HBM demand tracks accelerator shipments, which are not slowing, and SK Hynix leads that market.

Wrong if HBM oversupply, or a competitor yield breakthrough, that compresses high-bandwidth-memory margins within two quarters. Settles 6 months
◆ Watch 0981.HK SMIC low conviction

China just retook the TOP500 top spot with LineShine and shipped a homegrown LoongArch server CPU, both under export controls. Domestic compute is being built regardless of restrictions, and SMIC is the foundry behind that stack.

A number-one supercomputer ranking and a domestic server chip show a working homegrown pipeline. Export controls appear to be redirecting China's buildout inward rather than stopping it.

Wrong if Tighter, effective enforcement that visibly stalls China's leading-edge output, or evidence the supercomputer and CPU lean on smuggled foreign silicon rather than domestic fabs. Settles 9 months
Desk signals from the day's verified wire — falsifiable, dated, settled in public. Analysis, not individualized investment advice.

Get this briefing in your inbox

What changed in AI and compute, what it costs, and what to build. One email per week. No spam, unsubscribe anytime.