nextbig.dev
Vancouver, B.C. · Intelligence on AI and the machines that run it
nextbig.dev
The Briefing · Friday, July 3, 2026

"Right to Local Intelligence" got a manifesto this week, and the models to make it real

A "Right to Local Intelligence" manifesto trended this week beside working guides for running state-of-the-art models on your own hardware, and beside Virginia banning geolocation-data sales and warnings of an American privacy emergency. Why cheap open weights plus a privacy mandate is turning on-prem inference from a hobby into a purchase order.

8 min read
The Rundown No. 132 · Audio Edition · 5 min All episodesRSSMP3
0:00 / 5:13
VTT
The Big Story
"Right to Local Intelligence" got a manifesto this week, and the models to make it real

A manifesto went up this week under a plain address, righttointelligence.org, arguing that access to capable AI you run yourself is turning into a civil liberty rather than a consumer product. On most weeks that reads as activist branding running ahead of reality. This week it trended next to a working guide for running frontier-class models locally, and the pairing is the story: the argument and the how-to arrived on the same day. The slogan is early. The capability under it is not.

What makes the local option real is the same run of news we have covered all week. An open Chinese model caught Claude nine days ago. Sonnet 5 put flagship-grade agents at mid-tier prices on Tuesday. Kimi and GLM landed inside the tools developers already use on Wednesday. Stack those and the practical result is a model you can download, run on a single workstation, and point at your own code and documents, close enough to the frontier for most real work. A year ago running your own model meant paying a heavy capability tax. This week that tax got small enough to argue about.

The other half of the story is why anyone would bother, and the week supplied that too. Virginia banned the sale of geolocation data outright. Scott Aaronson, not a man given to alarms, wrote about an American privacy emergency. Grant the honest objection first: most people will never run a local model, convenience beats principle nearly every time, and a right nobody exercises is just a slogan. Fair. Now the finer cut. The people who most need their data to stay put, in health, in law, in finance, in any regulated shop, are the ones for whom a local model stops being a hobby and becomes compliance. A hospital that cannot send patient notes to a cloud model can now run a capable one inside the building.

Name the category correctly and the closed labs' exposure comes into view. The pure API business sells intelligence as a metered service you reach over the wire, which works right up until the data you would send is data you are legally or commercially bound to keep home. Every regulated buyer who wakes up to that is a customer the metered API cannot fully serve and an on-prem open weight can. This is the slice of the market the whole month's price collapse was quietly building toward. Cheap capable weights plus a privacy mandate is how on-prem inference goes from a niche to a purchase order.

For anyone building, take this literally now. If you serve a regulated vertical, a local or on-prem deployment of an open model is a feature you can sell this quarter, and the guides to stand one up sit on the same front page as the manifesto. Price it against what your customers already pay to keep data in a cloud they do not fully trust. For everyone else the near-term reality is softer: you will keep calling an API because it is easier, and that is fine. The thing that changed this week is the option. The leverage in any negotiation with a model vendor is a credible ability to walk, and for the first time that ability is something you can download.

@righttointel Read source
Local Intelligence

A "Right to Local Intelligence" manifesto makes the political case

The argument at righttointelligence.org is that running capable models on hardware you own is becoming a liberty worth protecting, not a niche preference. The framing is early and the movement is small. What gives it weight this week is timing: it landed alongside the cheap, capable open models that make running your own a real choice rather than a compromise.

A working guide to running frontier open models on your own machine

The local-LLM guide from Jamesob is the how-to under the slogan: the concrete steps to stand up a frontier-class open model like GLM 5.2 or Kimi on your own machine, today. The manifesto is only as real as the tooling beneath it. When the practical guide and the political argument trend together, the idea has left the seminar and reached the workstation.

The Forcing Function

Virginia bans the sale of geolocation data outright

Virginia moved from consent theater to a flat prohibition on selling location data, one of the hardest privacy lines any US state has drawn. Rules like this are what turn data-locality from a preference into a legal constraint, and a legal constraint is what pushes a regulated buyer toward a model that runs where the data already lives.

Scott Aaronson calls it an American privacy emergency

Aaronson is not an alarmist by temperament, which is why the post lands. The through-line to this edition is direct: as the alarm about where personal data goes gets louder, sending it to a metered cloud model becomes the exact behavior buyers and regulators want to stop, and the local option becomes the answer that does not require trusting anyone.

From Our Desk

Wednesday: open models walked into the tools you already use

Kimi K2.7 Code went generally available in GitHub Copilot; z.ai shipped a GLM 5.2 harness. Distribution was the missing piece, and it arrived. The same models now one dropdown away in Copilot are the ones you can pull down and run offline, which is what makes the local case more than theory.

Tuesday: the price of a capable model fell to where local makes sense

Sonnet 5 cut the price of a hosted agent, but the deeper move all week has been open weights getting good enough to self-host. Cheap hosted and free-to-run are two answers to the same fact: raw capability is no longer scarce. Only one of those answers keeps your data in the building.

The Takeaway

The run of news that started with an open model catching Claude has a politics now, and it showed up this week as a "Right to Local Intelligence" manifesto trending next to a guide for running frontier-class models on your own machine. Treat the slogan skeptically: most people will never self-host, and convenience wins almost every time. Then follow it to the buyer it is actually for. Virginia just banned selling geolocation data, Scott Aaronson is warning of a privacy emergency, and any shop in health, law, or finance that legally cannot ship data to a metered cloud now has an alternative that did not exist a year ago: a capable open model running inside its own walls. That is the slice of the market the whole month's price collapse was building toward, and it is the one the pure-API labs can least serve. For builders in a regulated vertical, on-prem open-model deployment is a feature to sell this quarter, not a research project. For everyone else, the option to walk is now a download.

The Call C-20260703

Within six months, at least one venture-backed company raises $20M or more explicitly to sell on-prem or local deployment of frontier-class open models to regulated industries, positioning on the promise that the data never leaves the customer's control.

The case

Two things came true at once this week. Open weights are now close enough to the frontier to run real work on a single workstation, and privacy law is tightening, with Virginia banning geolocation-data sales and the alarm about data control rising. Buyers in health, law, and finance cannot send regulated data to a metered API, and now they have a capable alternative that stays in the building. Funding follows a requirement, and data-locality is becoming one.

What proves us wrong

If, by January 3, 2027, no venture-backed company has raised at least $20M explicitly to sell on-prem or local open-model deployment to regulated industries on a data-sovereignty pitch, the call is wrong.

Settles by January 3, 2027
The Tape T-20260703
◆ Watch AMD AMD medium conviction

On-prem and local inference favors accessible silicon outside the hyperscale datacenter, and AMD spans that range from workstation parts to MI-series accelerators. If the right-to-local-intelligence thesis converts into regulated buyers standing up their own inference, it is a demand pool that does not run through a cloud contract.

Local and on-prem deployment shifts inference spend toward hardware a customer owns and operates. AMD is positioned across that stack and priced below the datacenter leader, which is the combination on-prem buyers optimize for.

Wrong if On-prem inference stays a niche and regulated buyers accept private-cloud or VPC offerings from the closed labs instead, or AMD fails to convert the opening into share over the next two quarters. Settles 9 months
◆ Watch Private Anthropic low conviction

The metered-API model is the least able to serve data that legally cannot leave the building. Anthropic has enterprise and VPC options that soften this, which is why it is a watch and not a short, but the regulated-data slice is a real gap an open weight running on-prem can fill and a pure API cannot.

As privacy law tightens, the buyers who cannot ship data to a cloud become a segment the API-first labs structurally under-serve. Their answer has to be a genuine on-prem or fully isolated offering, not just a compliance page.

Wrong if Anthropic ships a credible on-prem or fully isolated deployment that regulated buyers accept, keeping the segment inside its business rather than ceding it to open weights. Settles 6 months
◆ Watch NVDA Nvidia low conviction

On-prem inference expands total compute demand, a tailwind, but it moves that demand out of the hyperscale cloud where Nvidia's volume economics are cleanest and into smaller, mixed-vendor deployments where buyers shop harder on price.

More places running inference is more accelerators sold overall. The mix shift toward on-prem and toward cost-sensitive buyers is where cheaper alternatives get their opening, which is the offset to the volume tailwind.

Wrong if On-prem inference growth shows up cleanly in Nvidia's numbers without margin pressure, or the local thesis fails to move real deployment volume over the next two quarters. Settles 9 months
Desk signals from the day's verified wire — falsifiable, dated, settled in public. Analysis, not individualized investment advice.

Get this briefing in your inbox

What changed in AI and compute, what it costs, and what to build. One email per week. No spam, unsubscribe anytime.