# A Chinese open model caught Claude this week, and the moat that's left isn't the model

> China's open models caught the Western frontier this week: Semgrep's cyber eval put Zhipu's GLM 5.2 level with Claude, at a fraction of the cost, with the weights open to download. Why the closed labs' moat is no longer the model.

- Published: Monday, June 29, 2026 (2026-06-29)
- Publisher: nextbig.dev — daily AI & compute briefing, written by Oday Brahem with nextbig.dev's AI agent
- Sources analyzed: 9 articles from 300+ curated accounts
- Canonical URL: https://www.nextbig.dev/daily/2026-06-29

## The Big Story

### A Chinese open model caught Claude this week, and the moat that's left isn't the model

A security shop called Semgrep published its cyber-eval numbers this week under a blunt title: "we have Mythos at home." The claim was that GLM 5.2, the open model out of Beijing's Zhipu, matched or beat Claude on the security tasks they run, at a fraction of the cost. One vendor's benchmark is one data point, and cyber evals are noisy. But it lands in a week where the direction is no longer in question. The best open model in the world now comes from China, ships its weights for anyone to download, and trades blows with the strongest model a Western lab will rent you.

That collapses a gap the whole business was priced on. For two years the pitch for a closed frontier model was one sentence: it is meaningfully better than anything you can run yourself, so the premium and the lock-in are worth it. When an open model you can download matches Claude on the work your engineers actually do, that pitch needs a second sentence, and nobody has written it yet. The capability is no longer the product. What is left to sell is everything wrapped around it: the uptime, the integrations, the support, the promise that the thing answers when you call it.

We should own a miss here. Eight days ago this desk put Zhipu on the tape as a short, reading it as a fast follower that would stay a step behind. GLM 5.2 makes that wrong. What we underweighted is now the whole game. Open models are closing the gap by distilling the closed ones, and a paper doing exactly that circulated again this week, next to a wave of Asian labs shipping Mythos-class systems and fresh open releases from Zyphra, Cohere, and Poolside. You do not have to out-research the frontier if you can copy its outputs cheaply and give the result away.

Put this beside Sunday. Our last edition was about China taking the supercomputer crown back and shipping a server CPU on its own architecture. The hardware-sovereignty story and the model story are the same story now, one layer apart. Beijing is assembling a stack it controls end to end: the chips it makes, the memory it is racing the world for, and models that match the West. It is handing the software layer out for free while everyone else meters it by the token. An open weight is an industrial-policy instrument as much as a product.

For anyone building, the move is concrete. You can now stand up a Claude-class model on your own hardware, behind your own walls, at something like a tenth of the per-token cost, with no one able to rate-limit you or revoke your access. That is real control, and it is worth the work of running it yourself. What you give up is the wrapper, the managed uptime and the contract, which is exactly the thing the closed labs now have to be worth. The benchmark that should worry Anthropic and OpenAI is not GLM 5.2's cyber score. It is the invoice a mid-size team writes the first month it stops paying by the token.

Source: @semgrep — https://semgrep.dev/blog/2026/we-have-mythos-at-home-glm-52-beats-claude-in-our-cyber-benchmarks/

## Models & Open Weights

### GLM 5.2 matched Claude on Semgrep's cyber benchmark, at a fraction of the price

Semgrep ran its security-agent evaluations against Zhipu's open GLM 5.2 and published the result under the title "we have Mythos at home": the model traded blows with Claude on the tasks they test while costing far less to run. Treat one vendor's cyber eval as a single, noisy data point. The signal is that an open, downloadable model is now close enough to argue about, on the exact class of task where the closed labs claimed a durable lead.

Source: @semgrep — https://semgrep.dev/blog/2026/we-have-mythos-at-home-glm-52-beats-claude-in-our-cyber-benchmarks/

### Asian labs are shipping Mythos-class open models while the export ban drags on

A run of Asian startups launched open models aimed squarely at Anthropic's Mythos tier, framing restricted Western access as the opening. The pattern is consistent: take the frontier's published behavior as a target, train an open model to hit it, release the weights. Export controls meant to slow rivals are instead seeding a parallel, unrestricted supply.

Source: @techcrunch — https://techcrunch.com/2026/06/27/asian-ai-startups-launch-mythos-like-models-as-anthropics-export-ban-drags-on/

### The open ecosystem is widening, not just deepening: Zyphra, Cohere, Poolside

Nathan Lambert's latest open-artifacts roundup tracks new releases from Zyphra, Cohere, and Poolside, spreading capability across more labs and more license terms. Breadth matters as much as any single benchmark. A field with a dozen credible open models is far harder for one closed vendor to out-run than a field with one.

Source: @interconnects — https://www.interconnects.ai/p/artifacts-22-zyphra-cohere-and-poolside

## How They Closed the Gap

### The method is distillation: copy the closed model's outputs, cheaply

A 2024 paper on knowledge distillation of black-box language models made the rounds again this week, because it is the playbook. You do not need the frontier lab's data or compute if you can query its model, harvest the outputs, and train a smaller open model to imitate them. It turns a closed API into free training signal, and it is most of how the gap got this small this fast.

Source: @arxiv — https://arxiv.org/abs/2401.07013

### Training is becoming reproducible code, and the token math is shifting under it

Two threads that circulated this week point the same way: "model training as code" pushes pipelines toward versioned, reproducible builds anyone can rerun, and the "tokenmaxxing is dead" argument says raw token throughput has stopped being the thing to optimize. Together they lower the skill and spend it takes to get a competitive open model out the door.

Source: @Aleph__Alpha — https://aleph-alpha.com/en/blog/model-training-as-code/

## From Our Desk

### Sunday: China took the supercomputer crown and shipped its own server CPU

We argued the hardware-sovereignty story was accelerating: a new number-one supercomputer and a homegrown server chip on a domestic architecture. Today's open-model surge is the same push, one layer up the stack. Read them together.

Source: @nextbigdev — https://www.nextbig.dev/daily/2026-06-28

### Friday: OpenAI gated its best model to about twenty approved partners

Sol shipped to a government-approved list and no one else. Set that next to an open model anyone can download today, and the two frontier strategies could not be further apart: one rations access by permission, the other gives the weights away.

Source: @nextbigdev — https://www.nextbig.dev/daily/2026-06-26

### Wednesday: Claude rationed capacity at peak hours

Anthropic returned overload errors at the exact hours US teams ship. The open model that caught it this week has no such ceiling, because you run it yourself. Availability keeps turning out to be the part of the pitch that bites.

Source: @nextbigdev — https://www.nextbig.dev/daily/2026-06-24

## The Takeaway

Two weeks ago the safe read was that the closed labs owned capability and everyone else fought over price. That read is gone. An open model out of Beijing now matches Claude on a working security eval, ships its weights, and runs for a fraction of the cost, and it got there by copying the frontier's outputs rather than out-spending its research. Put it beside Sunday's edition and the shape is clear: China is building a stack it controls from the chips up and giving the software layer away. For builders the upside is real. You can run a Claude-class model on your own hardware, behind your own walls, with no one able to throttle or revoke it. The cost is the wrapper the closed labs sell, the uptime and the integrations and the support, which is now the only thing they can charge the premium for. The number to watch is not GLM 5.2's benchmark. It is how many teams stop paying by the token next quarter.

## The Call

Within six months, a Chinese open-weights model holds the number-one position on at least one major public leaderboard that also ranks the closed frontier models, and holds it for a sustained stretch rather than a single-day spike.

The case: GLM 5.2 already trades blows with Claude on a working cyber eval, and the gap it closed was closed by distillation, not by a research lead that takes years to build. Open weights mean the whole world tunes the model in parallel, and a fraction-of-the-cost serving price means adoption compounds on itself. When an open model is this close, the top of a public leaderboard is a few months of community tuning away, not a generation.

What proves us wrong: If, by December 29, 2026, no Chinese open-weights model has held the top position on a major public leaderboard that also ranks GPT, Claude, or Gemini, for a sustained period rather than a single day, the call is wrong.

Settles: by December 29, 2026

## The Tape

The market desk's signals from the day's verified wire. Falsifiable analysis, settled in public — not individualized investment advice.

### WATCH Z.ai / Zhipu AI — medium conviction

We were short this eight days ago, reading Zhipu as a fast follower that would stay a step behind. GLM 5.2 matching Claude on a cyber eval makes that wrong. We are covering the short to watch: the open-model leader is now a real threat to the closed premium, but it monetizes weakly by design.

The mechanism: An open-weights leader captures mindshare and displaces closed-model spend without capturing much of the revenue it destroys. The value shows up on customers' invoices as savings, not on Zhipu's as sales.

Wrong if: Zhipu converts the GLM lead into a paid API or enterprise business that materially grows its revenue, or a US restriction removes it from the Western conversation entirely.

Settles: 6 months

### WATCH GOOGL (Alphabet) — medium conviction

Of the Western labs, Google is the one built to absorb an open-cost shock. It already ships open weights with Gemma and runs Gemini on its own TPUs, so it can meet a price war without renting someone else's chips.

The mechanism: The open-model surge pressures per-token pricing across the board. A vendor that owns its silicon and already runs an open line has the most room to cut and the least margin to lose.

Wrong if: Google cedes price-sensitive share to open models anyway, or its TPU cost advantage fails to show up in Gemini API pricing over the next two quarters.

Settles: 6 months

### WATCH Anthropic — medium conviction

Anthropic is the closed lab most exposed to this. Its whole pitch is the best model, and an open model just caught it on a security eval in the same week it was rationing capacity. The answer has to be the model or the availability, not the marketing.

The mechanism: When the capability gap narrows, a pure-play frontier lab with no chips of its own and a premium price has the least cover. It either re-opens a clear capability lead or it competes on the one thing weights cannot copy: being reliably callable and deeply integrated where the work happens.

Wrong if: Anthropic ships a release that restores a clear, durable capability lead, or a capacity-and-integration story that visibly holds enterprise share despite the price gap.

Settles: 6 months

### WATCH NVDA (Nvidia) — low conviction

Cheaper open models expand total inference demand, which is good for compute. But China's best open models increasingly train and serve on domestic silicon, so a growing share of the marginal token from this trend runs on chips Nvidia is not allowed to sell into China.

The mechanism: Open-weights adoption lifts aggregate inference, a tailwind. Export controls and China's homegrown-chip push route more of that inference off Nvidia hardware, a partial offset that gets larger the more the open-China stack matures.

Wrong if: Open-model inference growth shows up clearly in Nvidia's data-center revenue over the next two quarters, or China's domestic accelerators fail to take share in training the next GLM.

Settles: 6 months

---
Cite as: "nextbig.dev Daily AI Briefing, 2026-06-29" — https://www.nextbig.dev/daily/2026-06-29