The Wire · Anthropic
Claude, the research, and Anthropic's product moves — the lab whose models power much of what builders ship, this site included.
GLM-5.2 ships open MIT weights with coding scores four points behind Claude Opus and a sixth the cost, 48 hours after US export controls pulled a frontier model.
Recurrent triggers that wake an agent on a schedule and survive container restarts would unblock cron-style autonomous workflows.
Read full story →Non-Latin scripts tokenize less efficiently, so the same prompt costs multiples more in some languages.
Read full story →An agent-native planning step that sketches a solution shape before writing any code, aimed at reducing wasted generation.
Read full story →113 points, 83 comments on HN
A projection that next-gen models could sustain 100-hour autonomous task horizons on METR, if it holds.
The AlphaFold lead joins Anthropic amid a broader exodus from Google DeepMind, a signal of where research talent is concentrating.
History from PGP to spyware suggests new controls on Anthropic-linked tools won't stop determined actors.
Surfaced code points to a standalone Claude environment that runs Claude Code and persists, going well past the current desktop chat.
67 points, 34 comments on HN
JPMorgan cuts Hong Kong staff off Claude, a sign export-style controls now reach into who can call frontier APIs.
Nobel-winning protein researcher's move signals Anthropic's push into scientific reasoning and a talent drain at DeepMind.
Commentary on two senior departures days apart, framing it as a talent blow without new figures or terms.
Claude Fable 5 tops the eval at $31 per task while DeepSeek V4 Flash runs ~$0.04, so model choice can swing inference spend three orders of magnitude.
The standard aims to keep existing enterprise apps central, blunting OpenAI and Anthropic's drive to make agents the control layer.
Coding agents move into the ticket queue, so work can route to Claude from the assignee dropdown or a comment.
An independent briefing for builders: the whole field read continuously, every story scored for relevance, and the noise left off the page.
300+ curated sources. Every story scored 1–10 for builder relevance by Claude's frontier model. The filler never makes it to the page.
GPUs, datacenters, power deals, and inference economics: the infrastructure layer that decides what every builder pays. Our signature coverage.
Every story is sourced. Every score is computed. We show our work and link to originals.
Every briefing closes with The Call: one falsifiable claim with a date on it. When we're wrong, we say so in print. Opinions are cheap; ours get scored.