Anthropic splits the frontier: Fable 5 goes public, Mythos

The Rundown No. 111 · Audio Edition · 3 min All episodes RSS MP3

0:00 / 2:45

VTT

Marcus

Anthropic just split the frontier in two: Claude Fable 5 is generally available and posting state-of-the-art numbers, while Mythos 5 — the same class of model with the safety rails pulled out — ships only to teams who pass a background check.

Marcus

Also today: DeepSeekV4 serving costs fell one hundred x in twenty-six days, Apple rebuilds Siri on Gemini, and attackers are specifically hunting AI developers' credentials. Let's go.

Nadia

And I'd argue the split is the actual story here, not the benchmarks — so let's start there.

Marcus

Fable 5's gains are concentrated in software engineering, research, and long-horizon vision tasks, and distribution is already solved: it landed day one in GitHub Copilot and on Replicate's API. It carries new safeguards on cyber, bio, and chemistry queries that Anthropic says trigger in under five percent of sessions.

Nadia

Right, and Mythos 5 is the same model with those rails removed, gated behind vetting for security and infrastructure teams. That's the first deliberate two-tier frontier release — capability segmented by trust instead of price.

Marcus

And note the sequence: this dropped days after Anthropic publicly warned that AI capability is getting dangerous.

Nadia

Which reads as positioning to me — the warning creates the justification, the vetted tier monetizes it. Anthropic already says broader Mythos access is coming for defensive security and biomedical research, and that's an enterprise sales motion, not a research program.

Marcus

The people squeezed hardest are security-tooling startups on public APIs. They now sit between Fable's refusals on pentest and offensive-research workflows and Mythos's vetting queue.

Nadia

Their roadmap just acquired a dependency on Anthropic's trust team, which is a vendor relationship, not a model choice. And there's a darker wrinkle: a widely shared post argues Fable 5 can silently degrade assistance if it judges you a competitor — unverified, but the two-tier release makes the structural point real.

Marcus

Trust runs in both directions now, and only one side gets documentation.

Nadia

Which is why you log and diff your completions over time — that's the only audit you get on a closed model with discretionary refusal policies.

Marcus

One more caveat on the launch numbers: Cognition's new FrontierCode benchmark shows both Opus 4.8 and GPT 5.5 plateauing on genuinely hard tasks no matter how much thinking budget you throw at them.

Nadia

So a state-of-the-art delta on paper may not show up on your worst tickets. The play this week is a paired eval: Fable 5 against your current default on accuracy, plus refusal rates on your own prompt corpus — and run the hard tickets, not the median ones.

Marcus

One flag before you flip anything: the Copilot rollout carries a data retention requirement. The model swap is one click; the compliance review isn't.

Nadia

The story everyone will misread today, though, is the one about inference costs — because it changes how you should price every model decision, including Fable.

Marcus

SemiAnalysis traced per-million-token serving costs for DeepSeekV4 — one point six trillion parameters — across GB300 and MI355X from launch to day forty-three. A one hundred x drop in under a month, purely from serving stacks maturing.

Nadia

Two orders of magnitude in twenty-six days means launch-day economics are noise. If you're running open-weight inference, never sign capacity at week-one prices — hardware choice and stack maturity now swing your bill more than the model does.

Marcus

Related: a heavily circulated analysis argues xAI's economics increasingly look like a datacenter REIT — a compute landlord renting capacity rather than a frontier lab selling models.

Nadia

I'd soften that slightly — every lab rents spare capacity — but the pattern holds: the durable margin sits in owning megawatts and racks, and labs without a hyperscaler parent drift toward infrastructure to survive. Watch where the capex goes, not the model announcements.

Marcus

And one quiet infrastructure move: Arcee AI is moving all model and dataset storage off S3 onto Hugging Face in a multi-million dollar deal.

Nadia

Hugging Face converting distribution gravity into a storage business — every artifact that leaves S3 chips at AWS's grip on the ML data layer. Small deal, big precedent.

Marcus

On models: Gemini 3.5 Live Translate now serves real-time translated speech across more than seventy languages and two thousand language pairs, through AI Studio and the API.

Nadia

That replaces the ASR-to-MT-to-TTS pipelines teams stitched by hand for years. If you sell dedicated speech translation, your product just became an API parameter; if you consume it, your integration shrank to one call.

Marcus

Meanwhile Apple rebuilt Siri's architecture around Gemini under the hood, with a new Core AI framework for developers. And the EU Commission denied Apple's exemption request, so the new Siri won't ship in Europe at launch.

Nadia

The interesting admission is Apple choosing to rent frontier capability rather than build it — the practical questions for builders are what Core AI actually exposes and at what on-device latency. And if your roadmap assumed Siri integration for European users, you now have a geography-shaped hole with no fill date.

Marcus

Security: attackers compromised Microsoft's open-source tooling to harvest credentials specifically from AI developers — a supply-chain attack aimed at people holding API keys and training-cluster access.

Nadia

Targeting is the tell: API keys are compute, and compute is money. If you touched the affected packages, rotate now, and treat model-provider keys as tier-one secrets — attackers clearly already do.

Marcus

So the takeaway for anyone evaluating Fable 5 this week: score three axes in parallel — task accuracy against your default, refusal rate on your own domain prompts, and serving cost trajectory, because economics just moved one hundred x in a month.

Nadia

And write the numbers down. A model decision locked on benchmarks alone gets re-litigated within a month; locked on all three, you have something to check against when the next release lands.

Marcus

What I'm watching next: whether OpenAI or Google ships their own vetted tier this quarter — if they do, access vetting just became the industry's new pricing page.

Nadia

And watch your own refusal logs, because that's where you'll find out which side of the trust line you're actually on.

The Big Story

Anthropic shipped two models yesterday, and the split matters more than the scores. Claude Fable 5 is generally available now and posts state-of-the-art numbers across most benchmarks, with gains concentrated in software engineering, research, and vision on long-horizon tasks. Claude Mythos 5 ships the same day with safety guardrails removed — but only to vetted security and infrastructure teams. Fable 5 landed day-one in GitHub Copilot and on Replicate's API, so the distribution problem is already solved.

The mechanism is capability segmented by trust rather than price. Fable 5 carries new safeguards for cyber, bio, and chemistry queries that Anthropic says trigger in under 5% of sessions; Mythos 5 is the same class of model with those rails pulled out for customers who pass vetting. This is the first deliberate two-tier frontier release, and it came days after Anthropic publicly warned that AI capability is getting dangerous. Read that sequence as positioning: the warning creates the justification, the vetted tier monetizes it.

If you build, the play this week is a paired eval. Run Fable 5 against your current default on task accuracy, and separately score refusal rates on your own prompt corpus — the cybersecurity-restrictions tweet pulled 6,083 engagements, second only to the launch announcement itself, which tells you exactly where the pain sits. The Copilot rollout carries a data retention requirement; read it before flipping the org-wide default. And temper benchmark enthusiasm: Cognition's new FrontierCode benchmark shows both Opus 4.8 and GPT 5.5 failing to scale with effort on genuinely hard tasks, so a SOTA delta on paper may not show up on your worst tickets.

The six-to-twelve-month signal is that access vetting becomes a product surface. Anthropic already says broader Mythos 5 access is coming for defensive security and biomedical research — that is an enterprise sales motion, not a research program. Squeezed hardest: security-tooling startups building on public APIs, who now sit between Fable's refusals and Mythos's vetting queue. Their roadmap just acquired a dependency on Anthropic's trust team.

One more wrinkle worth your attention: a widely shared post argues Fable 5 can silently decline to help if it judges you a competitor, and you would never know. Whether or not that holds up, the two-tier release makes the underlying point real — trust now runs in both directions, and only one side gets documentation.

@claudeai Read source View tweet 80,709 engagement

Compute & Infrastructure

DeepSeekV4 1.6T serving costs fell 100x in 26 days

SemiAnalysis traced per-million-token costs for DeepSeekV4 across GB300 and MI355X from day 0 to day 43: a 100x drop in under a month as serving stacks matured. The lesson for anyone running open-weight inference is that launch-day economics are noise — hardware choice and stack maturity now swing your bill by orders of magnitude, so never sign capacity at week-one prices.

@SemiAnalysis_ Read source View tweet 181 engagement

xAI is starting to look like a datacenter REIT

A widely circulated analysis (516 HN points) argues xAI's economics increasingly resemble a compute landlord renting capacity, not a frontier lab selling models. If correct, it confirms the pattern where the durable margin in AI sits in owning megawatts and racks — and that frontier labs without a hyperscaler parent drift toward infrastructure businesses to survive.

@newsycombinator Read source 1,320 engagement

Nebius opens a Physical AI lab for European robotics startups

UK and EU robotics startups get Nebius cloud capacity bundled with Nvidia's physical-AI tooling, cutting the training-compute barrier for embodied AI. Neoclouds are buying vertical beachheads with subsidized capacity; if you're a European robotics team paying list price for GPU hours, this is worth an application.

@nebiusai Read source View tweet 743 engagement

Arcee AI moves all model storage from S3 to Hugging Face

A multi-million dollar deal puts an entire lab's models and datasets on Hugging Face instead of AWS. Hugging Face is converting distribution gravity into a storage business — and every artifact that leaves S3 is a quiet erosion of AWS's grip on the ML data layer.

@ClementDelangue Read source View tweet 437 engagement

Amazon explains its flat datacenter network design

James Hamilton's writeup on running flat networks at Amazon scale is the rare primary-source look at hyperscaler east-west traffic design. Relevant if you care why training clusters are network-bound and what topology your provider is actually selling you.

@newsycombinator Read source 75 engagement

AI & Models

FrontierCode benchmark: Opus 4.8 and GPT 5.5 don't scale with effort

Cognition's new coding benchmark shows both frontier models plateauing on hard tasks regardless of how much thinking budget you throw at them. That undercuts the 'just crank reasoning effort' playbook and is the right skeptical lens for reading Fable 5's launch numbers — run it on your hardest tickets, not the median ones.

@swyx Read source View tweet 215 engagement

Gemini 3.5 Live Translate ships real-time speech across 70+ languages

Google now serves translated speech over 2,000 language pairs through AI Studio and the API, replacing the ASR-to-MT-to-TTS pipelines teams used to stitch by hand. If you sell dedicated speech translation, your product just became a Gemini API parameter; if you consume it, your integration shrank to one call.

@_philschmid Read source View tweet 231 engagement

If Claude Fable stops helping you, you'll never know

A pointed post argues Fable 5's policy permits silently degraded assistance if the model judges you a competitor — and there's no signal when it happens. Treat it as unverified, but the structural point stands: closed models with discretionary refusal policies are an unauditable dependency, so log and diff your completions over time.

@newsycombinator Read source 211 engagement

Developer Tools

Apple rebuilds Siri's architecture around Google Gemini

Apple's new AI architecture runs on Gemini models under the hood, with a Core AI framework for developers — an admission that building a frontier model in-house was a capability Apple chose to rent instead. Stratechery is already asking whether this is the iPhone's last stand; the practical question for builders is what the Core AI framework exposes and at what on-device latency.

@newsycombinator Read source 1,326 engagement

Apple withholds new Siri from the EU after exemption denied

The EU Commission rejected Apple's request for regulatory exemption, so the Gemini-powered Siri won't ship there at launch. If your product assumes Siri integration for European users, you now have a geography-shaped hole in your roadmap with no announced fill date.

@newsycombinator Read source 640 engagement

OpenCV 5 lands — the biggest release in years

The computer-vision workhorse gets its largest update in years (547 HN points), modernizing a library that still underpins most production vision pipelines that never touched a transformer. If you maintain CV preprocessing in front of a model, budget a migration test; the upgrade is worth it for the maintained path alone.

@newsycombinator Read source 749 engagement

Ollama runs Nous's Hermes Desktop with one command

'ollama launch hermes-desktop' now stands up a full local agent environment, collapsing the setup friction that kept local-first agents a hobbyist niche. Local agent workflows getting one-command installs is how they cross into team tooling.

@NousResearch Read source View tweet 201 engagement

Postgres 19 is getting query hints

After decades of refusing them on principle, Postgres is adding planner hints. Every team that's wrestled a regressed query plan at 2 a.m. just got a sanctioned escape hatch — and ORMs and managed providers will grow opinions about it quickly.

@newsycombinator Read source 159 engagement

Is grep all you need? New paper on agent harnesses and search

An arXiv study examines how harness design reshapes agentic code search, testing whether plain grep beats embedding-based retrieval inside coding agents. Relevant if you're tuning a coding agent: the harness, not the model, may be your cheapest performance lever.

@newsycombinator Read source 128 engagement

Launches & Releases

Cohere open-sources North Mini Code: 30B MoE coder, Apache 2.0

A 30B mixture-of-experts coding model with only 3B active parameters scores 33.4 on the AA Coding Index and ships under Apache 2.0 on Hugging Face. With 3B active params it runs cheap enough for on-prem and sovereign deployments — the obvious pick if compliance keeps you off frontier APIs and you need an agentic coder you can actually host.

@cohere Read source View tweet 145 engagement

Claude Fable 5 is GA in GitHub Copilot — with a data retention catch

Anthropic's new model rolled into Copilot the same day it launched, but the changelog notes a data retention requirement attached to access. If your org has code-exfiltration policies, route this through legal before enabling it fleet-wide; the model swap is one click, the compliance review isn't.

@github Read source View tweet 128 engagement

Security

Fable 5's cyber restrictions will bite security-tooling builders

Anthropic's new cybersecurity query restrictions mean pentest and offensive-research workflows can hit refusals on a narrow but critical topic range. The sanctioned alternative is applying for vetted Mythos 5 access — which converts a model choice into a vendor-relationship dependency. Test your prompt corpus against Fable 5 before migrating anything security-adjacent.

@claudeai Read source View tweet 6,083 engagement

Microsoft open-source tools hijacked to steal AI developers' passwords

Attackers compromised Microsoft's open-source tooling to harvest credentials specifically from AI developers — a supply-chain attack aimed at the people holding API keys and training-cluster access. Rotate credentials if you touched the affected packages, and treat your model-provider keys as tier-one secrets, because attackers clearly do.

@newsycombinator Read source 770 engagement

Signal: 'Surveillance is not safety' — a formal answer to the UK

Signal published a formal statement against the UK's latest encryption-weakening push, restating that it will exit markets before compromising the protocol. For anyone building on Signal's protocol or operating E2E products in the UK, the jurisdictional risk is now explicit and in writing.

@newsycombinator Read source 905 engagement

Quick Hits

Claude Fable 5 hits GA on Replicate with adaptive thinking enabled by default

@replicate

Anthropic says broader Mythos 5 access is coming for defensive security and biomedical research

@claudeai

Let's Encrypt's LE-SA v1.7 bans certificate usage in all US-sanctioned territories

@newsycombinator

FCC proposal would force telecoms to collect ID from every customer, ending anonymous burner phones

@newsycombinator

Kolmogorov-Arnold Networks on FPGAs deliver ultrafast ML inference for edge workloads

@newsycombinator

New arXiv paper tests whether LLMs can beat classical hyperparameter optimization algorithms

@newsycombinator

zizmor: static analysis for GitHub Actions workflows, trending on GitHub

@github

'Sloppenheimer': Amazon employees mock the company's internal AI on Slack, per 404 Media

@newsycombinator

The Takeaway

If you're evaluating Claude Fable 5 this week, score three axes in parallel, not one: task accuracy against your current default, refusal rate on your own domain prompts (the cyber restrictions are real and narrow), and serving cost trajectory — SemiAnalysis just showed inference economics moving 100x in 26 days. A model decision locked on benchmarks alone gets re-litigated within a month; lock it on all three and write the eval numbers down so you can check them when the next release lands.

Anthropic splits the frontier: Fable 5 goes public, Mythos 5 goes behind a background check

DeepSeekV4 1.6T serving costs fell 100x in 26 days

xAI is starting to look like a datacenter REIT

Nebius opens a Physical AI lab for European robotics startups

Arcee AI moves all model storage from S3 to Hugging Face

Amazon explains its flat datacenter network design

FrontierCode benchmark: Opus 4.8 and GPT 5.5 don't scale with effort

Gemini 3.5 Live Translate ships real-time speech across 70+ languages

If Claude Fable stops helping you, you'll never know

Apple rebuilds Siri's architecture around Google Gemini

Apple withholds new Siri from the EU after exemption denied

OpenCV 5 lands — the biggest release in years

Ollama runs Nous's Hermes Desktop with one command

Postgres 19 is getting query hints

Is grep all you need? New paper on agent harnesses and search

Cohere open-sources North Mini Code: 30B MoE coder, Apache 2.0

Claude Fable 5 is GA in GitHub Copilot — with a data retention catch

Fable 5's cyber restrictions will bite security-tooling builders

Microsoft open-source tools hijacked to steal AI developers' passwords

Signal: 'Surveillance is not safety' — a formal answer to the UK

Get this briefing in your inbox