Google Drops Agent Skills Framework

The Rundown No. 110 · Audio Edition · 3 min All episodes RSS MP3

0:00 / 2:48

VTT

Marcus

Hey everyone, welcome to Builder's Briefing for June ninth, twenty twenty-six. I'm Alex, joined as always by Sam, and we've got a packed show today.

Nadia

Yeah, a lot happening. Google dropped a big open-source framework, Apple basically reinvented Siri at WWDC, DeepSeek is beating GPT again — it's one of those days where the landscape is actively shifting under your feet.

Marcus

So let's start with the biggest story. Google open-sourced a repo called google-slash-skills — a framework for building composable agent skills that plug directly into Google products. It shot past twenty-four hundred stars on GitHub basically overnight.

Nadia

That's wild. And what's interesting is this isn't just another SDK. It's Google saying, 'Here's the standard unit of what an agent can do.' Like, you wrap your integration as a skill, and suddenly it's portable across orchestration frameworks.

Marcus

Exactly. And there's a companion repo, ui-skills, that already has nearly thirteen hundred stars — that one's focused on design primitives, like layout generation and component creation. So if you're building anything that touches Google Workspace, Maps, or really any Google API surface, you should be cloning this today.

Nadia

The analogy I keep coming back to is npm for JavaScript. Google is trying to become the default skill registry for agents. And if that pattern sticks, you know Microsoft, AWS, Salesforce — they're all going to ship their own versions within six months. The people who adopt early get to shape how their integrations actually work before the standards harden.

Marcus

Right, and that ties directly into our other big AI news. Google is also rolling Gemini three-point-five Flash as the default model across AI Mode in Search and NotebookLM globally. So if you're building on Gemini APIs, Flash is now the baseline — test your prompts against it, especially function calling.

Nadia

And on the Apple side — WWDC twenty-six dropped Siri AI. It's not just a rebrand. Siri can now generate Shortcuts from plain English descriptions, it has image understanding, conversational flow. That's a huge deal for iOS developers.

Marcus

It really is. Because now your app's Shortcuts support becomes discoverable through conversation, not just buried in the Shortcuts gallery. Users say what they want, Siri wires up the multi-step workflow. That's a completely new distribution channel.

Nadia

Meanwhile, DeepSeek V4 Pro is beating GPT five-point-five Pro on precision benchmarks. Hacker News is buzzing about it — over two hundred points. The takeaway is that there's no single 'best model' anymore. It's totally task-dependent.

Marcus

Right, so if you're routing model calls by task type — code review, data extraction, anything precision-sensitive — DeepSeek is a real candidate now. And speaking of speed, Xiaomi's MiMo model just hit a thousand tokens per second on a one-trillion parameter model. That's a throughput milestone that could shift pricing and latency expectations across the entire industry.

Nadia

A thousand tokens per second at that scale is genuinely hard to wrap your head around. Real-time inference at frontier scale is becoming a thing, not just a research demo.

Marcus

Also worth a quick mention — Google AI Plus dropped to four ninety-nine a month with four hundred gigs of storage. That's impulse-buy territory. And for builders, it means your users are going to expect AI features as table stakes, not premium upsells.

Nadia

Yeah, the bar just keeps rising on what 'free' or 'cheap' gets you.

Marcus

Let's pivot to dev tools. Two things caught my eye on Hacker News. First, a deep dive into how Linear is so fast — nearly four hundred points. It covers their sync engine, optimistic updates, rendering pipeline, keeping everything under a hundred milliseconds.

Nadia

That's required reading if you're building any real-time collaborative SaaS. And then there's Performative UI — over four hundred points. It's a tongue-in-cheek React component library of design patterns that make things look fast. Skeleton screens, progress indicators, perceived-performance tricks.

Marcus

I love that. Because sometimes the best performance optimization is just making the user feel like things are fast. Worth stealing ideas from even if you don't use the library.

Nadia

One more tool — whichllm. It's a CLI that benchmarks local LLMs on your specific hardware and ranks by real performance, not just parameter count. If you're deploying local models for on-device or privacy-sensitive stuff, this saves you hours of trial and error.

Marcus

Alright, security. Two stories that should make you uncomfortable. First, Meta's AI support bot was exploited to hijack Instagram accounts. Hackers manipulated the automated customer service agent to change account recovery emails.

Nadia

That's a stark reminder. If you're building AI-powered support bots with any kind of account-level permissions, every state-changing action needs to be treated as a privileged operation. Out-of-band verification. No exceptions.

Marcus

And Cloudflare shipped something nice on the defensive side — real-time threat intel baked directly into their WAF rules through new cf-dot-intel fields. So you can block IPs and patterns flagged by their threat research team without wiring up any third-party feeds. Link in the briefing for both of those.

Nadia

That's actually really clean. Cloudflare keeps making the right thing the easy thing.

Marcus

Quick hits before we wrap up. D2 is a modern diagram scripting language, text to diagrams. Zig by Example hit the Hacker News front page if you're learning Zig. And someone built a Matter Wi-Fi light bulb in Rust on a Raspberry Pi Pico 2 W, which is just delightful.

Nadia

Oh, and WhatsApp caught new NSO Group spyware attacks that violate the court order. That story isn't going away.

Marcus

So here's the big takeaway from today. The dominant pattern is platforms standardizing how agents talk to their ecosystems. Google's Skills framework, Apple opening Siri to natural-language Shortcuts, Activeloop's Hivemind for shared agent memory — they're all pointing the same direction.

Nadia

The agent interface layer is being defined right now. Like, this week. If you're building tools or APIs, you should be investing in making your product agent-accessible — structured skill definitions, natural-language invocation, making your capabilities discoverable by orchestration frameworks.

Marcus

The builders who define their skill surface early are going to get embedded into agent workflows before their competitors even show up. That's the game right now.

Nadia

Define your skill surface. I think that might be the bumper sticker for twenty twenty-six.

Marcus

Love it. That's your Builder's Briefing for June ninth. All the links are in the show notes. Go build something, and we'll see you tomorrow.

The Big Story

Google Drops Agent Skills Framework — 2,400+ Stars in Hours

Google just open-sourced `google/skills`, a framework for building agent skills that plug into Google products and services. It shot to 2,400+ stars on GitHub overnight, making it the most-engaged developer release of the day by a wide margin. Alongside it, `ibelick/ui-skills` landed at 1,290+ stars with a design-engineer focus. These aren't just libraries — they're the clearest signal yet that the major platforms are standardizing how agents interact with their ecosystems through composable, portable skill definitions.

What you can do right now: if you're building agents that touch Google Workspace, Maps, or any Google API surface, clone the repo and start wrapping your integrations as skills. The abstraction layer means your agent logic becomes reusable across orchestration frameworks. The `ui-skills` companion repo is worth checking if you're shipping AI-powered design tools — it provides skill primitives specifically for layout, component generation, and design system interactions.

What this signals: Google is making a play to become the default skill registry for agents, the same way npm became the default for JavaScript packages. If this pattern sticks, expect every major platform (Microsoft, AWS, Salesforce) to ship their own skill specs within 6 months. Builders who adopt early get the advantage of shaping how their integrations work before the standards calcify. The agent ecosystem is moving from 'every framework rolls its own tool format' to 'skills as a shared unit of agent capability.' Build to that interface now.

@github Read source 2,405 engagement

AI & Models

Gemini 3.5 Flash becomes default in Google Search and NotebookLM

Google is rolling Gemini 3.5 Flash as the default across AI Mode globally, plus NotebookLM gets the upgrade with chat-based source building. If you're building on Gemini APIs, expect the Flash model's improved coding and agent performance to be the new baseline — test your prompts against it now, especially if you're using function calling.

@Google Read source View tweet 92 engagement

Apple rebrands Siri as 'Siri AI' with image understanding and natural-language shortcuts

WWDC26's big reveal: Siri now generates Shortcuts from plain English descriptions, adds image understanding, conversational flow, and Vision OS object recognition. If you build iOS automations or Shortcuts integrations, this opens a new channel — users can now describe multi-step workflows and Siri will wire them up, meaning your app's Shortcuts support becomes discoverable through conversation, not just the Shortcuts gallery.

@MKBHD Read source View tweet 1,902 engagement

DeepSeek V4 Pro beats GPT-5.5 Pro on precision benchmarks

HN is buzzing (213 points) about DeepSeek's V4 Pro outperforming GPT-5.5 Pro on precision metrics. If you're routing model calls by task type, this is another data point that the 'best model' answer is now task-dependent — consider precision-sensitive workloads (code review, data extraction) as candidates for DeepSeek.

@newsycombinator Read source 357 engagement

MiMo-v2.5-Pro-UltraSpeed hits 1,000 tokens/sec on a 1T parameter model

Xiaomi's MiMo achieves 1,000 TPS on their 1T model, a throughput milestone that makes real-time inference at frontier scale actually viable. If you're building latency-sensitive applications and have been constrained by token generation speed, watch this architecture — it may force pricing and speed expectations across all providers to shift.

@newsycombinator Read source 622 engagement

v0 Max upgrades to Claude Opus 4.8

Vercel's AI UI generator now runs on the latest Claude Opus 4.8. If you're using v0 for prototyping, the upgraded model should produce more accurate component code and better handle complex layout instructions.

@v0 Read source View tweet 67 engagement

Activeloop Hivemind: open-source agent orchestration with shared memory

New OSS framework lets you sync successful AI prompting traces across coding tools (Cursor, Copilot, etc.), creating shared team memory for agent interactions. If your team uses multiple AI coding assistants, Hivemind could eliminate the 'everyone re-discovers the same prompt tricks' problem.

@DataChaz Read source View tweet 49 engagement

Perplexity publishes Harvard study on Computer agent efficiency

Real-world data showing Perplexity's Computer agent is more cost and time-efficient for cross-disciplinary research tasks. If you're evaluating search-augmented agents for knowledge work, this study gives you actual deployment numbers to benchmark against.

@AravSrinivas Read source View tweet 92 engagement

Google AI Plus drops to $4.99/month with 400GB storage

Google's consumer AI subscription just got cheap enough to be an impulse buy. For builders: this accelerates how fast your users will expect AI-powered features as table stakes, not premium upsells.

@9to5Google Read source View tweet 16 engagement

Developer Tools

How Linear is so fast: a technical breakdown

Deep-dive into Linear's performance architecture (379 HN points). Required reading if you're building any real-time collaborative SaaS — the piece covers their sync engine, optimistic updates, and rendering pipeline decisions that keep everything under 100ms.

@newsycombinator Read source 721 engagement

Performative-UI: React component library of design tropes that 'look fast'

Show HN with 434 points. A tongue-in-cheek but genuinely useful library of skeleton screens, progress indicators, and perceived-performance patterns. Worth stealing ideas from even if you don't use the library directly.

@newsycombinator Read source 632 engagement

whichllm: Find the best local LLM for your actual hardware

CLI tool that benchmarks local models on your specific machine and ranks by real performance, not parameter count. If you're deploying local models for on-device features or privacy-sensitive workloads, this saves hours of trial-and-error.

@github Read source 515 engagement

cli-printing-press: Generate agent-first CLIs from any API

Scans an API, finds competing tools, merges the best features, and generates a CLI designed for AI agents with SQLite sync and offline search. Wild approach — if you maintain a public API, run this against it to see what an ideal agent-facing interface looks like.

@github Read source 445 engagement

Intuned (YC S22): Reliable browser automations as code

Launch HN for a browser automation platform that emphasizes reliability over AI magic. If you've been burned by flaky Playwright or Puppeteer scripts in CI, this is worth evaluating — they're targeting the 'it works locally but breaks in prod' problem.

@newsycombinator Read source 149 engagement

Are you expected to run five Python type-checkers now?

Pyrefly's blog post addresses the Python type-checker fragmentation (mypy, pyright, pytype, pyre, pyrefly). If you're setting up a new Python project, this is a good primer on which checker actually matches your needs instead of running all of them.

@newsycombinator Read source 223 engagement

Apple announces macOS 27 Golden Gate at WWDC26

New macOS version announced with minimal technical details so far. Watch for developer betas — if you ship Mac apps, start testing compatibility early rather than waiting for the public beta cycle.

@verge Read source View tweet 32 engagement

Security

Cloudflare ships real-time threat intel as WAF rules via cf.intel fields

Cloudforce One threat data is now available as live WAF blocking rules. If you're on Cloudflare, you can now write rules using `cf.intel` fields to block IPs and patterns flagged by their threat research team — no third-party feed integration needed.

@Cloudflare Read source View tweet 84 engagement

Meta's AI support bot was exploited to hijack Instagram accounts

Hackers manipulated Meta's automated AI customer service to change account recovery emails. A stark reminder: if you're building AI-powered support bots with account-level permissions, treat every state-changing action as a privileged operation requiring out-of-band verification.

@KirkDBorne Read source View tweet 7 engagement

Troy Hunt: Data breach disclosure lag is getting worse after 1,000 breaches

Analysis of 1,000 breaches shows companies are taking longer to disclose, not shorter. If you handle user data, audit your own incident response playbook — regulators are watching this trend.

@newsycombinator Read source 130 engagement

Infrastructure & Cloud

Europe's accelerating shift away from US Big Tech, mapped

WIRED documents European governments and companies actively planning migrations from US platforms. If you're building B2B tools for the EU market, sovereign cloud and data residency aren't nice-to-haves anymore — they're becoming procurement requirements.

@WIRED Read source View tweet 12 engagement

Hyperlight: embeddable micro-VMs for safe untrusted code execution

Lightweight VMM designed to run untrusted code inside your application with minimal latency. If you're building plugin systems, sandboxed code execution for AI agents, or multi-tenant compute — this is a cleaner alternative to containers for short-lived, untrusted workloads.

@github Read source 120 engagement

Startups & Funding

SemiAnalysis: Unitree's iteration speed could dominate humanoid robotics

Deep analysis of how Unitree's rapid development cycles and cost structure give them a structural advantage over US robotics competitors. If you're building software for humanoid robots, bet on platforms with fast hardware iteration — Unitree's ecosystem may be the one to target.

@SemiAnalysis_ Read source View tweet 209 engagement

Quick Hits

D2: modern diagram scripting language (text-to-diagrams)

@github

Pake: Turn any webpage into a desktop app with one command

@github

Zig by Example — learning resource hits HN front page

@newsycombinator

Matter Wi-Fi light bulb in Rust on Raspberry Pi Pico 2 W

@newsycombinator

WhatsApp catches new NSO Group spyware attacks violating court order

@TechCrunch

Benedict Evans on a16z: AI coding agents finding product-market fit

@a16z

Gluetun: VPN client in a thin Docker container for multiple providers

@github

Google Search now generates interactive UI via Antigravity framework

@Google

The Takeaway

Today's dominant pattern: platforms are standardizing how agents talk to their ecosystems. Google's Skills framework, Apple opening Siri to natural-language Shortcuts creation, and Activeloop's Hivemind all point the same direction — the agent interface layer is being defined right now. If you're building tools or APIs, invest today in making your product agent-accessible: expose structured skill definitions, support natural-language invocation, and make your tool's capabilities discoverable by orchestration frameworks. The builders who define their 'skill surface' early will get embedded into agent workflows before competitors show up.

Google Drops Agent Skills Framework, 2,400+ Stars in Hours

Google Drops Agent Skills Framework — 2,400+ Stars in Hours

Gemini 3.5 Flash becomes default in Google Search and NotebookLM

Apple rebrands Siri as 'Siri AI' with image understanding and natural-language shortcuts

DeepSeek V4 Pro beats GPT-5.5 Pro on precision benchmarks

MiMo-v2.5-Pro-UltraSpeed hits 1,000 tokens/sec on a 1T parameter model

v0 Max upgrades to Claude Opus 4.8

Activeloop Hivemind: open-source agent orchestration with shared memory

Perplexity publishes Harvard study on Computer agent efficiency

Google AI Plus drops to $4.99/month with 400GB storage

How Linear is so fast: a technical breakdown

Performative-UI: React component library of design tropes that 'look fast'

whichllm: Find the best local LLM for your actual hardware

cli-printing-press: Generate agent-first CLIs from any API

Intuned (YC S22): Reliable browser automations as code

Are you expected to run five Python type-checkers now?

Apple announces macOS 27 Golden Gate at WWDC26

Cloudflare ships real-time threat intel as WAF rules via cf.intel fields

Meta's AI support bot was exploited to hijack Instagram accounts

Troy Hunt: Data breach disclosure lag is getting worse after 1,000 breaches

Europe's accelerating shift away from US Big Tech, mapped

Hyperlight: embeddable micro-VMs for safe untrusted code execution

SemiAnalysis: Unitree's iteration speed could dominate humanoid robotics

Get this briefing in your inbox