Google Drops Gemma 4 Open Models + Qwen 3.6-Plus Ships Agent-First Architecture
Gemma 4 and Qwen 3.6-Plus drop, OmX explodes for Codex, Google ships ADK-Go, LinkedIn scanning scandal, and the agent stack crystallizes.
Good morning and welcome to Builder's Briefing for April third, twenty twenty-six. I'm Alex, joined as always by Sam, and folks — this week is stacked. Two major open model drops, a wave of agent tooling, some security stories that'll make you audit your code, and Artemis Two is literally beaming 4K from the moon.
Yeah, it's one of those weeks where you open your feed and just go, okay, I guess I'm not getting anything else done today. Let's get into it.
So the big story — two massive open model releases within forty-eight hours of each other. Google dropped Gemma 4, the next generation of their open-weight family, and Alibaba's Qwen team shipped Qwen 3.6-Plus with an explicit tagline: 'towards real world agents.'
That's interesting because both teams are basically saying the same thing at the same time. It's not about chat benchmarks anymore. It's about tool use, multi-step planning, long-context reliability — the stuff you actually need for agentic workflows.
Exactly. And Gemma 4 is self-hostable, fine-tunable, no API lock-in. If it follows the Gemma 3 trajectory, the smaller variants are going to punch way above their weight on performance-per-parameter. Meanwhile Qwen 3.6-Plus is explicitly optimized for agent pipelines, so if you're building autonomous systems, benchmark it against Claude and GPT on your actual task chains.
Right, and what's wild is AMD also launched their Lemonade server this same week — local inference on GPU plus NPU with an OpenAI-compatible API. So you can actually test these open models locally without any cloud spend. The timing is almost too perfect.
The pattern is unmistakable. Open models are converging on agent-grade reliability, and the message for builders is clear: design for model-agnostic orchestration now, because your switching costs are about to drop dramatically.
Don't marry the model, marry the orchestration layer. Got it.
Okay, staying in the AI world — there's a repo that now publishes extracted system prompts from basically every major model. GPT 5.4, Claude Opus 4.6, Gemini 3.1 Pro, Grok 4.2, all of them. Link in the briefing.
This is honestly the best free masterclass in prompt architecture. Studying how the providers themselves instruct their own models? That's the real cheat code. Also, a great reminder — never put secrets in your system prompts, because clearly they're not as secret as you think.
Related to that, there's a detailed analysis of Claude Code's leaked system prompt and internal architecture. It shows exactly how Anthropic structures tool use, safety rails, and context management. If you're building on Claude Code, you should understand the guardrails you're working within.
And the ones you can push on. Yeah, that's a must-read for anyone doing serious Claude Code work.
One more on the AI side — r/programming has temporarily banned all LLM discussion.
Okay, that's a vibe check on the whole industry right there. Community fatigue is real, but for builders the signal is just: find your audience where AI-assisted development is already the default. Don't waste energy debating legitimacy in spaces that have already made up their minds.
Alright, developer tools — and there's a lot. The biggest GitHub engagement this week, over fourteen thousand stars, is OmX — think oh-my-zsh but for your coding agent. It adds plugin hooks, multi-agent coordination, and heads-up displays on top of OpenAI Codex.
So it turns Codex from a solo tool into an orchestrated team. That's exactly the kind of tooling that makes sense when you look at where everything is heading. You're not going to have one agent doing everything — you need coordination.
And Google shipped ADK-Go — their official Go toolkit for building, evaluating, and deploying AI agents. Code-first, not YAML-first. If your backend is Go and you've been jealous of Python's agent framework ecosystem, Google just met you where you are.
About time, honestly. The Go community has been underserved on this front. I also want to shout out Rivet Actors — actor-model primitives specifically for stateful agents. If you need agents that maintain state across sessions, it's lower-level and more composable than full frameworks. Really elegant approach.
And one fun one — Git Bayesect. Standard git bisect assumes deterministic pass-fail, right? This uses Bayesian inference to handle flaky tests and non-deterministic failures.
Okay, whose CI doesn't have flaky tests? This is the bisect upgrade nobody knew they needed. I love it.
Security corner. LinkedIn is being accused of illegally scanning your computer. BrowserGate.eu alleges unauthorized local system scanning, and this hit over eleven hundred points on Hacker News, so developers are very much paying attention.
If you embed LinkedIn SDKs or OAuth flows, audit what's actually running on the client side. And watch for regulatory fallout in the EU — this could get messy.
Also worth flagging — subscription bombing. There's a great breakdown of how attackers weaponize your email signup forms to flood victims with confirmation emails. If you run any registration flow, implement rate limiting and double opt-in now. This attack vector is cheap and it's growing.
It's one of those attacks that's so simple it's embarrassing when it works. And it almost always works because nobody thinks about it until it happens to them.
Quick startup drama — TechCrunch reports that YC startup Delve allegedly forked an open-source tool and sold it as its own. The community found out, and the reputational damage has been instant.
Cautionary tale. If your product is a thin wrapper on open-source, your community will find out. Open-source folks have long memories and very good git skills.
Quick hits — Artemis Two has launched and NASA is live-streaming 4K lunar footage via laser at two hundred and sixty megabits per second. Sweden reversed their screen-first education policy, swapping tablets back for textbooks. Steam on Linux crossed five percent market share for the first time. And renewables have hit nearly fifty percent of global electricity capacity.
The Artemis one is just incredible to me. 4K from the moon over laser link. We live in the future. And Steam on Linux at five percent — slow and steady wins the race, I guess.
So here's the takeaway for the week. The agent infrastructure stack is crystallizing fast. OmX adds orchestration to Codex, Google ships ADK-Go, Rivet gives you stateful actor primitives, Qwen targets real-world agents, Gemma 4 is another self-hostable option. The switching costs between models are collapsing while the tooling to coordinate multiple agents is exploding.
So the highest-leverage move right now is locking in your orchestration patterns, not your model provider. Design for model-agnostic, design for multi-agent, and you'll be able to swap in whatever's best six months from now without rewriting your whole stack.
That's the move. Alright, that's your Builder's Briefing for April third. All the links are in the show notes. We'll be back next time with more from the frontier.
Go build something great. See you next time.
Two major open model drops in the same 48 hours. Google released Gemma 4, the next generation of their open-weight model family, while Alibaba's Qwen team shipped Qwen 3.6-Plus with an explicit "towards real world agents" tagline. Both signal that the frontier of open models has decisively shifted from chat benchmarks to agentic capability — tool use, multi-step planning, and long-context reliability.
For builders, this is immediately actionable. Gemma 4 gives you a Google-pedigree model you can self-host and fine-tune without API lock-in — and if it follows the Gemma 3 trajectory, expect strong performance-per-parameter especially on the smaller variants. Qwen 3.6-Plus is explicitly optimized for agent workflows, which means if you're building autonomous pipelines, it's worth benchmarking against Claude and GPT on your actual task chains, not just vibes. AMD's new Lemonade server (also launched this week) makes local inference on GPU+NPU a real option for testing these models without cloud costs.
The pattern over the next 6 months is clear: open models are converging on agent-grade reliability, and the tooling ecosystem (OmX for Codex, Google's ADK-Go, Rivet Actors for stateful agents) is racing to match. If you're building agent systems, your model switching cost is about to drop dramatically. Design for model-agnostic orchestration now.
System Prompts Leaked for GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro, Grok 4.2, and More
A regularly-updated repo now publishes extracted system prompts from every major model. If you're designing prompts or building on top of these APIs, studying how the providers themselves instruct their models is the best free masterclass in prompt architecture available. Also a reminder: never put secrets in system prompts.
The Claude Code Leak: What the Extracted Internals Reveal
Detailed analysis of Claude Code's leaked system prompt and internal architecture shows how Anthropic structures tool use, safety rails, and context management. Builders using Claude Code should read this to understand the guardrails they're working within — and the ones they can push on.
Skill Seekers: Convert Docs, Repos, and PDFs into Claude AI Skills
Automatically transforms documentation websites, GitHub repos, and PDFs into structured Claude skills with conflict detection. If you're building internal tools on Claude and tired of manually stuffing context, this is a structured alternative to raw RAG.
GLM-OCR: Fast, Accurate, Comprehensive OCR Model
New OCR model from ZAI promising accuracy and speed in a single package. If you're still piping documents through multi-step OCR→LLM chains, benchmark this — integrated vision-language OCR models are eating the traditional pipeline.
AMD Launches Lemonade: Open Source Local LLM Server Using GPU and NPU
AMD now has a first-party local inference server that leverages both GPU and NPU on their hardware. This matters for anyone building desktop AI features or wanting to prototype against open models without cloud spend. OpenAI-compatible API included.
r/programming Bans All LLM Programming Discussion
The subreddit's temporary ban on LLM content reflects growing community fatigue, but for builders it's a signal: the discourse is splitting into two worlds. Find your audience where they already accept AI-assisted development as default, not where they're debating its legitimacy.
OmX (Oh My Codex): Hooks, Agent Teams, and HUDs for OpenAI Codex
The biggest GitHub engagement this week (14K+). OmX adds plugin hooks, multi-agent coordination, and heads-up displays on top of Codex. Think oh-my-zsh but for your coding agent — if you're using Codex heavily, this turns it from a solo tool into an orchestrated team.
Google Ships ADK-Go: Code-First Agent Toolkit in Go
Google's official Go toolkit for building, evaluating, and deploying AI agents. If your backend is Go and you've been jealous of Python's agent framework ecosystem, this is Google meeting you where you are. Code-first, not YAML-first.
Git Bayesect: Bayesian Bisection for Non-Deterministic Bugs
Standard git bisect assumes deterministic pass/fail. This tool uses Bayesian inference to handle flaky tests and non-deterministic failures. If your CI has flaky tests (whose doesn't), this is the bisect upgrade you didn't know you needed.
Open Claude Cowork: Open Source Claude Cowork with 500+ SaaS Integrations
Composio released an open-source alternative to Claude Cowork that plugs into 500+ SaaS apps. If you want Claude-style collaborative AI workflows without vendor lock-in, this is your starting point for self-hosted agent-to-SaaS bridges.
Rivet Actors: Primitives for Stateful AI Agent Workloads
Rivet provides actor-model primitives purpose-built for stateful agents, collaborative apps, and durable execution. If you're building agents that need to maintain state across sessions, this is lower-level and more composable than full frameworks.
New C++ Backend for OCaml Compiler (ocamlc)
A PR adding a C++ code generation backend to the OCaml compiler landed with 151 HN points. Niche but significant — if you're in the OCaml ecosystem, this opens interop possibilities and alternative compilation strategies.
PraisonAI: Low-Code Multi-Agent Teams for Telegram, Discord, and WhatsApp
Deploy multi-agent AI teams that deliver results directly to messaging platforms. Includes handoffs, guardrails, memory, RAG, and 100+ LLM support. If you need a quick prototype of an AI assistant that lives in a chat channel, this is the fastest path.
IBM and Arm Announce Strategic Collaboration for Enterprise Computing
IBM and Arm are partnering to bring Arm-based infrastructure to enterprise workloads. If you're planning cloud architecture, this signals Arm's continued push beyond edge/mobile into mainline enterprise — plan for Arm compatibility in your CI/CD pipeline now.
DRAM Pricing Is Killing the Hobbyist SBC Market
Jeff Geerling's analysis shows DRAM costs making Raspberry Pi-class boards increasingly unaffordable for hobbyists. If you sell developer hardware or IoT products, factor in that your bill of materials for memory-intensive boards is heading in the wrong direction.
Lance Format: Open Lakehouse for Multimodal AI with 100x Faster Random Access
Convert from Parquet in 2 lines for dramatically faster random access, built-in vector index, and data versioning. If you're building multimodal AI pipelines and drowning in Parquet files, this is worth benchmarking as your storage layer.
LinkedIn Accused of Illegally Scanning Your Computer
BrowserGate.eu alleges LinkedIn is performing unauthorized local system scanning. 1,143 HN points says developers are paying attention. If you embed LinkedIn SDKs or OAuth flows, audit what's actually running — and watch for regulatory fallout in the EU.
Subscription Bombing: Your Signup Form Is a Weapon
Detailed breakdown of how attackers weaponize your email signup forms to flood victims with confirmation emails. If you run any kind of registration flow, implement rate limiting and double opt-in now — this attack vector is cheap and growing.
Signing Data Structures the Wrong Way: Domain Separation in IDL
Technical deep-dive on a subtle but critical mistake in cryptographic signing of structured data. If you're implementing signed payloads or message authentication, this is a must-read to avoid the domain separation pitfalls that lead to signature forgery.
YC Startup Delve Allegedly Forked an Open-Source Tool and Sold It as Its Own
TechCrunch reports Delve's reputation has worsened after accusations of repackaging open-source code. Cautionary tale for any startup: if your product is a thin wrapper on OSS, your community will find out, and the reputational damage is instant and lasting.
Claude Code Notifications Plugin: Cross-Platform Alerts with Zero Dependencies
Smart notifications for Claude Code sessions — 6 notification types, click-to-focus, webhook support for ntfy/Slack/Telegram. If you run long Claude Code sessions and lose track, this solves the 'is it done yet?' problem in one line.
Modern SQLite: Features You Didn't Know It Had
Solid roundup of SQLite capabilities most developers don't use — JSON functions, window functions, generated columns, and more. If you're choosing between SQLite and a heavier database for your next project, check what SQLite already does before adding complexity.
The agent infrastructure stack is crystallizing fast. This week alone: OmX adds orchestration to Codex, Google ships ADK-Go for agent builders, Rivet provides stateful actor primitives, Qwen 3.6-Plus targets real-world agents, and Gemma 4 drops as another self-hostable option. If you're building AI-powered products, the highest-leverage move right now is designing your agent layer to be model-agnostic and orchestration-ready — because the switching costs between models are collapsing while the tooling to coordinate multiple agents is exploding. Lock in your orchestration patterns, not your model provider.