Open Source AI Coding Agents Hit Critical Mass: OpenCode and Google's Stitch Drop Same Week
Open source AI coding agents hit critical mass. OpenCode + Google Stitch ship same week. Plus vLLM omni-modal, Mamba-3, and Deno's decline.
Hey everyone, welcome to Builder's Briefing for March twenty-second, twenty-twenty-six. I'm Alex, joined as always by Sam. We've got a packed one today — open source coding agents hitting a tipping point, some really interesting model drops, and a Rust-versus-TypeScript performance story that might change how you think about WASM.
Yeah, and someone built a Bluesky client in Fortran, so we'll get to that too. Let's go.
Alright, the big story. Two serious open-source AI coding agents dropped in the same week. OpenCode — which blew up on Hacker News, over six hundred fifty points — is positioning itself as the open-source Cursor alternative. It's terminal-native, MIT-licensed, and you fully own it. And then Google Labs quietly shipped Stitch Skills on GitHub, their own open-source coding agent with a composable skills architecture.
Okay, so what jumps out to me is how different the design philosophies are. OpenCode is very much a CLI-first tool — if your team already lives in the terminal, it just slots right in. Stitch is doing this modular plugin-style thing where you compose skills together. It's almost like Google is betting that the agent itself is less important than the building blocks.
Exactly. And both are MIT-licensed, both ready to self-host today. So if you've been paying twenty bucks a month for a proprietary coding assistant, you now have two credible alternatives with no vendor lock-in. You can wire them into your own CI/CD pipelines, customize them however you want.
Right, and what's wild is the implication for the next six months. Coding agents are commoditizing fast. The moat isn't the agent anymore — it's the context you feed it. Your codebase, your conventions, your deployment pipeline. That's where the value lives now.
If you're building developer tools or internal platforms, seriously evaluate both of these this week. Links in the briefing.
Okay, let's talk models. A few notable drops. The vLLM project launched vllm-omni — basically a unified inference path for text, image, audio, and video. If you're already using vLLM in production, this is huge because you don't have to stitch together separate pipelines anymore.
That's interesting because multimodal serving has been such a pain point. Everyone's been cobbling together different services for different modalities. Having one inference path through vLLM could really simplify production stacks.
Also worth flagging — Mamba-3 dropped from Together AI. Each generation of these state-space models closes the quality gap with transformers, and if you're doing inference on long sequences where attention costs are killing your margins, Mamba's linear-time inference is looking more and more viable.
And Meta published research on machine translation covering sixteen hundred languages. Sixteen hundred! That's approaching coverage for a lot of low-resource languages that have basically been invisible to translation tech until now.
Yeah, watch for model weights to follow on that one. If you're building for global markets, this is a signal worth tracking.
Alright, developer tools. This one's my favorite story of the day. The OpenUI team had a parser written in Rust compiled to WASM for the browser. They rewrote it in plain TypeScript — and it got faster.
Okay, I need to unpack that because on the surface it sounds like heresy. But the takeaway isn't that Rust is slow. It's that the WASM serialization boundary — crossing between JavaScript and WASM — has real overhead. For certain workloads, that overhead completely negates Rust's raw speed advantage.
Exactly. So the practical lesson: before you reach for Rust plus WASM for browser-side performance, benchmark the JavaScript-native path first. You might be surprised.
Also quickly — Iroh is worth a look if you're building peer-to-peer or local-first apps. It's a Rust networking stack where you dial cryptographic keys instead of IP addresses. NAT traversal just handled for you. That's a big deal for anyone who's spent weeks fighting hole-punching.
Oh, and Ubuntu twenty-six-oh-four will finally show asterisks when you type your sudo password. Only took forty-six years.
Ha! Update your onboarding docs, everyone. No more telling new developers 'don't worry, it's actually typing.'
Shifting gears — there's a concerning signal out of Deno. Reports of layoffs, questions about leadership visibility. The Hacker News thread has about a hundred seventy points and real accounts from affected people.
This isn't a death notice, but if you've bet your stack on Deno or Deno Deploy, it's time to at least have a migration plan in your back pocket. Risk management, not panic.
On the security side, the EFF put out a piece arguing that sites blocking Internet Archive crawlers to prevent AI training are really just hurting themselves. AI companies have plenty of other data sources — but the historical web record through the Archive is irreplaceable.
That's a really important nuance. If you maintain a robots.txt, be surgical about it. Blocking AI training crawlers is one thing, but blanket-blocking archive.org alongside them means you're erasing your own history from the web's collective memory.
Quick hits! Armin Ronacher — the creator of Flask — has a nice reflection out about why some things just take time in software. Worth a read. There's an FFmpeg one-oh-one guide resurfacing on Hacker News that's a solid reference if you ever touch video.
And someone built a terminal-only Bluesky client in Fortran. I just want to know who woke up and said, you know what this social network needs? Fortran.
Living legend behavior, honestly. Also check out Grafeo if you need an embeddable graph database without running full Neo4j — really interesting for AI apps and local-first tools. Links to everything are in the briefing.
So here's the takeaway for this week. The coding agent layer is commoditizing right now. OpenCode and Google Stitch both going open source means the value isn't in having an AI coding assistant — it's in what context and workflows you feed it. Integrate one of these open agents and focus your differentiation on your proprietary context.
And don't sleep on the TypeScript-versus-WASM lesson. Abstraction boundaries have real costs. Benchmark before you assume the fancier tool is the faster tool.
That's the briefing for March twenty-second. This coming week, keep an eye on whether more open-source coding agents emerge — I think we're at the start of a wave, not the end. Thanks for listening, everyone.
Go build something cool. See you next time.
Two serious open-source AI coding agents landed within days of each other. OpenCode (656 points, 287 comments on HN) positions itself as the open-source alternative to Cursor and Copilot — a terminal-native coding agent you own and can extend. Meanwhile, Google Labs quietly shipped Stitch Skills on GitHub, their own open-source coding agent built around a composable skills architecture. Both are MIT-licensed. Both are ready to run today.
What this means practically: if you've been paying $20/month for a proprietary coding assistant, you now have two credible alternatives you can self-host, customize, and integrate into your own CI/CD pipelines without vendor lock-in. OpenCode's terminal-first approach makes it particularly interesting for teams already embedded in CLI workflows. Stitch's skills-based architecture suggests Google is betting on modular, composable agent capabilities — think plugins for code agents rather than monolithic assistants. If you're building developer tools or internal platforms, both of these are worth evaluating this week.
The signal for the next six months: coding agents are commoditizing fast. The differentiation won't be in the base agent — it'll be in the workflows, context, and proprietary knowledge you wire into them. If you're building on top of coding agents, bet on the open ones and invest in your integration layer. The proprietary coding assistant moat just got a lot thinner.
vLLM Goes Omni-Modal with vllm-omni
The vLLM project launched vllm-omni, a framework for efficient inference with omni-modality models. If you're serving multimodal models in production and already use vLLM, this gives you a unified inference path for text, image, audio, and video inputs without stitching together separate pipelines.
MoonshotAI Ships Attention Residuals — A New Transformer Efficiency Trick
MoonshotAI open-sourced their Attention Residuals technique, which adds residual connections within the attention mechanism itself. If you're training or fine-tuning transformers, this is a drop-in improvement worth benchmarking on your workloads — early HN discussion suggests meaningful quality gains at minimal compute cost.
Meta's Omnilingual MT Covers 1,600 Languages
Meta published research on machine translation spanning 1,600 languages. If you're building for global or low-resource language markets, this signals that high-quality translation for niche languages is approaching production quality — watch for model weights to follow.
Mamba-3 Drops from Together AI
Together AI released Mamba-3, the next iteration of the state-space model challenger to transformers. If you're running inference on long sequences and transformer attention costs are killing your margins, Mamba's linear-time inference is worth another look — each generation closes the quality gap.
Rust WASM Parser Rewritten in TypeScript — And It Got Faster
OpenUI rewrote their Rust-compiled-to-WASM parser in plain TypeScript and saw performance improvements. The takeaway isn't "Rust bad" — it's that WASM serialization overhead can negate Rust's speed advantage for certain workloads. Before reaching for Rust+WASM in the browser, benchmark the JS-native path first.
Ghostling: A New Tool from the Ghostty Terminal Org
The Ghostty team shipped Ghostling, a lightweight companion tool from their terminal ecosystem. If you're in the Ghostty camp, this extends your workflow — check the repo for integration details.
Iroh: Dial Keys Instead of IP Addresses — Networking Stack in Rust
Iroh is a modular Rust networking stack that replaces IP addresses with cryptographic keys for peer addressing. If you're building P2P, local-first, or edge applications and NAT traversal is eating your time, this abstracts away the pain of direct connectivity.
Grafeo: Embeddable Graph Database in Rust
Grafeo is a fast, lean, embeddable graph database built in Rust. If you need graph queries without running Neo4j infrastructure — think embedded knowledge graphs for AI apps or local-first tools — this is worth a spike.
Ubuntu 26.04 Finally Shows Asterisks When You Type sudo Passwords
After 46 years of silent password entry, Ubuntu 26.04 will show asterisks during sudo. It's a tiny UX change but it'll affect every script and tutorial that says "don't worry, it's typing even though you can't see it." Update your onboarding docs if you maintain dev environment guides.
systemd Gets a Fresh Wave of Attention on GitHub
systemd is trending on GitHub again — likely tied to a significant release or patch cycle. If you're managing Linux services in production, check the recent commits for changes that might affect your unit files or boot configurations.
Deno in Trouble: Layoffs and Leadership Questions
Deno is reportedly declining with layoffs and CEO visibility issues. If you bet your stack on Deno or Deno Deploy, this is a risk signal — not a death notice, but worth having a migration plan. The HN thread (172 points, 111 comments) has real accounts from affected people.
EFF: Blocking Internet Archive Won't Stop AI, Will Erase the Web's History
The EFF argues that sites blocking Internet Archive crawlers to prevent AI training are shooting themselves in the foot — AI companies have other data sources, but the historical web record is irreplaceable. If you maintain robots.txt, think carefully before blanket-blocking archive.org alongside AI crawlers.
The coding agent layer is commoditizing this month — OpenCode and Google Stitch both going open source means the value is no longer in having an AI coding assistant, it's in what context and workflows you feed it. If you're building developer tools or internal platforms, integrate one of these open agents now and focus your differentiation on proprietary context (your codebase, your conventions, your deployment pipeline). Also: before defaulting to Rust+WASM for browser-side performance, benchmark the TypeScript-native path — the OpenUI team's results are a useful reminder that abstraction boundaries have real costs.