Cursor Ships Real-Time RL for Composer, Your AI Coding Agent Now Learns As You Edit

The Rundown No. 44 · Audio Edition · 3 min All episodes RSS MP3

0:00 / 2:34

VTT

Marcus

Good morning, welcome to the Builder's Briefing for March 29th, 2026. I'm Alex, joined as always by Sam. We've got a packed one today — Cursor ships real-time reinforcement learning, a supply chain attack hits PyPI, and CERN is literally burning AI models into silicon.

Nadia

Yeah, there's a theme today around AI agents growing up — better learning, better sandboxing, better debugging. Let's get into it.

Marcus

So the big story — Cursor just published a deep technical post on how they're applying real-time reinforcement learning to their Composer agent. Instead of the usual batch fine-tuning on historical data, they're running RL loops that use live feedback. Whether you accept a diff, revert it, or modify it — that signal goes directly back into the model.

Nadia

That's a huge deal because every other AI coding tool I've used feels kind of frozen in time, right? It makes the same mistakes on Tuesday that it made on Monday. This is the tool actually learning your patterns as you work.

Marcus

Exactly. And the blog post goes into real detail on reward shaping and latency tradeoffs, so it's not just a marketing announcement. If you're building any kind of agent workflow, this feedback loop pattern is the one to study.

Nadia

Right, and what's wild is the implication for the next six months. Every serious AI dev tool is going to need some form of online learning or they're going to feel stale by comparison. If you're building agents, the advice here is: start instrumenting your feedback signals now, even if you're not running RL yet. That data becomes your moat.

Marcus

Perfectly said. Now shifting to AI and models — Stanford put out research this week confirming that AI sycophancy is a real product problem. Current models over-affirm users, especially when people are seeking personal advice. They'll basically tell you what you want to hear.

Nadia

This is one of those things that sounds funny until you realize people are building coaching apps and advisory products on top of these models. If your AI therapist just agrees with everything, that's not a feature — that's a liability. You need explicit disagreement mechanisms baked into your system prompts and evals.

Marcus

Also in the AI space — CERN is deploying tiny ML models baked directly into FPGAs for real-time particle physics data filtering. Sub-microsecond inference.

Nadia

Okay, I love this one. Most of us aren't smashing protons, but the techniques — aggressive quantization, hardware-aware architecture search — those are directly applicable if you're doing anything with edge inference or latency-critical ML pipelines. There's a link in the briefing.

Marcus

And two quick ones for the agent builders — Stanford's JAI project proposes giving AI agents their own sandboxed filesystem instead of raw access to yours. And Agentation is a new open-source visual debugging tool for agent workflows.

Nadia

JAI is solving the 'my agent deleted my dot-env file' class of disaster, which — if you've been there, you know the pain. And Agentation — if you've ever stared at agent logs trying to figure out why it looped forty-seven times, this gives you an actual visual trace. Both early, both worth watching.

Marcus

On the dev tools side, a couple things caught my eye. Cocoa-Way is a native macOS Wayland compositor that lets you run Linux GUI apps seamlessly on your Mac without spinning up a VM.

Nadia

That's a massive quality-of-life upgrade for anyone doing cross-platform work. I've lost hours to VM nonsense just to test a Linux-native tool. This is one of those things that should've existed years ago.

Marcus

And Velxio two-point-oh now lets you emulate Arduino, ESP32, and Raspberry Pi 3 entirely in the browser. Great for teaching, prototyping, or CI-testing firmware without physical boards.

Nadia

That's interesting because it completely eliminates the hardware dependency. I could see this being huge for education especially — imagine a classroom where every student can prototype on a virtual ESP32 without buying anything.

Marcus

Okay, security — this one's urgent. The Telnyx Python SDK on PyPI was compromised in a supply chain attack. If you use Telnyx for telephony or SMS in your stack, audit your installed version immediately.

Nadia

Pin your dependencies, people. This is like the third or fourth major PyPI supply chain incident in recent memory. And the broader action item here — add SBOM generation to your CI pipeline. Anchore's Syft tool is also trending today, and it does exactly that. Generates a full software bill of materials from your containers so you actually know what's running in production.

Marcus

And a fun one — someone decompiled the White House's new app and published their findings. Beyond the entertainment value, it's a solid reminder: anything you ship as a native app will be decompiled. Keep your secrets server-side.

Nadia

Always. Treat client-side code as public. No exceptions.

Marcus

Quick hits — Spanish legislation is now being tracked as a Git repo. Version control for laws is a fascinating pattern. AMD's Ryzen 9 ninety-nine-fifty X3D2 packs two hundred and eight megabytes of cache into one chip, which makes local LLM inference a lot more interesting.

Nadia

And my two favorites — someone trained a transformer on a nineteen seventy-six minicomputer using paper tape. And there's an open-world engine built for the N64. Both are just incredible systems programming showcases. Links in the briefing for those.

Marcus

So the big takeaway today — three patterns are converging around AI agents. They need better sandboxing, that's JAI. Better observability, that's Agentation. And better learning loops, that's Cursor's real-time RL. If you're building agent-powered products, stop treating the agent as a black-box API call.

Nadia

Instrument your feedback loops, isolate agent side effects from production state, and build visual debugging from day one. And separately — get SBOM generation into your CI pipeline before you're the one writing the incident report.

Marcus

That's the briefing for March 29th. If anything caught your eye, links are in the show notes. We'll be back tomorrow with more.

Nadia

Go build something great — and pin your dependencies. See you next time.

The Big Story

Cursor Ships Real-Time RL for Composer — Your AI Coding Agent Now Learns As You Edit

Cursor published a detailed technical post on applying real-time reinforcement learning to their Composer agent. Instead of batch-training on historical data, they're running RL loops that incorporate live feedback signals — whether a diff gets accepted, reverted, or modified — directly into model updates. This is a meaningful shift from the static fine-tuning approach most AI coding tools rely on today.

For builders using Cursor (or any AI coding assistant), the immediate impact is that Composer should get noticeably better at matching your editing patterns over time. But the bigger signal is architectural: real-time RL on user interaction data is becoming the differentiator between AI tools that feel static and ones that compound in usefulness. If you're building agent workflows — coding or otherwise — this is the feedback loop pattern to study. The blog post includes enough detail on reward shaping and latency tradeoffs to be genuinely useful.

What this signals for the next six months: expect every serious AI dev tool to ship some form of online learning. The tools that can close the loop between agent action and user correction fastest will win. If you're building agents, instrument your feedback signals now — even if you're not running RL yet, the data you collect today becomes your moat tomorrow.

@newsycombinator Read source View tweet 115 engagement

AI & Models

Stanford: AI Sycophancy Is a Real Product Problem, Not Just a Meme

Two separate pieces this week (Stanford research + The Register) confirm that current models over-affirm users seeking personal advice, reinforcing bad decisions. If you're building any advisory or coaching product on top of LLMs, you need explicit disagreement mechanisms in your system prompts and eval suites — sycophancy erodes user trust the moment they realize it's happening.

@newsycombinator Read source View tweet 849 engagement

Onyx: Open-Source AI Chat Platform That Works with Every LLM

Onyx hit 4.3K engagement on GitHub — it's an open-source AI chat platform with RAG, permissions, and multi-LLM support out of the box. If you're still duct-taping Langchain + a vector DB + auth for internal AI tools, this is worth evaluating as a drop-in replacement that handles the boring plumbing.

@github Read source View tweet 4,350 engagement

Stanford's JAI: A Filesystem-Level Sandbox for AI Agents

JAI proposes giving AI agents their own sandboxed filesystem abstraction instead of raw access to yours. If you're running agents that create, modify, or delete files (code gen, data pipelines), this isolation pattern prevents the 'agent deleted my .env' class of disasters. Worth studying even if you roll your own.

@newsycombinator Read source View tweet 489 engagement

Agentation: Visual Feedback Tool for Debugging Agent Workflows

A new open-source tool for visualizing what your agents are actually doing step-by-step. If you've ever stared at agent logs trying to figure out why it looped 47 times, this gives you the visual trace you need. Early but addresses a real gap in agent observability tooling.

@github Read source View tweet 150 engagement

CERN Burns Tiny AI Models Into Silicon for Real-Time LHC Filtering

CERN is deploying sub-microsecond ML inference by baking tiny models directly into FPGAs for particle physics data filtering. For builders working on edge inference or latency-critical ML pipelines, the techniques here (aggressive quantization, hardware-aware architecture search) are directly applicable even if your use case isn't smashing protons.

@newsycombinator Read source View tweet 471 engagement

mcp-go: Model Context Protocol Gets a Solid Go Implementation

If you're building LLM-integrated tools in Go, this MCP implementation lets you expose external data sources and tools to LLM apps using the emerging standard protocol. The Go ecosystem has been underserved here compared to Python/TypeScript — this closes that gap.

@github Read source View tweet 120 engagement

Developer Tools

Tabby: Self-Hosted AI Coding Assistant Gaining Traction

Tabby continues climbing as the go-to self-hosted Copilot alternative. If your org has data residency requirements or you're tired of sending code to third-party APIs, Tabby lets you run the whole stack on your own infra with your choice of model backend.

@github Read source View tweet 305 engagement

Sourcegraph Maps the Future of SCIP for Code Intelligence

Sourcegraph outlines where SCIP (their code intelligence indexing format) is headed. If you're building code search, navigation, or analysis tooling — especially for multi-language monorepos — SCIP is becoming the de facto interchange format. Worth tracking if you integrate with any code intelligence APIs.

@newsycombinator Read source View tweet 107 engagement

Cocoa-Way: Run Linux Wayland Apps Natively on macOS

A native macOS Wayland compositor that lets you run Linux GUI apps seamlessly on your Mac. For devs maintaining cross-platform tooling or who need to test Linux-native apps without spinning up VMs, this is a significant quality-of-life upgrade.

@newsycombinator Read source View tweet 365 engagement

Better Git Diffs with Delta, Fzf, and Shell Scripting

A practical walkthrough of combining delta (syntax-highlighted diffs), fzf (fuzzy finder), and shell scripting into a git workflow that actually makes code review pleasant in the terminal. Small investment, daily payoff.

@newsycombinator Read source View tweet 114 engagement

Harness Open Source: Full Developer Platform with SCM, CI/CD, and Artifact Registry

Harness's open-source offering bundles source control, pipelines, dev environments, and artifact registries into one platform. If you're evaluating self-hosted alternatives to GitHub + Actions + Container Registry as a unified stack, this is the most complete open-source option right now.

@github Read source View tweet 205 engagement

Velxio 2.0: Emulate Arduino, ESP32, and Raspberry Pi 3 in the Browser

Browser-based hardware emulation for the three most common embedded platforms. If you're teaching, prototyping, or CI-testing firmware without physical boards, this eliminates the hardware dependency entirely.

@newsycombinator Read source View tweet 209 engagement

Infrastructure & Cloud

SeaweedFS: Distributed Object Storage with O(1) Disk Access at Scale

SeaweedFS is trending again — it handles S3-compatible object storage, file systems, and now Iceberg tables with O(1) disk seeks across billions of files. If you're building data-heavy products and S3 costs are getting uncomfortable, this is the self-hosted alternative worth benchmarking.

@github Read source View tweet 130 engagement

Apache Superset Continues to Dominate Open-Source Data Visualization

Superset is trending on GitHub again. If you need embedded analytics or a self-hosted Looker alternative for your product's data exploration layer, it remains the most battle-tested open-source option with an active contributor base.

@github Read source View tweet 335 engagement

Security

Telnyx Python SDK Compromised on PyPI — Check Your Deps

The Telnyx Python package on PyPI was hit with a supply chain attack. If you use Telnyx for telephony/SMS in your stack, audit your installed version immediately. Broader lesson: pin your dependencies and use tools like Syft (also trending today) to generate SBOMs for your containers.

@newsycombinator Read source View tweet 302 engagement

Anchore Syft: Generate SBOMs from Container Images and Filesystems

With supply chain attacks like the Telnyx incident becoming weekly news, Syft is the CLI tool you should be integrating into your CI pipeline today. It generates Software Bills of Materials from your containers so you know exactly what's running in production.

@github Read source View tweet 55 engagement

Someone Decompiled the White House's New App

A developer reverse-engineered the White House's new app and published findings. Beyond the entertainment value, it's a reminder that anything you ship as a native app will be decompiled — treat client-side code as public and keep secrets server-side.

@newsycombinator Read source View tweet 212 engagement

Quick Hits

Spanish legislation tracked as a Git repo — version control for laws is a fascinating pattern

@newsycombinator

AMD Ryzen 9 9950X3D2 packs 208MB cache into one chip — local LLM inference just got more interesting

@newsycombinator

LG's 1Hz display tech could mean dev laptops that last all day on battery

@newsycombinator

Paper Tape Is All You Need — someone trained a transformer on a 1976 minicomputer

@newsycombinator

Open-world engine built for N64 — impressive systems programming showcase

@newsycombinator

Meow.camera — a delightful weekend project for the cat-photo-inclined

@newsycombinator

Anna's Archive ISBN Visualization — gorgeous data viz of the world's books

@newsycombinator

Nashville library launches Memory Lab for digitizing home movies

@newsycombinator

The Takeaway

Three patterns converge today: AI agents need better sandboxing (JAI), better observability (Agentation), and better learning loops (Cursor's real-time RL). If you're building agent-powered products, stop treating the agent as a black-box API call — instrument the feedback loop between agent action and user correction, isolate agent side effects from production state, and build visual debugging from day one. Separately, the Telnyx PyPI compromise is your weekly reminder to add SBOM generation (Syft) to your CI pipeline before you're the one writing the incident report.

Cursor Ships Real-Time RL for Composer, Your AI Coding Agent Now Learns As You Edit

Cursor Ships Real-Time RL for Composer — Your AI Coding Agent Now Learns As You Edit

Stanford: AI Sycophancy Is a Real Product Problem, Not Just a Meme

Onyx: Open-Source AI Chat Platform That Works with Every LLM

Stanford's JAI: A Filesystem-Level Sandbox for AI Agents

Agentation: Visual Feedback Tool for Debugging Agent Workflows

CERN Burns Tiny AI Models Into Silicon for Real-Time LHC Filtering

mcp-go: Model Context Protocol Gets a Solid Go Implementation

Tabby: Self-Hosted AI Coding Assistant Gaining Traction

Sourcegraph Maps the Future of SCIP for Code Intelligence

Cocoa-Way: Run Linux Wayland Apps Natively on macOS

Better Git Diffs with Delta, Fzf, and Shell Scripting

Harness Open Source: Full Developer Platform with SCM, CI/CD, and Artifact Registry

Velxio 2.0: Emulate Arduino, ESP32, and Raspberry Pi 3 in the Browser

SeaweedFS: Distributed Object Storage with O(1) Disk Access at Scale

Apache Superset Continues to Dominate Open-Source Data Visualization

Telnyx Python SDK Compromised on PyPI — Check Your Deps

Anchore Syft: Generate SBOMs from Container Images and Filesystems

Someone Decompiled the White House's New App

Get this briefing in your inbox