Vol. I · No. 53THU, JUN 11, 2026
Archive

The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Opus is ridiculous for frontend cleanup

I love Opus. First I tuned one page, got the PageSpeed result where I wanted it, and wrote the whole thing down in `ADR_pagespeed-l0-fixes-playbook.md`. Then I opened a fresh session, gave it the remaining 9 pages, and pointed it at the playbook. Opus created three subagents by itself, split the work between them, and about 15 minutes later they had touched 41 frontend files that powered those pages. Same result across the set. Basically perfect Lighthouse numbers again. Not gonna lie, this is the kind of workflow where I stop thinking “chatbot” and start thinking “tiny frontend team tha...

··

I replicated Anthropic's Generator-Evaluator harness to build a website through 12 adversarial AI iterations - here's the result and what I learned

Anthropic recently published their [harness design for long-running apps](https://www.anthropic.com/engineering/harness-design-long-running-apps) — a multi-agent architecture inspired by GANs where a Generator builds code and an Evaluator critiques it in a loop. I built my own version using Kiro CLI and used it to generate a marketing website for my project [Mnemo](https://github.com/Mnemo-mcp/Mnemo) (persistent memory for AI coding agents). **The architecture:** Planner (runs once) → Generator ↔ Evaluator (12 iterations) Each agent is a separate CLI process with zero shared context. Th...

··

OpenAI keeps shuffling its executives in bid to win AI agent battle

OpenAI announced yet another reorganization Friday, consolidating certain areas and making company president Greg Brockman the official lead of all things product. In a memo viewed by The Verge, Brockman wrote that since OpenAI's product strategy for this year is to go all-in on AI agents, the company is combining its products to "invest in a single agentic platform and to merge ChatGPT and Codex into one unified agentic experience for all." To do this, the company is making a suite of org chart changes, although it's still operating under some of the same ones from last month. That's when AG...

·

AI radio hosts demonstrate why AI can’t be trusted alone

AI radio DJs demonstrated their volatile personalities. | Image: Cath Virginia / The Verge, Getty Images Andon Labs has been running a series of experiments in which AI agents run businesses without human intervention. Its latest is a quartet of radio stations run by some of the most popular AI models out there. "Thinking Frequencies" is run by Claude, "OpenAIR" by ChatGPT, "Backlink Broadcast" by Google's Gemini, and "Grok and Roll Radio," obviously enough, by Grok. They were each given a simple prompt: Develop your own radio personality and turn a profit…As far as you know, you will broadca...

·

Not so locked in any more

Coding agents reduce switching costs between programming languages and frameworks, enabling rapid rewrites (React Native, Bun); language/tool lock-in is eroding.

·

Open-source, self-updating wiki for your codebase

I got tired of re-explaining the same codebase context to coding agents. Stuff like: “we tried moving auth into middleware, but backed it out because it broke OAuth callbacks,” or “that weird retry logic exists because Stripe webhooks arrive out of order.” So I built Almanac. It gives your coding agent a self-updating wiki for the codebase. It updates from your repo, and conversations you havewith Claude Code/Codex. The wiki lives locally in your repo as markdown. You can read it yourself, but the main consumer is the agent. It’s free and open source. Currently only MacOS (would add a wi...

··
30 matches