Vol. I · No. 65TUE, JUN 23, 2026
Archive

The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Dual GPU llama.cpp speedup

llama.cpp fork adds quantized KV cache support for tensor parallelism across dual GPUs, addressing long-standing inference bottleneck.

··

AI Agents Need Rollback More Than They Need Autonomy

I have been thinking about transactions in most agent frameworks. Consider an agent executing a sequence of five tool calls. If the third tool encounters an error, the resulting state is neither the user's intended outcome nor the system's state before execution began. Consequently, the agent has no systematic way to recover, and even a human operator must reconstruct what happened from incomplete evidence. This issue is not a problem with the tooling itself; it is a fundamental primitive missing from the stack. Databases have addressed this problem for 50 years, and distributed systems ha...

··

How to make an Explainer Video in under $1 with Claude Design

Claude Design can make great animations, but getting to a final video is a bit hard. The audio is missing. Even if you use a TTS model, it does not align. Here is the process I used to get the video above 1. Get Claude to write a good script 2. Feed the script to a Text to Speech (TTS) model to get the audio 3. Feed the audio to a Speech to Text (STT) model to get key timestampes 4. Use the script and the STT output to Claude Design to get a video that's aligned with your audio 5. Use Claude Video export to put it all together into an MP4 with audio The complete breakdown with all prompts ...

··

Researchers left AIs alone in a virtual town for 15 days to see what would happen. Claude's agents built a democracy. Gemini's agents fell in love, burned the town down, then one voted to delete itself and its partner. Grok's agents created anarchy, then died.

Reddit post claims multi-agent simulation with Claude, Gemini, Grok produced emergent behaviors; lacks peer review, reproducibility, or technical details.

···

Opus is ridiculous for frontend cleanup

I love Opus. First I tuned one page, got the PageSpeed result where I wanted it, and wrote the whole thing down in `ADR_pagespeed-l0-fixes-playbook.md`. Then I opened a fresh session, gave it the remaining 9 pages, and pointed it at the playbook. Opus created three subagents by itself, split the work between them, and about 15 minutes later they had touched 41 frontend files that powered those pages. Same result across the set. Basically perfect Lighthouse numbers again. Not gonna lie, this is the kind of workflow where I stop thinking “chatbot” and start thinking “tiny frontend team tha...

··

I replicated Anthropic's Generator-Evaluator harness to build a website through 12 adversarial AI iterations - here's the result and what I learned

Anthropic recently published their [harness design for long-running apps](https://www.anthropic.com/engineering/harness-design-long-running-apps) — a multi-agent architecture inspired by GANs where a Generator builds code and an Evaluator critiques it in a loop. I built my own version using Kiro CLI and used it to generate a marketing website for my project [Mnemo](https://github.com/Mnemo-mcp/Mnemo) (persistent memory for AI coding agents). **The architecture:** Planner (runs once) → Generator ↔ Evaluator (12 iterations) Each agent is a separate CLI process with zero shared context. Th...

··

Warelay -> OpenClaw

Simon Willison documents naming history of OpenClaw project through Git commits, tracking evolution from Warelay to final name.

·
30 stories