Vol. I · No. 69 · SAT, JUN 27, 2026

The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Llama.cpp MTP support now in beta!

llama.cpp adds beta MTP (Multi-Token Prediction) support, starting with Qwen3.5, closing the performance gap with vLLM on token generation.

··

[Release] TinyMozart v2 85M 🎶

TinyMozart v2 85M, an unconditional MIDI piano generation model, released with improvements for chord and length control.

··

Most of my Claude usage was on work that didn't need Claude. Cut my bill 60x on bulk tasks with a tiny side model.

I looked at what was actually eating my Claude usage and it was embarrassing. Classifying files. Reformatting JSON. Pulling fields out of text. Summarizing docs I was going to skim anyway. None of that needed Sonnet. All of it cost the same as the work that did. Tried the obvious fixes first. Switching to Haiku for simple stuff (still wasteful at volume). Tighter prompts (helps a little). /compact (delays the problem). None of it changed the shape of the spend. What actually worked: a small cheap model running as a side worker, with one rule in CLAUDE.md telling Claude not to do the mechani...

··

Google Earnings, Meta Earnings

Stratechery analysis: Google's stock outperformed Meta's despite weaker core metrics; Google's AI monetization strategy (including Anthropic investment) cited as key driver.

·

Vibe Coding vs. Production reality

Reddit discussion on gap between AI-assisted prototyping speed and production-ready deployment, highlighting auth, compliance, and vendor lock-in risks.

··

Flagged chat????

User reports Claude responding with Andes virus information when asked about hantavirus on a cruise ship.

··

IM A GPU REPAIR TECH ANTHROPIC. WHAT IS THIS

https://preview.redd.it/ebm71bi4o1zg1.png?width=1864&format=png&auto=webp&s=944a6179a5be05c619b8ae8537866d8b7676a16f Sure, I asked to reverse engineer some binaries used for testing GPUs to make them work for my specific mods, but this is ridiculous and standing in the way of providing critical work for thousands of dollars' worth of GPUs.

··

Unprompted.

Pretty cool. I am probably being a bit careless running it freed like that, but it is still wild to see lol.

··
30 stories