Vol. I · No. 71MON, JUN 29, 2026
Archive

The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

The exact KV cache usage of DeepSeek V4

DeepSeek V4 achieves ~7.9x KV cache reduction vs V3.x at 1M context through improved architecture, reducing memory from 68.6GB to 8.7GB for Pro variant.

··

Claude for Personal USE

Reddit thread asking about personal use cases for Claude in grocery planning, fitness, and finance management.

··

Qwen3.5/3.6 Coder?

Reddit speculation on whether Alibaba will release a dedicated Qwen Coder variant or focus on general-purpose Qwen models with strong coding performance.

··

Introducing AutoMuon, a one line drop in for AdamW [P]

Hey everyone, I've been working on a small Python package called AutoMuon that makes the Muon optimizer usable as a drop-in replacement for AdamW in arbitrary PyTorch training pipelines. The core idea is relatively simple: Muon works primarily on 2D weight matrices (linear projections, conv layers) on hidden states, but you still need AdamW for embeddings, norms, and biases, etc. AutoMuon scans your model at init, figures out the right optimizer for each parameter automatically. I am open to PRs, especially for expanding the module-type exclusion list if you hit edge cases in your architect...

··

GPT-Image 2.0 is lowkey blowing my mind

Reddit user reports subjective quality improvements in GPT-Image 2.0 with better scene composition and lighting consistency.

··

OPUS Feedback

Just wanted to say that since Opus 4.5, I've noticed it's gotten progressively lazier. Like to the point it's been running for days on the same task and no matter how many times I've run it with explicit instructions, it ALWAYS tries to cop out with shortcuts. ALWAYS follows with "i took shortcuts, no excuses, i'm owning it" or something along the lines ... then 2 minutes later ignores explicit instructions again with another round of laziness and apologies. I've had zero difference in how I prompt and zero changes in my setup. It's simply a change in the model.

··

Did I just get PUNKED by AI?!?!

Reddit user reports anecdotal experience of AI output repetition; no technical details or actionable findings.

··

GPT 5.5 vs Opus 4.6/7 vs Gemini 3.1 Pro

Reddit user's subjective ranking of GPT-5.5, Claude Opus 4.6, Gemini 3.1 Pro; commentary on frontier model performance and business incentives.

···

Claude Opus 4.7 solved the dorito smear after 19 mins - worth mentioning to video enthusiasts

Sorry for this long boring post but video encoding enthusiasts might find this one helpful information. And, it's an impressive set of circumstances. To recap quick, there was a strange dorito smear happening where the color was lifting off of image vertically, you can see it the most if you look at the top of the bend of the pipe on the tower to the left. It was bothering me and I had suspected TAA ghosting, SDFGI issues, codec issues and/or a NVENC bug. But, nothing we did would solve it. I went to bed and asked Claude to drill into it until it found a solution. Here it is for posterity (af...

··
30 stories