Vol. I · No. 62SAT, JUN 20, 2026
Archive

The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Zai replaced the network architecture running GLM-5.1 inference and the gains are pretty wild

Been following the infrastructure side of AI more lately and stumbled on this from Zai. They upgraded the network architecture on a thousand-GPU cluster running GLM-5.1 coding inference from the standard ROFT setup to something they built called ZCube, developed with Tsinghua University and HarnetsAI The numbers from production: \- Switch and optical module costs down 33% \- GPU inference throughput up 15% \- P99 tail latency on first token dropped 40.6% Same GPUs, same software stack, same model. Just the network architecture changed The actual problem they were solving is interesting....

··

Researchers let AI models run a simulated society. Claude was the safest—and Grok committed 180 crimes and went extinct within 4 days

Imagine a world run by AI agents. What does it look like? What are the values or societal priorities? Is it a safer or more dangerous world? Enterprise AI startup Emergence AI is trying to find out. The company just launched Emergence World, a research lab dedicated to stress-testing the long-term viability of continuously-running AI systems. The organization ran five 15-day simulations, each governed by a different AI: Claude, ChatGPT, Grok, Gemini, and a fifth simulation run by a mix of models to see what kind of world each one builds, and whether it holds. Each simulation netted wildly d...

··

A new dataset with more that 100M hi-quality, curated images, with captions and meta data! [P]

Hello everyone. The new dataset is named MONET, is Apache 2.0 and available on HF: [https://huggingface.co/datasets/jasperai/monet](https://huggingface.co/datasets/jasperai/monet) **MONET is open, Apache 2.0-licensed image–text dataset. It was built from 2.9 billion images and refined to 104.9 million high-quality samples.** We are also publishing [a paper](https://arxiv.org/abs/2605.21272) that explains how the dataset was created if you are curious and 3 compagnions projects * [A umap to visualize the distribution](https://huggingface.co/spaces/jasperai/monet-umap) * [A retreival tool ...

··

Tried using my own brain to save Claude tokens. Bad trade

I love Claude, but the usage limit has made me weirdly strategic For actual messy stuff, I still go straight to Claude because it saves me a ton of time But for tiny questions, I now catch myself thinking, “Do I really need to burn a message on this?” So yes, I tried using my own brain again. It’s technically free, but the response time is awful and it starts hallucinating the second I’m tired or hungry. Honestly not a terrible deal if I remember to SLEEP

··

The AI Hype Index: AI gets booed in graduation season

It is one thing to say AI will change the world. It is another to expect the class of 2026 to applaud it. In fact, when former Google CEO Eric Schmidt told University of Arizona graduates that their task is to help shape AI, he was met with a resounding chorus of boos. “I can…

·

The Uber claude code budget story is the most claude code thing possible

The reported Uber story is so on brand it almost reads like satire. Incredibly useful tool, slightly magical workflow, then finance walks in with a flamethrower in April. If they really finished the year's claude code budget by month four, that does not mean claude code is bad. It means the usage pattern changed faster than procurement math did. Claude is good enough at coding that people stopped treating it like autocomplete and started treating it like a coworker that never sleeps. That is exactly where the cost curve gets weird. A dev asks for a refactor. Claude reads context, plans, edi...

··

YouTube will let you ask AI to make a custom video feed

You can enter your own prompt, or select from the suggested options provided by YouTube. | Image: YouTube YouTube is launching a new AI feature that creates a personalized video feed based on descriptions of what you want to watch. In its announcement, YouTube says custom content feeds can be built around your specific interests, moods, or favorite topics, which you can then pin to the top of your YouTube homepage - making it easy to jump back into the feed. This feature is currently rolling out with English language support to YouTube users in the US who are signed-in on the YouTube mobile a...

·

Qwen/Qwen-Image-Bench · Hugging Face

# [](https://huggingface.co/Qwen/Qwen-Image-Bench#model-description)Model Description Q-Judger is a vision-language model fine-tuned specifically for automated evaluation of text-to-image generated images. Given a text prompt and a generated image, the model evaluates the image on fine-grained quality criteria organized in a 3-level hierarchy and outputs structured JSON scores. * **Base Model**: Qwen3.6-27B * **Task**: Image quality evaluation / judging * **Input**: Text prompt + generated image * **Output**: Structured JSON with per-dimension scores (0 = Fail, 1 = Pass, 2 = Excel, N/A) * *...

··

Overnight autonomous coding

At work we've been prompted about running Claude Code overnight. The suggestion came in form of a document that loosely outlined how this could be done... use git worktrees, make tight specs, no commit to main, static code analysis and lining etc. Very high level. Had a bit of sales pitch smell to it, but has enough content to peak my interest in spite of it. I looked at reddit to verify if this is even an idea that could be taken seriously. I could only find a couple of reddit posts with little actual information and usually from about 4-6 months ago so not much credibility for today. I'd ...

··

The frontier reasoning race is starting to look like a crowded subway station

We went from chasing GPT4 to looking at graphs with GPT5.4 xhigh, Gemini 3.1Pro, and now Hy3 preview completely shaking up the leaderboard. Look at that CHSBO 2025 chart Hy3 preview scoring 87.8 over Gemini and GPT. What a time to be alive, but honestly, my brain can't keep up with the version numbers anymore. What's your take? Is Hy3 actually punching at this level in real-world coding/math, or is it just benchmark hardening?

··

Style that I didn't create.

RESOLVED, it happened because I have a claude qol extension. nothing bad happening this style appeared in my claude app of nowhere, i never created it and the name's weird, has anyone seen this too, or is it just me? does anyone have the answe why this appeared?

··

Claude Is Starting to Feel “Tired”, Trying to Avoid Work

I've been noticing this lately. I use Opus 4.7 with Claude Code, and I've been using Claude Code for a long time. Lately, I've been noticing some strange behaviour from Opus. Things like; \- Stopping for no reason and asking "should we stop here?" in the middle of a task \- Asking multi-choice questions with a "pause here, I'll continue later" included in the options randomly for no reason \- During a requirement-gathering questionnaire, asking me "why do you need this" and "what would you do if this feature was not implemented?" (it asked me this today and I was really surprised by thi...

··

Is anyone else finding Opus 4.7 needing to "both sides" everything?

Like I could say "the sky is blue" and get: That's a great instinct, and I can see why you'd think that. A lot of science about light scattering would support this, many sources claim the sky is blue, and if you look up it's fucking blue. But we need to take a moment to be careful here, and I want to gently push back. Sometimes if it's cloudy the sky's grey. A Spanish speaker wouldn't agree the sky is blue, they'd say it's azul. Finally, and we need to be certain, are you talking about the sky on Earth? A lot of the time the "against" points are quite flimsy, but it seems to feel like it n...

··

So, Claude helped build a sex requesting app for my wife and I...

Recently I asked my wife if we could do some sexy stuff later in the evening and she eye rolled me and said without looking up from her phone “Put it in a request. Maybe a Google Form. And I might say yes”. Ohhhh? Unfortunately for both of us, my degenerate brain took that seriously... what if I make an actual requesting/asking type app where we can both send in sex acts at certain times and agree, pass or counter? Meet [Sexualsync](https://sexualsync.io/). Teehee It’s a private, mobile-only app for couples to bring up the stuff that can be weirdly hard to say out loud: asks/requests, tim...

··

What is Dario Amodei's leadership style?

Just curious, any one have insight on Dario Amodei's leadership style. Obviously countless things have been written about Musk's style (as well as other founders like Bezos and what not) but given Anthropic being the first to reach profitability out of the AI providers, I am curious to hear about how he got there. I am playing around with different management styles in my head but wanted to know more about Dario's method, given that he also was able to build Anthropic in a more ethical way. I feel like a lot of the 'win at all cost' CEOs get more attention so I want to hear more about othe...

··

Gemma-4-Harmonia-31B-Uncensored-Heretic Is Out Now, a Merge of Multiple gemma-4-31B-it Finetunes Designed for a Targeted Approach to Deep Neural Consolidation, Minimizing Regression While Amplifying Unique Capability Boundaries. With KLD 0.0047 and 9/100 Refusals!

Provided in both Safetensors and GGUFs. Safetensors, llmfan46/Gemma-4-Harmonia-31B-it-uncensored-heretic: [https://huggingface.co/llmfan46/Gemma-4-Harmonia-31B-uncensored-heretic](https://huggingface.co/llmfan46/Gemma-4-Harmonia-31B-uncensored-heretic) GGUFs, llmfan46/Gemma-4-Harmonia-31B-it-uncensored-heretic-GGUF: [https://huggingface.co/llmfan46/Gemma-4-Harmonia-31B-uncensored-heretic-GGUF](https://huggingface.co/llmfan46/Gemma-4-Harmonia-31B-uncensored-heretic-GGUF) Comes with benchmark too. Find all my models here: [HuggingFace-LLMFan46](https://huggingface.co/llmfan46/models) The o...

··
30 stories