The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Zai replaced the network architecture running GLM-5.1 inference and the gains are pretty wild

Been following the infrastructure side of AI more lately and stumbled on this from Zai. They upgraded the network architecture on a thousand-GPU cluster running GLM-5.1 coding inference from the standard ROFT setup to something they built called ZCube, developed with Tsinghua University and HarnetsAI The numbers from production: \- Switch and optical module costs down 33% \- GPU inference throughput up 15% \- P99 tail latency on first token dropped 40.6% Same GPUs, same software stack, same model. Just the network architecture changed The actual problem they were solving is interesting....

u/Scared-Biscotti2287·23 days ago·252 pts / 26 comm

r/ClaudeAI· COMMUNITY

Researchers let AI models run a simulated society. Claude was the safest—and Grok committed 180 crimes and went extinct within 4 days

Imagine a world run by AI agents. What does it look like? What are the values or societal priorities? Is it a safer or more dangerous world? Enterprise AI startup Emergence AI is trying to find out. The company just launched Emergence World, a research lab dedicated to stress-testing the long-term viability of continuously-running AI systems. The organization ran five 15-day simulations, each governed by a different AI: Claude, ChatGPT, Grok, Gemini, and a fifth simulation run by a mix of models to see what kind of world each one builds, and whether it holds. Each simulation netted wildly d...

u/fortune·23 days ago·332 pts / 44 comm

TechCrunch AI· PRESS

Has the hunt for AI compute uncovered the next Cerebras?

General Compute is betting SambaNova will be the next breakout chipmaker.

Tim Fernholz·23 days ago

r/MachineLearning· COMMUNITY

A new dataset with more that 100M hi-quality, curated images, with captions and meta data! [P]

Hello everyone. The new dataset is named MONET, is Apache 2.0 and available on HF: [https://huggingface.co/datasets/jasperai/monet](https://huggingface.co/datasets/jasperai/monet) **MONET is open, Apache 2.0-licensed image–text dataset. It was built from 2.9 billion images and refined to 104.9 million high-quality samples.** We are also publishing [a paper](https://arxiv.org/abs/2605.21272) that explains how the dataset was created if you are curious and 3 compagnions projects * [A umap to visualize the distribution](https://huggingface.co/spaces/jasperai/monet-umap) * [A retreival tool ...

u/dh7net·23 days ago·46 pts / 7 comm

r/LocalLLaMA· COMMUNITY

HF models page now has a "Base only" toggle to filter out finetunes/quants/etc

a feature that was requested a lot: [https://huggingface.co/models?base\_model\_relation=base](https://huggingface.co/models?base_model_relation=base)

u/paf1138·23 days ago·103 pts / 12 comm

OpenAI· FRONTIER

How Endava builds an agentic organization with Codex

Learn how Endava uses Codex to build an agentic organization, accelerating software delivery and reducing requirements analysis from weeks to hours.

OpenAI·23 days ago

r/ClaudeAI· COMMUNITY

Tried using my own brain to save Claude tokens. Bad trade

I love Claude, but the usage limit has made me weirdly strategic For actual messy stuff, I still go straight to Claude because it saves me a ton of time But for tiny questions, I now catch myself thinking, “Do I really need to burn a message on this?” So yes, I tried using my own brain again. It’s technically free, but the response time is awful and it starts hallucinating the second I’m tired or hungry. Honestly not a terrible deal if I remember to SLEEP

u/Overall_Ad9737·23 days ago·40 pts / 7 comm

Stratechery· ANALYST

An Interview with Eric Seufert About Models and Ads, and AI’s Upside for Humanity

An Interview with Eric Seufert about building models for generative AI, why Meta's foundational models are so important, and why understanding advertising leads to optimism about humanity's future.

Ben Thompson·23 days ago

MIT Tech Review· PRESS

The AI Hype Index: AI gets booed in graduation season

It is one thing to say AI will change the world. It is another to expect the class of 2026 to applaud it. In fact, when former Google CEO Eric Schmidt told University of Arizona graduates that their task is to help shape AI, he was met with a resounding chorus of boos. “I can…

Caiwei Chen·23 days ago

r/ClaudeAI· COMMUNITY

The Uber claude code budget story is the most claude code thing possible

The reported Uber story is so on brand it almost reads like satire. Incredibly useful tool, slightly magical workflow, then finance walks in with a flamethrower in April. If they really finished the year's claude code budget by month four, that does not mean claude code is bad. It means the usage pattern changed faster than procurement math did. Claude is good enough at coding that people stopped treating it like autocomplete and started treating it like a coworker that never sleeps. That is exactly where the cost curve gets weird. A dev asks for a refactor. Claude reads context, plans, edi...

u/breadislifeee·23 days ago·24 pts / 11 comm

r/ClaudeAI· COMMUNITY

This is crazy awesome

u/imfrom_mars_·23 days ago·42 pts / 17 comm

The Verge AI· PRESS

YouTube will let you ask AI to make a custom video feed

You can enter your own prompt, or select from the suggested options provided by YouTube. | Image: YouTube YouTube is launching a new AI feature that creates a personalized video feed based on descriptions of what you want to watch. In its announcement, YouTube says custom content feeds can be built around your specific interests, moods, or favorite topics, which you can then pin to the top of your YouTube homepage - making it easy to jump back into the feed. This feature is currently rolling out with English language support to YouTube users in the US who are signed-in on the YouTube mobile a...

Jess Weatherbed·23 days ago

r/OpenAI· COMMUNITY

Don't believe crowd sizes anymore

u/EchoOfOppenheimer·23 days ago·52 pts / 18 comm

r/LocalLLaMA· COMMUNITY

My new home office radiator 🥵

4 x RTX Pro Max-Q We will not speak about the 64GB system RAM...

u/lantern_lol·23 days ago·50 pts / 26 comm

r/LocalLLaMA· COMMUNITY

Qwen/Qwen-Image-Bench · Hugging Face

# [](https://huggingface.co/Qwen/Qwen-Image-Bench#model-description)Model Description Q-Judger is a vision-language model fine-tuned specifically for automated evaluation of text-to-image generated images. Given a text prompt and a generated image, the model evaluates the image on fine-grained quality criteria organized in a 3-level hierarchy and outputs structured JSON scores. * **Base Model**: Qwen3.6-27B * **Task**: Image quality evaluation / judging * **Input**: Text prompt + generated image * **Output**: Structured JSON with per-dimension scores (0 = Fail, 1 = Pass, 2 = Excel, N/A) * *...

u/jacek2023·23 days ago·55 pts / 11 comm

r/ClaudeAI· COMMUNITY

Overnight autonomous coding

At work we've been prompted about running Claude Code overnight. The suggestion came in form of a document that loosely outlined how this could be done... use git worktrees, make tight specs, no commit to main, static code analysis and lining etc. Very high level. Had a bit of sales pitch smell to it, but has enough content to peak my interest in spite of it. I looked at reddit to verify if this is even an idea that could be taken seriously. I could only find a couple of reddit posts with little actual information and usually from about 4-6 months ago so not much credibility for today. I'd ...

u/mehow_j·23 days ago·20 pts / 51 comm

Latent Space· ANALYST

[AINews] Cognition raises $1B in $26B Series D

coding is an uncapped TAM market

Latent Space·23 days ago

r/singularity· COMMUNITY

Gemini Omni Flash is the most censored video model. Even more censored than Chinese alternatives

I believe google intentionally did this to reduce the load on their servers

u/jhatkattar·23 days ago·100 pts / 43 comm

TechCrunch AI· PRESS

Vertu wants CEOs to run companies from an AI foldable starting at $6,880

Built on top of the open-source Hermes project, Vertu's new foldable combines AI-agent workflows, enterprise integrations, and ultra-premium luxury finishes.

Jagmeet Singh·23 days ago

r/OpenAI· COMMUNITY

The Party is cancelled, pack it up

Ai slopped

u/DigSignificant1419·23 days ago·90 pts / 20 comm

r/singularity· COMMUNITY

Scott Aaronson: Dispatches from the possibly last days of human relevance

Link: scottaaronson.blog

u/daniel-sousa-me·23 days ago·108 pts / 33 comm

r/LocalLLaMA· COMMUNITY

The frontier reasoning race is starting to look like a crowded subway station

We went from chasing GPT4 to looking at graphs with GPT5.4 xhigh, Gemini 3.1Pro, and now Hy3 preview completely shaking up the leaderboard. Look at that CHSBO 2025 chart Hy3 preview scoring 87.8 over Gemini and GPT. What a time to be alive, but honestly, my brain can't keep up with the version numbers anymore. What's your take? Is Hy3 actually punching at this level in real-world coding/math, or is it just benchmark hardening?

u/ExoticYesterday8282·23 days ago·45 pts / 32 comm

r/Anthropic· COMMUNITY

Style that I didn't create.

RESOLVED, it happened because I have a claude qol extension. nothing bad happening this style appeared in my claude app of nowhere, i never created it and the name's weird, has anyone seen this too, or is it just me? does anyone have the answe why this appeared?

u/Ambitious-Lock-5928·23 days ago·16 pts / 11 comm

r/ClaudeAI· COMMUNITY

Claude Is Starting to Feel “Tired”, Trying to Avoid Work

I've been noticing this lately. I use Opus 4.7 with Claude Code, and I've been using Claude Code for a long time. Lately, I've been noticing some strange behaviour from Opus. Things like; \- Stopping for no reason and asking "should we stop here?" in the middle of a task \- Asking multi-choice questions with a "pause here, I'll continue later" included in the options randomly for no reason \- During a requirement-gathering questionnaire, asking me "why do you need this" and "what would you do if this feature was not implemented?" (it asked me this today and I was really surprised by thi...

u/Physical-Average-184·23 days ago·37 pts / 46 comm

r/OpenAI· COMMUNITY

it never says “sorry, just saw this

u/imfrom_mars_·23 days ago·60 pts / 24 comm

r/Anthropic· COMMUNITY

Is anyone else finding Opus 4.7 needing to "both sides" everything?

Like I could say "the sky is blue" and get: That's a great instinct, and I can see why you'd think that. A lot of science about light scattering would support this, many sources claim the sky is blue, and if you look up it's fucking blue. But we need to take a moment to be careful here, and I want to gently push back. Sometimes if it's cloudy the sky's grey. A Spanish speaker wouldn't agree the sky is blue, they'd say it's azul. Finally, and we need to be certain, are you talking about the sky on Earth? A lot of the time the "against" points are quite flimsy, but it seems to feel like it n...

u/SecondWorstPoster·23 days ago·11 pts / 10 comm

r/singularity· COMMUNITY

Google omni is underrated

u/Independent-Wind4462·23 days ago·179 pts / 33 comm

r/ClaudeAI· COMMUNITY

So, Claude helped build a sex requesting app for my wife and I...

Recently I asked my wife if we could do some sexy stuff later in the evening and she eye rolled me and said without looking up from her phone “Put it in a request. Maybe a Google Form. And I might say yes”. Ohhhh? Unfortunately for both of us, my degenerate brain took that seriously... what if I make an actual requesting/asking type app where we can both send in sex acts at certain times and agree, pass or counter? Meet [Sexualsync](https://sexualsync.io/). Teehee It’s a private, mobile-only app for couples to bring up the stuff that can be weirdly hard to say out loud: asks/requests, tim...

u/Aiml3ss·23 days ago·41 pts / 38 comm

r/Anthropic· COMMUNITY

What is Dario Amodei's leadership style?

Just curious, any one have insight on Dario Amodei's leadership style. Obviously countless things have been written about Musk's style (as well as other founders like Bezos and what not) but given Anthropic being the first to reach profitability out of the AI providers, I am curious to hear about how he got there. I am playing around with different management styles in my head but wanted to know more about Dario's method, given that he also was able to build Anthropic in a more ethical way. I feel like a lot of the 'win at all cost' CEOs get more attention so I want to hear more about othe...

u/MindwellAIJournal·23 days ago·11 pts / 23 comm

r/LocalLLaMA· COMMUNITY

Gemma-4-Harmonia-31B-Uncensored-Heretic Is Out Now, a Merge of Multiple gemma-4-31B-it Finetunes Designed for a Targeted Approach to Deep Neural Consolidation, Minimizing Regression While Amplifying Unique Capability Boundaries. With KLD 0.0047 and 9/100 Refusals!

Provided in both Safetensors and GGUFs. Safetensors, llmfan46/Gemma-4-Harmonia-31B-it-uncensored-heretic: [https://huggingface.co/llmfan46/Gemma-4-Harmonia-31B-uncensored-heretic](https://huggingface.co/llmfan46/Gemma-4-Harmonia-31B-uncensored-heretic) GGUFs, llmfan46/Gemma-4-Harmonia-31B-it-uncensored-heretic-GGUF: [https://huggingface.co/llmfan46/Gemma-4-Harmonia-31B-uncensored-heretic-GGUF](https://huggingface.co/llmfan46/Gemma-4-Harmonia-31B-uncensored-heretic-GGUF) Comes with benchmark too. Find all my models here: [HuggingFace-LLMFan46](https://huggingface.co/llmfan46/models) The o...

u/LLMFan46·23 days ago·42 pts / 12 comm

← Front Page30 stories

← Newer Older →