The Archive
Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.
FINAL-Bench/Darwin-36B-Opus · Hugging Face
Darwin-36B-Opus: 36B MoE model created via evolutionary breeding of Qwen3.6-35B and Claude-distilled variants, released as GGUF quantization.
I open sourced a project tracker for Claude Code that lives in .story/: tickets, issues, and session handovers as files
I built Storybloq (previously Claude Story) for my own Claude Code workflow, and used Storybloq itself to build Storybloq. The `.story/` directory in the repo has tracked every ticket, issue, and session handover across the project's development, so the tool is its own longest-running test case. Sharing it in case it's useful. It's free and open source. The problem: every new Claude Code session forgets the last one. So you re-explain architecture, re-litigate tradeoffs you already settled, and the codebase drifts a degree at a time on long projects. Storybloq gives your repo a `.story/...
OpenAI model releases over time
Reddit thread listing OpenAI's historical model releases without new analysis or announcement.
How it feels this month
Reddit thread capturing community sentiment about AI progress; lacks technical substance or novel claims.
Claude Status Update : Investigated elevated errors and slower responses on claude.ai on 2026-04-25T18:42:40.000Z
Claude.ai experienced elevated errors and slower responses on 2026-04-25; incident status available on official status page.
Using Claude Code as my viral video content pipeline now
User demonstrates Claude Code integration with TwoShot's content creation MCP for end-to-end video production from script to render.
Me after switching from Claude to Codex
Reddit post comparing Claude and Codex; lacks substantive technical content or benchmarking data.
OpenAI CEO apologizes to Tumbler Ridge community
In a letter to the residents of Tumbler Ridge, Canada, OpenAI CEO Sam Altman said he is “deeply sorry” that his company failed to alert law enforcement about the suspect in a recent mass shooting.
WHY ARE YOU LIKE THIS
ChatGPT Images 2.0 unexpectedly added creative flourishes (sarcastic sign text) to a complex multi-entity image generation prompt without explicit instruction.
GLM 5.1 Locally: 40tps, 2000+ pp/s
GLM 5.1 achieves 40 tokens/s and 2000+ prefill tokens/s on RTX 6000 Pro hardware with sglang optimization.
Why Cohere is merging with Aleph Alpha
Canadian AI startup Cohere is taking over Germany-based Aleph Alpha with support from Lidl’s owner, Schwarz Group. With the blessing of their governments, the companies intend to offer a sovereign alternative to enterprises in an AI landscape dominated by American players.
FP4 inference in llama.cpp (NVFP4) and ik_llama.cpp (MXFP4) landed - Finally
llama.cpp and ik_llama.cpp now support FP4 inference with different formats: NVFP4 (Nvidia E4M3) and MXFP4 (MX standard) across varying hardware backends.
Gas power projects for just 11 US data center 'campuses' could emit more greenhouse gases than entire countries, according to report
Report warns 11 US AI data center campuses powered by gas could emit emissions exceeding entire countries, raising sustainability concerns.
First time ever ChatGPT (5.5) was aware of its limitations and, fearing a wrong answer, refused to respond and asked for better inputs!
Reddit user reports ChatGPT 5.5 declining to answer dot-connecting task, citing uncertainty—anecdotal observation lacking reproducibility or systematic evidence.
What do you do while Claude is thinking?
Reddit discussion about user behavior during Claude processing latency; anecdotal survey of multitasking habits.
Why Tokyo is the most important tech destination of 2026
SusHi Tech Tokyo 2026 has four tightly defined technology domains, each backed by live demonstrations, dedicated exhibit floors, and sessions featuring the people actually building and funding these technologies globally.
Apple under Ternus: what comes next for the tech giant’s hardware strategy
John Ternus, Apple's incoming CEO, is a hardware guy, signaling Apple may be putting devices back at the center of its strategy.
[Demo] Real-time EEG analysis-driven guided-meditation system
Real-time EEG-driven meditation system uses OpenBCI and TouchDesigner to generate adaptive multimodal guidance via video, voice, light, and text.
Why does chatgpt never know when to say "I don't know"?
User question about ChatGPT's tendency to generate plausible-sounding false answers (hallucinations) instead of expressing uncertainty.
PSA: The string "HERMES.md" in your git commit history silently routes Claude Code billing to extra usage — cost me $200
TL;DR: If your git commits mention "HERMES.md" (uppercase), Claude Code quietly stops using your Max plan and starts billing you at API rates. Anthropic's support acknowledged the bug, thanked me for finding it, and refused a refund. Apparently their AI safety principles don't extend to your wallet. **The story** I'm on Max 20x ($200/month). Today Claude Code started throwing: \> "You're out of extra usage. Add more at [claude.ai/settings/usage](http://claude.ai/settings/usage) and keep going." Weird, because my plan dashboard showed 13% weekly usage and 0% current session. 86...
Field report: coding with Qwen 3.6 35B-A3B on an M2 Macbook Pro with 32GB RAM
Developer documents practical setup for running Qwen 3.6 35B on M2 MacBook Pro 32GB via llama.cpp, with performance notes and optimization tips.
What happened to Anthropic test cutting the MAX 20X plan limits by 50% and removing CC from Pro plan for 2% of users and? If it works, will they roll it out to everyone? What does that test mean?, and why are most users quiet about it? Would you pay $200 for 10X Pro? or $400 for your current 20X?
Reddit speculation about Anthropic pricing test reducing Claude Pro usage limits by 50% for subset of users.
Older models moving back to 200k context window. FYI
Anthropic reduces context window for older Claude models from current capability back to 200k tokens.
Can’t code and can’t track calories wtf anthropic
Reddit user reports Claude failing at calorie tracking task, requests feature debugging.
OpenAI CEO Sam Altman apologizes for not flagging mass shooter to police
Sam Altman apologizes for OpenAI's failure to report mass shooter threat to authorities; governance/safety process issue.
Decreased Intelligence Density in DeepSeek V4 Pro
DeepSeek V4 Pro exhibits degraded token efficiency vs. V3.2 despite 2.5x scale increase, suggesting intelligence density declined.
Anyone actually using Dispatch for something useful?
Reddit user questions practical use cases for Anthropic's Dispatch feature beyond basic remote task submission.