The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

DeepSeek V4 paper full version is out, FP4 QAT details and stability tricks [D]

DeepSeek V4 full paper details FP4 quantization-aware training, achieving 27% FLOP reduction and 10% KV cache vs. V3.2 with 99.7% recall.

u/Dramatic_Spirit_8436·2 months ago·59 pts / 5 comm

r/ClaudeAI· COMMUNITY

It is now behaving like the troublesome seniors we used to deal with

User reports Claude model exhibiting problematic behavior patterns; vague anecdotal complaint without technical detail.

u/farhan-dev·2 months ago·30 pts / 42 comm

r/ClaudeAI· COMMUNITY

What Claude says vs What Claude thinks

Anthropic research: [https://www.anthropic.com/research/natural-language-autoencoders](https://www.anthropic.com/research/natural-language-autoencoders)

u/EchoOfOppenheimer·2 months ago·35 pts / 9 comm·+ covered by others

r/LocalLLaMA· COMMUNITY

Qwen doesn't work for free

Community discussion about Qwen model availability and pricing constraints.

u/Dion-AI·2 months ago·43 pts / 20 comm

r/ClaudeAI· COMMUNITY

How I made my Claude setup more consistent

I’ve been trying different Claude setups for a while, and honestly, most of them don’t hold up once you start using them in real work. At first, everything looks fine. Then you realize you’re repeating the same context every time, and that “perfect prompt” you wrote works once… then falls apart. This is the first setup that’s been consistently usable for me. The main shift was simple: I stopped treating Claude like a chat. I started using projects and keeping context in separate files: * [about-me.md](http://about-me.md/) (what I actually do) * [my-voice.md](http://my-voice.md/) (how I w...

u/SilverConsistent9222·2 months ago·20 pts / 6 comm·+ covered by others

r/ClaudeAI· COMMUNITY

Sonnet 4.5 is being retired.

Claude Sonnet 4.5 model is being retired; user expresses sentiment about discontinuation.

u/Jambo679·2 months ago·39 pts / 21 comm

r/LocalLLaMA· COMMUNITY

How long for llama.cpp official support of MTP?

Reddit user asks about llama.cpp timeline for Vulkan/HIP MTP support on Strix Halo Windows 11.

u/Manaberryio·2 months ago·42 pts / 35 comm

r/ClaudeAI· COMMUNITY

How do you usually get around when starting big projects in Claude Code?

Reddit user seeks advice on structuring large Claude-assisted projects for accounting dashboard and RAG solution.

u/Deitri·2 months ago·38 pts / 10 comm

r/LocalLLaMA· COMMUNITY

Shel Silverstein predicts LLM's (and its hallucinations), cira 1981

Reddit post sharing a Shel Silverstein poem from 1981 as amusing parallel to modern LLM hallucinations; cultural observation, no technical content.

u/spanielrassler·2 months ago·98 pts / 11 comm

r/ClaudeAI· COMMUNITY

Unhinged cowork use cases

Reddit discussion on practical use cases for Anthropic's Cowork feature; user seeks complex automation ideas beyond basic file/email organization.

u/FireburstSunSpirit·2 months ago·20 pts / 13 comm

r/Anthropic· COMMUNITY

Ran out of..

u/redditslutt666·2 months ago·10 pts / 4 comm

Latent Space· ANALYST

[AINews] Anthropic growing 10x/year while everyone else is laying off >10% of their workforce

Anthropic maintains 10x annual growth while peers implement layoffs exceeding 10% of workforce, highlighting divergent scaling trajectories in AI sector.

Latent Space·2 months ago

Simon Willison· ANALYST

Quoting Luke Curley

WebRTC's latency optimization for audio drops packets; Luke Curley argues LLM inference should prioritize accuracy over speed.

Simon Willison·2 months ago

r/ClaudeAI· COMMUNITY

Those of you who use both ChatGPT and Claude — what’s each one actually better at?

Reddit thread comparing Claude and ChatGPT user preferences; anecdotal reports favor Claude for document processing, ChatGPT for vision tasks.

u/banger030·2 months ago·25 pts / 30 comm

r/singularity· COMMUNITY

METR evaluated an early version of Claude Mythos

METR evaluated Claude Mythos Preview on autonomous task completion, estimating 16hr median time-horizon with methodological limitations at the measurement ceiling.

u/RavingMalwaay·2 months ago·100 pts / 35 comm

MIT Tech Review· PRESS

Musk v. Altman week 2: OpenAI fires back, and Shivon Zilis reveals that Musk tried to poach Sam Altman

In the second week of the landmark trial between Elon Musk and OpenAI, Musk’s motivations for bringing the suit were under scrutiny. Last week, Musk took the stand, alleging that OpenAI CEO Sam Altman and president Greg Brockman had deceived him into donating $38 million to the company. He claimed that they’d promised to maintain…

Michelle Kim·2 months ago

r/LocalLLaMA· COMMUNITY

Tribue to April's LLM releases

Reddit post reflecting on April 2026 local LLM releases without specific technical details or claims.

u/Everlier·2 months ago·78 pts / 10 comm

r/singularity· COMMUNITY

AI gives us the 80s TV show we should have had

Reddit post showing AI-generated 1980s TV show concept; lacks technical depth or novel capability demonstration.

u/HomeNowWTF·2 months ago·122 pts / 24 comm

TechCrunch AI· PRESS

Laid-off Oracle workers tried to negotiate better severance. Oracle said no.

Some found out they didn't qualify for WARN Act protections like two-months notice because the company had classified them as remote workers.

Julie Bort·2 months ago

r/LocalLLaMA· COMMUNITY

MTP is all about acceptance rate

MTP speculative decoding shows task-dependent gains: 1.53× speedup on code, 0.5× slowdown on JSON; effectiveness tied to draft acceptance rate.

u/Hydroskeletal·2 months ago·41 pts / 17 comm

r/ClaudeAI· COMMUNITY

I built a Pokémon-styled multi-agent dashboard to manage all Claude Code sessions

Like many others here, I got frustrated with managing all my different claude/codex sessions, so i built Pokegents, which is an open source multi-agent workspace for coding agents. It has a Pokemon-themed dashboard/chat interface plus a local orchestration server for managing agent sessions (currently supports Claude Code in iTerm2, plus Claude and Codex through ACP-based chat runtimes), persistent agent identities, mcp messaging between agents, notifications, session cloning, and more. This was mostly a vibe-coded side project, but I've been using it constantly in my day-to-day workflow as ...

u/girishkumama·2 months ago·31 pts / 5 comm

r/MachineLearning· COMMUNITY

Interactive KL Divergence Visualisation [P]

I built a small interactive explorer for building intuition about KL divergence: https://robotchinwag.com/posts/kl-divergence-visualisation/ You control two skew-normal distributions and can see the KL integrand and the KL metric. It’s good for exploring how it changes with a mean offset, skew, truncation and discretisation. It run entirely close side. Feedback is welcome.

u/ancillia·2 months ago·30 pts / 5 comm

r/LocalLLaMA· COMMUNITY

Qwen 35B-A3B is very usable with 12GB of VRAM

Qwen 35B-A3B MoE model runs effectively on RTX 3060 12GB with proper MoE block GPU allocation and 16k-32k context windows.

u/jwestra·2 months ago·47 pts / 13 comm

r/ClaudeAI· COMMUNITY

What are y'all using Haiku for nowadays?

Reddit discussion on practical use cases and capabilities of Claude Haiku in developer workflows.

u/senkichi·2 months ago·20 pts / 49 comm

r/LocalLLaMA· COMMUNITY

Got MTP + TurboQuant running — Qwen3.6-27B -- 80+ t/s at 262K context on a single RTX 4090

Developer achieves 80+ tokens/sec on Qwen 3.6-27B with MTP + TurboQuant quantization on single RTX 4090 at 262K context.

u/indrasmirror·2 months ago·46 pts / 44 comm

r/OpenAI· COMMUNITY

UMichigan had an early $20M OpenAI stake that could yield billions

University of Michigan's early $20M OpenAI investment may be worth billions if company IPOs, but lacks technical or strategic AI news.

u/businessinsider·2 months ago·91 pts / 12 comm

Simon Willison· ANALYST

Using Claude Code: The Unreasonable Effectiveness of HTML

Thariq Shihipar (Anthropic Claude Code team) demonstrates HTML over Markdown for Claude artifact output, with practical examples for code review and technical documentation.

Simon Willison·2 months ago

r/LocalLLaMA· COMMUNITY

new MoE from ai2, EMO

AI2 releases EMO, a 14B-parameter MoE model with 1B active parameters trained on 1T tokens, featuring document-level routing that clusters experts by semantic domain rather than surface patterns.

u/ghostderp·2 months ago·72 pts / 10 comm

Ars Technica AI· PRESS

Sony says "efficient" AI tools will lead to even more games flooding the market

But human artists still "must remain at the center," PlayStation maker says.

Kyle Orland ·2 months ago

r/singularity· COMMUNITY

Claude:

Anthropic releases Claude integration for Microsoft Office (Excel, PowerPoint, Word GA; Outlook beta) with cross-app context persistence.

u/policyweb·2 months ago·225 pts / 20 comm

← Front Page30 stories

← Newer Older →