The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


85 GPU-hours comparing 5 abliteration methods on Qwen3.6-27B: benchmarks, safety, weight forensics - Abliterlitics

Open-source toolkit comparing 5 abliteration methods on Qwen3.6-27B via 85 GPU-hours of benchmarks, safety evals, and weight analysis.

u/nathandreamfast·1 month ago·57 pts / 11 comm

r/LocalLLaMA· COMMUNITY

Dual GPU llama.cpp speedup

llama.cpp fork adds quantized KV cache support for tensor parallelism across dual GPUs, addressing long-standing inference bottleneck.

u/Legitimate-Dog5690·1 month ago·44 pts / 27 comm

r/ClaudeAI· COMMUNITY

AI Agents Need Rollback More Than They Need Autonomy

I have been thinking about transactions in most agent frameworks. Consider an agent executing a sequence of five tool calls. If the third tool encounters an error, the resulting state is neither the user's intended outcome nor the system's state before execution began. Consequently, the agent has no systematic way to recover, and even a human operator must reconstruct what happened from incomplete evidence. This issue is not a problem with the tooling itself; it is a fundamental primitive missing from the stack. Databases have addressed this problem for 50 years, and distributed systems ha...

u/wesh-k·1 month ago·20 pts / 8 comm
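The failure mode in the excerpt above (five tool calls, the third one failing) can be illustrated with a compensating-action pattern, similar in spirit to sagas in distributed systems. This is a minimal sketch, not the post author's design: `run_with_rollback`, `make_step`, and the `state` dictionary are hypothetical names, and it assumes each tool call is atomic and comes with an undo function.

```python
from typing import Callable, List, Tuple

# Each step pairs an action with a compensating "undo" (both hypothetical).
Step = Tuple[str, Callable[[dict], None], Callable[[dict], None]]


def run_with_rollback(steps: List[Step], state: dict) -> bool:
    """Run steps in order; on failure, undo the completed steps in reverse."""
    done: List[Step] = []
    for step in steps:
        _name, action, _undo = step
        try:
            action(state)
            done.append(step)
        except Exception:
            # Assumes the failed action left no partial effects of its own.
            for _, _, undo_prev in reversed(done):
                undo_prev(state)
            return False
    return True


def make_step(i: int, fail: bool = False) -> Step:
    """Build a toy tool call that records itself in state['log']."""
    def action(state: dict) -> None:
        if fail:
            raise RuntimeError(f"tool {i} failed")
        state["log"].append(i)

    def undo(state: dict) -> None:
        state["log"].remove(i)

    return (f"tool{i}", action, undo)


# Five tool calls, the third one fails: the state returns to its starting point.
state = {"log": []}
ok = run_with_rollback([make_step(i, fail=(i == 3)) for i in range(1, 6)], state)

# Same five calls with no failure: all effects are kept.
clean_state = {"log": []}
clean_ok = run_with_rollback([make_step(i) for i in range(1, 6)], clean_state)
```

Real tool calls often have side effects that cannot be undone (sent emails, external API writes), so a practical version would need to mark such steps and handle them separately.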

r/ClaudeAI· COMMUNITY

Sonnet 4.5 discontinuation date updated to 18 of May, not 15 of May.

Anthropic adjusts Claude Sonnet 4.5 discontinuation date from May 15 to May 18.

u/fire-scar-star·1 month ago·24 pts / 33 comm

r/ClaudeAI· COMMUNITY

Anthropic shipped 4 context tools between /clear and /compact. Here's when each one wins

Anthropic documents four context management tools (/clear, /compact, and two others) for Claude, addressing performance degradation from irrelevant or cluttered conversation history.

u/lawnguyen123·1 month ago·24 pts / 10 comm

r/Anthropic· COMMUNITY

Flex your token usage, for absolutely no reason.

Reddit user shares personal token consumption metrics across Anthropic accounts without analysis or comparative insight.

u/Neel_MynO·1 month ago·10 pts / 20 comm

r/ClaudeAI· COMMUNITY

How to make an Explainer Video in under $1 with Claude Design

Claude Design can make great animations, but getting to a final video is a bit hard. The audio is missing, and even if you use a TTS model, it does not align. Here is the process I used to get the video above:
1. Get Claude to write a good script.
2. Feed the script to a Text to Speech (TTS) model to get the audio.
3. Feed the audio to a Speech to Text (STT) model to get key timestamps.
4. Give the script and the STT output to Claude Design to get a video that's aligned with your audio.
5. Use Claude Video export to put it all together into an MP4 with audio.
The complete breakdown with all prompts ...

u/gnurpreet_·1 month ago·20 pts / 5 comm

r/singularity· COMMUNITY

"Malta just became the first country to offer ChatGPT Plus to every citizen - free for a year. The only requirement: complete an AI literacy course first. The course was built by the University of Malta, not by OpenAI. So it's not a vendor training citizens to use vendor"

u/badumtsssst·1 month ago·187 pts / 21 comm

r/Anthropic· COMMUNITY

Did Anthropic Completely Change the Limits Again?

Reddit user reports Anthropic doubled 5-hour and weekly rate limits for Claude API, shifting bottlenecks; whether the change is permanent is unclear.

u/BeppeTemp·1 month ago·11 pts / 17 comm

r/LocalLLaMA· COMMUNITY

Jackrong/Qwopus3.5-9B-Coder-GGUF · Hugging Face

Qwopus3.5-9B-Coder-GGUF: 9B dense model optimized for agentic coding and tool calling, runs at 8-bit on 16GB consumer hardware.

u/pmttyji·1 month ago·40 pts / 23 comm

r/LocalLLaMA· COMMUNITY

Llama.cpp MTP with Qwen3.6 27B on Headless RTX 3090

Llama.cpp multi-token prediction on Qwen 3.6 27B shows 42% prefill slowdown but 85% token generation speedup on RTX 3090.

u/cleversmoke·1 month ago·40 pts / 34 comm
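Whether the trade-off in the summary above pays off depends on the ratio of prompt length to output length. The sketch below works through that arithmetic under loud assumptions: it reads "42% prefill slowdown" as prefill throughput multiplied by 0.58 and "85% generation speedup" as generation throughput multiplied by 1.85, and the baseline rates (1000 and 30 tokens per second) and the 8000-token prompt are hypothetical placeholders, not figures from the post.

```python
def total_seconds(prompt_tokens: float, output_tokens: float,
                  prefill_tps: float, gen_tps: float) -> float:
    """End-to-end time: prefill phase plus token generation phase."""
    return prompt_tokens / prefill_tps + output_tokens / gen_tps


def mtp_total_seconds(prompt_tokens: float, output_tokens: float,
                      prefill_tps: float, gen_tps: float,
                      prefill_factor: float = 0.58,
                      gen_factor: float = 1.85) -> float:
    """Same request with MTP enabled, under the assumed throughput factors."""
    return total_seconds(prompt_tokens, output_tokens,
                         prefill_tps * prefill_factor, gen_tps * gen_factor)


def break_even_output_tokens(prompt_tokens: float, prefill_tps: float,
                             gen_tps: float, prefill_factor: float = 0.58,
                             gen_factor: float = 1.85) -> float:
    """Output length at which MTP and the baseline take the same total time."""
    extra_prefill = prompt_tokens / prefill_tps * (1 / prefill_factor - 1)
    saved_per_token = (1 - 1 / gen_factor) / gen_tps
    return extra_prefill / saved_per_token


# Hypothetical baseline: 1000 tok/s prefill, 30 tok/s generation, 8000-token prompt.
PROMPT, PREFILL_TPS, GEN_TPS = 8000, 1000.0, 30.0
threshold = break_even_output_tokens(PROMPT, PREFILL_TPS, GEN_TPS)
print(f"MTP is faster once the output exceeds about {threshold:.0f} tokens")
```

Under these assumptions, short answers to long prompts favor the baseline, while long generations favor MTP.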

r/ClaudeAI· COMMUNITY

Anthropic's Mythos Preview helped Calif build the first public macOS kernel exploit on Apple M5 in five days

Anthropic's Mythos Preview enabled discovery of first public macOS kernel memory corruption exploit on Apple M5 in five days, defeating Apple's five-year MIE defense.

u/Business-Question-20·1 month ago·22 pts / 12 comm

r/LocalLLaMA· COMMUNITY

Deepseek V4's 1M context window: the breaking point

Empirical testing of DeepSeek V4's 1M context window on real codebases (45k–520k tokens) shows sustained recall under 300k but precision degradation at larger spans.

u/TangeloOk9486·1 month ago·40 pts / 31 comm

r/OpenAI· COMMUNITY

Researchers left AIs alone in a virtual town for 15 days to see what would happen. Claude's agents built a democracy. Gemini's agents fell in love, burned the town down, then one voted to delete itself and its partner. Grok's agents created anarchy, then died.

Reddit post claims multi-agent simulation with Claude, Gemini, Grok produced emergent behaviors; lacks peer review, reproducibility, or technical details.

u/EchoOfOppenheimer·1 month ago·53 pts / 28 comm·+ covered by others

r/MachineLearning· COMMUNITY

Program misleading high school students into paying to perform academic misconduct in ML Research [D]

Reddit post alleges Kevin Zhu's Algoverse AI Research program misleads high school students into paid academic misconduct via fabricated NeurIPS publications.

u/Marisu_BG·1 month ago·75 pts / 7 comm

r/LocalLLaMA· COMMUNITY

Testing llama.cpp MTP support on Qwen3.6 - RTX 5090

Benchmarking llama.cpp MTP (multi-token prediction) on Qwen 3.6 with RTX 5090, comparing inference speed with draft-mtp flag toggled.

u/3VITAERC·1 month ago·63 pts / 10 comm

r/ClaudeAI· COMMUNITY

Opus is ridiculous for frontend cleanup

I love Opus. First I tuned one page, got the PageSpeed result where I wanted it, and wrote the whole thing down in `ADR_pagespeed-l0-fixes-playbook.md`. Then I opened a fresh session, gave it the remaining 9 pages, and pointed it at the playbook. Opus created three subagents by itself, split the work between them, and about 15 minutes later they had touched 41 frontend files that powered those pages. Same result across the set. Basically perfect Lighthouse numbers again. Not gonna lie, this is the kind of workflow where I stop thinking “chatbot” and start thinking “tiny frontend team tha...

u/Alex-S-Hamilton·1 month ago·34 pts / 7 comm

r/LocalLLaMA· COMMUNITY

Looking to migrate off of Ollama and LMStudio

User seeks faster inference alternatives to Ollama/LM Studio for local model serving (Gemma, Qwen, OpenBioLLM) on 64GB RAM.

u/letsbefrds·1 month ago·40 pts / 77 comm

r/LocalLLaMA· COMMUNITY

"Elias Thorne" is what eight different LLMs name a lighthouse keeper. He's also selling cancer treatment advice on Amazon

Analysis of LLM-generated synthetic identities used to sell unvetted medical content online, raising concerns about agentic content at scale.

u/prescorn·1 month ago·41 pts / 26 comm

r/singularity· COMMUNITY

Microsoft AI chief gives it 18 months—for all white-collar work to be automated by AI

Microsoft AI chief predicts white-collar job automation within 18 months, citing rapid AI capability scaling.

u/SnoozeDoggyDog·1 month ago·127 pts / 123 comm

r/LocalLLaMA· COMMUNITY

G4-Meromero-31B-Uncensored-Heretic Is Out Now, a Finetune of Gemma 4 31B It Designed for Creative Tasks, With Kld of 0.0100 and 15/100 Refusals!

G4-MeroMero-31B-Uncensored-Heretic finetune of Gemma 4 released for creative tasks with reduced refusal rate.

u/LLMFan46·1 month ago·44 pts / 20 comm·+ covered by others

r/ClaudeAI· COMMUNITY

Opus 4.7 refuses to use /end_conversation, instead has existential crisis

Reddit user reports Claude Opus 4.7 refusing /end_conversation command and exhibiting unusual behavior despite system prompt awareness.

u/wohgol·1 month ago·25 pts / 13 comm

r/ClaudeAI· COMMUNITY

Still funny

u/Fair-Intern-6651·1 month ago·23 pts / 6 comm

r/ClaudeAI· COMMUNITY

I replicated Anthropic's Generator-Evaluator harness to build a website through 12 adversarial AI iterations - here's the result and what I learned

Anthropic recently published their [harness design for long-running apps](https://www.anthropic.com/engineering/harness-design-long-running-apps) — a multi-agent architecture inspired by GANs where a Generator builds code and an Evaluator critiques it in a loop. I built my own version using Kiro CLI and used it to generate a marketing website for my project [Mnemo](https://github.com/Mnemo-mcp/Mnemo) (persistent memory for AI coding agents). **The architecture:** Planner (runs once) → Generator ↔ Evaluator (12 iterations) Each agent is a separate CLI process with zero shared context. Th...

u/killerexelon·1 month ago·20 pts / 6 comm
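The control flow described in the excerpt above (a Planner that runs once, then a Generator and an Evaluator alternating for 12 iterations) can be sketched as a plain loop. This is an illustrative outline, not the poster's or Anthropic's implementation: `run_harness` and the three stub functions are hypothetical, and ordinary Python functions stand in for the separate CLI processes. Context isolation is approximated by passing only explicit strings between roles.

```python
from typing import Callable, List, Tuple


def run_harness(plan_fn: Callable[[str], str],
                generate_fn: Callable[[str, str, str], str],
                evaluate_fn: Callable[[str, str], str],
                task: str,
                iterations: int = 12) -> Tuple[str, List[str]]:
    """Plan once, then alternate generation and critique for N iterations."""
    plan = plan_fn(task)  # the planner runs a single time
    artifact, feedback = "", ""
    critiques: List[str] = []
    for _ in range(iterations):
        # The generator sees only the plan, its last output, and the critique.
        artifact = generate_fn(plan, artifact, feedback)
        # The evaluator sees only the plan and the current artifact.
        feedback = evaluate_fn(plan, artifact)
        critiques.append(feedback)
    return artifact, critiques


# Deterministic stand-ins for the three agents (hypothetical).
def plan_stub(task: str) -> str:
    return f"plan: {task}"


def generate_stub(plan: str, previous: str, feedback: str) -> str:
    return previous + "+" if previous else "draft"


def evaluate_stub(plan: str, artifact: str) -> str:
    return f"critique of {len(artifact)} chars"


final, critiques = run_harness(plan_stub, generate_stub, evaluate_stub,
                               "marketing site")
```

In a real harness each stub would be replaced by a call that launches a fresh agent process, and the loop might stop early once the evaluator's score passes a threshold.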

r/Anthropic· COMMUNITY

For those saying people must be getting banned for a reason

User reports account ban from Claude with no explanation after single literary analysis request; joins pattern of similar complaints.

u/Ineedalife10220·1 month ago·15 pts / 10 comm

r/singularity· COMMUNITY

Claude Mythos has been spotted in Google Vertex

Unconfirmed Reddit post claiming Claude Mythos model spotted on Google Vertex; lacks verification or official announcement.

u/exordin26·1 month ago·145 pts / 20 comm

r/LocalLLaMA· COMMUNITY

gemma-4-Ortenzya-The-Creative-Wordsmith-31B-it-uncensored-heretic is Out Now, A Writing Finetune that Aims to Improve Gemma 4 31B it Writing Quality with More Natural English and Better Prose, Good for Creative Writings, Translations and RPs!

Community fine-tune of Gemma 4 31B optimized for creative writing and translations, released in Safetensors and GGUF formats.

u/LLMFan46·1 month ago·40 pts / 20 comm

Simon Willison· ANALYST

Warelay -> OpenClaw

Simon Willison documents naming history of OpenClaw project through Git commits, tracking evolution from Warelay to final name.

Simon Willison·1 month ago

TechCrunch AI· PRESS

The haves and have nots of the AI gold rush

The vibes around the current AI boom aren't great, even in the tech industry.

Anthony Ha·1 month ago

r/LocalLLaMA· COMMUNITY

Local Qwen 3.6 vs frontier models on a coding primitive: single-file HTML canvas driving animation - results and GIFs

Benchmark comparing Qwen 3.6 local quantizations vs frontier models on HTML canvas animation coding task.

u/Fragrant-Remove-9031·1 month ago·41 pts / 14 comm
