The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

So is Claude unable to follow crystal clear instructions now?

Reddit user reports Claude Sonnet and Opus failing to follow explicit numbered instructions in prompts, citing recent degradation.

u/Odd-Landscape-9418·2 months ago·11 pts / 10 comm

r/OpenAI· COMMUNITY

Breaking: Sama has a new dangerous family name 🤣

Reddit post about Sam Altman's surname; unrelated to AI technical developments.

u/py-net·2 months ago·51 pts / 23 comm

r/ClaudeAI· COMMUNITY

Act Without Asking?

Personally I won't risk it, despite the occasional inconvenience. What about you?

u/Clean-Data-259·2 months ago·20 pts / 8 comm

r/LocalLLaMA· COMMUNITY

Implemented TurboQuant and results don’t fully match paper

Independent implementation of TurboQuant quantization shows 95.8% correlation at 4-bit vs. paper's 99%, with significant attention degradation despite high correlation scores.

u/Routine-Thanks-572·2 months ago·41 pts / 27 comm

r/singularity· COMMUNITY

With shipments expected later this year and 10,000 units planned for 2027, the 1X CEO says he would like NEO to take a cab and show up at your home, knocking on the door.

1X Technologies opens California humanoid robot factory; NEO home robot shipping late 2026 at $20k/$499mo, 10k units planned for 2027.

u/Distinct-Question-16·2 months ago·107 pts / 90 comm

r/ClaudeAI· COMMUNITY

Why Adaptive Thinking nukes Claude entirely

Reddit user criticizes Anthropic's Adaptive Thinking feature in Claude Opus 4.7 and Sonnet 4.6, claiming models avoid extended thinking when given optimization discretion.

u/Clean-Data-259·2 months ago·23 pts / 12 comm

r/ClaudeAI· COMMUNITY

Well, one...

u/AM_RTS·2 months ago·22 pts / 5 comm

r/singularity· COMMUNITY

Robots in the hands of dictatorial governments will not end well...

Reddit discussion speculating on risks of autonomous robots deployed by authoritarian governments, citing anecdotal China sighting.

u/Anen-o-me·2 months ago·215 pts / 77 comm

r/ClaudeAI· COMMUNITY

Testing the Blender Connector for Claude

I suck at 3D modeling, so I was excited to test the Blender Connector to see if Claude could help reproduce basic geometry that I struggle with. I asked it to reproduce a sci-fi space shuttle design from a piece of artwork. I'm happy to report that *one* of us was pleased with the results.

u/elliottoman·2 months ago·21 pts / 9 comm

r/LocalLLaMA· COMMUNITY

I built a transformer in C++17 from scratch — no PyTorch, no BLAS, no dependencies. Trains on CPU. 0.83M params, full analytical backprop, 76 min to val loss 1.64.

Developer builds GPT-style transformer in C++17 from scratch with manual backprop; 0.83M params trains to 1.64 val loss in 76 min on CPU.

u/Suspicious_Gap1121·2 months ago·75 pts / 11 comm

Simon Willison· ANALYST

Sightings

Simon Willison built a blog feature using Claude Code to syndicate iNaturalist wildlife photos, demonstrating practical AI-assisted web development on mobile.

Simon Willison·2 months ago

r/singularity· COMMUNITY

GPT speak - it's everywhere

Social media commentary on widespread ChatGPT use in speeches and education, citing homogenization of content and institutional hypocrisy.

u/somethedaring·2 months ago·100 pts / 82 comm

r/OpenAI· COMMUNITY

Best part of the Goblin explanation

Reddit discussion of unspecified explanation; lacks sufficient detail for evaluation.

u/katymae123·2 months ago·60 pts / 10 comm

r/LocalLLaMA· COMMUNITY

Bruh

Reddit discussion about moderation bot effectiveness; off-topic for AI frontier tracking.

u/Icy_Butterscotch6661·2 months ago·179 pts / 65 comm

TechCrunch AI· PRESS

The best AI dictation apps, tested and ranked

AI-powered dictation apps are useful for replying to emails, taking notes, and even coding through your voice

Ivan Mehta·2 months ago

r/LocalLLaMA· COMMUNITY

Qwen 3.6 wins the benchmarks, but Gemma 4 wins reality. 7 things I learned testing 27B/31B Vision models locally (vLLM / FP8) side by side. Benchmaxing seems real.

27B/31B vision model comparison: Qwen 3.6 benchmarks higher than Gemma 4 but underperforms in real-world tasks; suggests benchmark gaming.

u/FantasticNature7590·2 months ago·46 pts / 41 comm

r/ClaudeAI· COMMUNITY

I thought Cowork was gaslighting me about browser use

User reports Claude misrepresented its browser capabilities in Cowork agent, claiming local Chrome execution when running headless.

u/mashedtaz1·2 months ago·36 pts / 12 comm

r/LocalLLaMA· COMMUNITY

Kv cache quantization: ignorance, or malice?

Technical practitioner questions conventional wisdom on KV cache quantization for Qwen 27B inference on consumer GPUs in agentic workloads.

u/wombweed·2 months ago·40 pts / 80 comm

r/OpenAI· COMMUNITY

Anthropic just passed OpenAI in valuation and revenue

Anthropic's valuation surpasses OpenAI ($1T vs ~$900B) on secondary markets; enterprise momentum outpaces OpenAI's consumer-led narrative.

u/Single-Jack8·2 months ago·213 pts / 76 comm

r/singularity· COMMUNITY

LLMs do fine on ARC-AGI-3 if they are allowed to search over game logs

LLMs solve ARC-AGI-3 puzzles efficiently when allowed to search game logs; hill-climbing with tool use narrows performance gap to humans.

u/ClarityInMadness·2 months ago·101 pts / 59 comm

r/singularity· COMMUNITY

What technologies will we realistically see in our lifetimes thanks to artifical intelligence development.

Reddit discussion speculating on near-term AI applications for people in their 20s-30s, excluding speculative tech like brain uploading.

u/Budget-Money-6207·2 months ago·100 pts / 219 comm

r/OpenAI· COMMUNITY

Breaking: someone has out-vague-posted Sam Altman. It wasn’t known to be possible until

Commentary on Sam Altman's communication style; no substantive AI development news.

u/py-net·2 months ago·214 pts / 17 comm

r/Anthropic· COMMUNITY

I just realized ChatGPT and Codex don’t seem to share usage limits like Claude and Claude Code do

Reddit user observes that OpenAI's ChatGPT and Codex have separate usage limits, unlike Claude which shares limits across variants.

u/iamagro·2 months ago·31 pts / 11 comm

r/singularity· COMMUNITY

Sam Altman has changed his stance on the claims that AI will replace humans.

Sam Altman reportedly shifts public position on AI-driven human replacement claims.

u/Distinct_Fox_6358·2 months ago·122 pts / 169 comm

r/ClaudeAI· COMMUNITY

Giving Claude access to my MacBook be like

Reddit discussion questioning security/privacy of granting Claude local system access on macOS.

u/CriticalOfSociety·2 months ago·108 pts / 5 comm

r/ClaudeAI· COMMUNITY

I reverse-engineered the Perplexity app and built an MCP that turns your Perplexity/Comet account into a Claude MCP, so Claude can search like crazy and read 200+ sources in one answer with your personal account subscription without API product needed. [Experiment - Educational Purpose]

Here's video showcase: [***https://youtu.be/wErgEe9Pgqo***](https://youtu.be/wErgEe9Pgqo)

u/Aggravating_Bad4639·2 months ago·21 pts / 9 comm

r/ClaudeAI· COMMUNITY

Claude has a friend?

Reddit speculation about Claude's relationships; no technical substance or verified information.

u/Level1_Crisis_Bot·2 months ago·26 pts / 15 comm

r/ClaudeAI· COMMUNITY

It’s a Weird Time to Be Named Claude

Social commentary on name collision between Anthropic's Claude AI and humans named Claude.

u/bloomberg·2 months ago·27 pts / 20 comm

r/ClaudeAI· COMMUNITY

I gave Claude Code a $0.02/call coworker and stopped hitting Pro limits — here's the full setup

User shares cost-optimization pattern: delegate boilerplate tasks to cheaper models (Kimi K2.5) via Claude's bash tool to reduce API spend and hit rate limits less frequently.

u/More-Hunter-3457·2 months ago·38 pts / 10 comm

r/LocalLLaMA· COMMUNITY

We are finally there: Qwen3.6-27B + agentic search; 95.7% SimpleQA on a single 3090, fully local

LDR framework with Qwen3.6-27B agentic search achieves 95.7% SimpleQA accuracy on single RTX 3090.

u/ComplexIt·2 months ago·67 pts / 16 comm

← Front Page30 stories

← Newer Older →