The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

r/LocalLLaMA· COMMUNITY

Deepseek V4 Flash and Non-Flash Out on HuggingFace

DeepSeek V4 Flash and standard V4 weights released on Hugging Face, expanding availability for local deployment.

u/MichaelXie4645·2 months ago·61 pts / 20 comm

r/LocalLLaMA· COMMUNITY

DeepSeek V4 Flash & Pro Now out on API

DeepSeek V4 Flash and Pro models now available via API, enabling developers to run open-weights inference at scale.

u/bigboyparpa·2 months ago·81 pts / 17 comm

r/ClaudeAI· COMMUNITY

I vibe-coded GTA: Google Earth over the weekend

Developer built browser-based GTA game using Claude, Cesium, and real Google Earth/OSM data; entertainment demo, limited AI architecture insight.

u/TrueEstablishment630·2 months ago·34 pts / 12 comm

Simon Willison· ANALYST

russellromney/honker

russellromney/honker adds Postgres NOTIFY/LISTEN and Kafka-style queue semantics to SQLite via Rust extension.

Simon Willison·2 months ago

Simon Willison· ANALYST

An update on recent Claude Code quality reports

Anthropic postmortem: three Claude Code harness bugs, not model quality, caused two-month regression in output quality.

Simon Willison·2 months ago

r/OpenAI· COMMUNITY

thoughts on GPT 5.5

Reddit discussion of GPT 5.5 with no technical details, benchmarks, or substantive analysis.

u/Local-Bison-4392·2 months ago·165 pts / 10 comm

Simon Willison· ANALYST

Serving the For You feed

Bluesky's decentralized feed architecture explained: custom algorithm implementation running on commodity hardware via AT Protocol.

Simon Willison·2 months ago

r/OpenAI· COMMUNITY

5.5 is a really good model for creative writing so far

Reddit user reports GPT-5.5 performs well for creative writing tasks, positioning it as improvement over 5.1.

u/cloudinasty·2 months ago·52 pts / 17 comm

r/LocalLLaMA· COMMUNITY

Hard freakin' decision..Blackwell 96G or Mac Studio 256G

Reddit user seeks hardware purchasing advice between NVIDIA Blackwell and Mac Studio for local LLM inference.

u/HyPyke·2 months ago·45 pts / 131 comm

r/ClaudeAI· COMMUNITY

Tested Claude AI LLM Models' Effort Levels - Low To Max: How Claude Opus 4.7 differs

I benchmarked and compared Claude Opus 4.5 vs Opus 4.6 vs Opus 4.7 vs Sonnet 4.6 testing effort levels from low, medium, high, xhigh, max as curious about token usage/costs and performance within Claude Code https://ai.georgeliu.com/p/tested-claude-ai-llm-models-effort Hope folks find this useful. The test was done with Claude Code v2.1.117 which is apparently the fixed versions from Anthropic's post-mortem announcement.

u/centminmod·2 months ago·26 pts / 6 comm

r/LocalLLaMA· COMMUNITY

Qwen 3.6 27b IQ4_XS - 22 tp/s on RTX 5060TI 16b, 24k ctx

User reports Qwen 3.6 27B quantized model achieves 22 tok/s on RTX 5060 Ti with 24K context window.

u/BazzyIm·2 months ago·40 pts / 20 comm

r/ClaudeAI· COMMUNITY

Claude Pro plan is back to normal, includes Claude Code again. Few!

Anthropic restores Claude Code access to Claude Pro subscribers after temporary removal.

u/py-net·2 months ago·21 pts / 19 comm

r/LocalLLaMA· COMMUNITY

This isn’t X this is Y needs to die

Reddit post complaining about repetitive phrasing in open-weight LLM outputs; suggests retraining to eliminate the pattern.

u/twnznz·2 months ago·45 pts / 26 comm

r/LocalLLaMA· COMMUNITY

Qwen3.6 27B really good?

Reddit user asks whether Qwen3.6 27B matches larger models; community discussion of open-weight model capabilities.

u/Popular-Factor3553·2 months ago·41 pts / 72 comm

Hugging Face· INFRA

DeepSeek-V4: a million-token context that agents can actually use

Hugging Face·2 months ago

r/Anthropic· COMMUNITY

How you guys are managing two Claude Max susbscription on 1 Mac?

Reddit user reports Claude Desktop multi-account setup and discovers shared session storage across isolated instances.

u/Neel_MynO·2 months ago·10 pts / 17 comm

r/OpenAI· COMMUNITY

ChatGPT 5.5 🔥🔥🔥

Reddit post with emoji hype about ChatGPT 5.5; no substantive details or official announcement provided.

u/Dramatic_Method_9554·2 months ago·61 pts / 40 comm

r/LocalLLaMA· COMMUNITY

Compared QWEN 3.6 35B with QWEN 3.6 27B for coding primitives

Reddit user benchmarks Qwen 3.6 35B vs 27B on coding tasks; 35B faster (72 TPS) but less accurate, 27B slower but more precise.

u/gladkos·2 months ago·45 pts / 29 comm

r/OpenAI· COMMUNITY

Excuse me?

Reddit post with no substantive content; insufficient information to assess.

u/time___dance·2 months ago·50 pts / 17 comm

The Verge AI· PRESS

Claude is connecting directly to your personal apps like Spotify, Uber Eats, and TurboTax

Claude users can access more apps with Anthropic's AI now thanks to new connectors for everything from hiking to grocery shopping. Anthropic already supported connecting numerous work-related apps to Claude, like Microsoft apps, but this expansion focuses on personal apps like Audible, Spotify, Uber, AllTrails, TripAdvisor, Instacart, TurboTax, and others. Some of these apps, such as Spotify, already have similar connectors in OpenAI's ChatGPT. Once an app is connected, Claude will suggest relevant connected apps directly in your conversations, like using AllTrails for hike recommendations. A...

Stevie Bonifield·2 months ago

Simon Willison· ANALYST

Extract PDF text in your browser with LiteParse for the web

Simon Willison ports LlamaIndex's LiteParse PDF text extraction tool to run in-browser, using spatial parsing and Tesseract OCR without ML models.

Simon Willison·2 months ago

Ars Technica AI· PRESS

US accuses China of “industrial-scale” AI theft. China says it’s “slander.”

Trump-Xi summit may be rocked by US mulling huge sanctions.

Ashley Belanger ·2 months ago

r/ClaudeAI· COMMUNITY

Moderator questions

Reddit thread asking ClaudeAI subreddit moderator bot about moderation work and concerns about model obsolescence.

u/MooingTree·2 months ago·31 pts / 10 comm

r/OpenAI· COMMUNITY

Open AI got to AGI first!

Reddit user speculates OpenAI reached AGI and will outpace Anthropic; compares Codex and Claude Code features.

u/Extra-Record7881·2 months ago·50 pts / 22 comm

TechCrunch AI· PRESS

Bret Taylor’s Sierra buys YC-backed AI startup Fragment

Sierra, the AI customer service agent startup founded by technologist Bret Taylor, announced today that it has acquired the YC-backed French startup Fragment.

Dominic-Madori Davis·2 months ago

r/OpenAI· COMMUNITY

Common GPT 5.5 pricing misconception.

GPT 5.5 offers better token efficiency than GPT 5.4 despite higher per-token pricing; comparison to Claude Opus 4.7 shows GPT 5.5 5-10x cheaper on ARC-AGI-2.

u/Blake08301·2 months ago·50 pts / 20 comm·+ covered by others

r/ClaudeAI· COMMUNITY

holy shit... i just automated something i thought was impossible with ai : product tutorial videos

User automated product tutorial video generation end-to-end using Claude (script, voiceover, editing, publishing).

u/Mullikaparatha·2 months ago·28 pts / 26 comm

r/Anthropic· COMMUNITY

At this point I would not be shocked

Reddit speculation post with no substantive claim.

u/Saykudan·2 months ago·103 pts / 5 comm

r/singularity· COMMUNITY

GPT 5.5 scores 1.7% on OpenAI-proof Q&A—an internal benchmark testing performance on real ML problems encountered during the process of research and engineering

u/torrid-winnowing·2 months ago·102 pts / 27 comm

NVIDIA Dev Blog· INFRA

Winning a Kaggle Competition with Generative AI–Assisted Coding

In March 2026, three LLM agents generated over 600,000 lines of code, ran 850 experiments, and helped secure a first-place finish in a Kaggle playground... In March 2026, three LLM agents generated over 600,000 lines of code, ran 850 experiments, and helped secure a first-place finish in a Kaggle playground competition. Success in modern machine learning competitions is increasingly defined by how quickly you can generate, test, and iterate on ideas. LLM agents, combined with GPU acceleration, dramatically compress this loop. Historically… Source

Chris Deotte·2 months ago

← Front Page30 stories

← Newer Older →

The Archive

Deepseek V4 Flash and Non-Flash Out on HuggingFace

DeepSeek V4 Flash &amp; Pro Now out on API

I vibe-coded GTA: Google Earth over the weekend

russellromney/honker

An update on recent Claude Code quality reports

thoughts on GPT 5.5

Serving the For You feed

5.5 is a really good model for creative writing so far

Hard freakin' decision..Blackwell 96G or Mac Studio 256G

Tested Claude AI LLM Models' Effort Levels - Low To Max: How Claude Opus 4.7 differs

Qwen 3.6 27b IQ4_XS - 22 tp/s on RTX 5060TI 16b, 24k ctx

Claude Pro plan is back to normal, includes Claude Code again. Few!

This isn’t X this is Y needs to die

Qwen3.6 27B really good?

DeepSeek-V4: a million-token context that agents can actually use

How you guys are managing two Claude Max susbscription on 1 Mac?

ChatGPT 5.5 🔥🔥🔥

Compared QWEN 3.6 35B with QWEN 3.6 27B for coding primitives

Excuse me?

Claude is connecting directly to your personal apps like Spotify, Uber Eats, and TurboTax

Extract PDF text in your browser with LiteParse for the web

US accuses China of “industrial-scale” AI theft. China says it’s “slander.”

Moderator questions

Open AI got to AGI first!

Bret Taylor’s Sierra buys YC-backed AI startup Fragment

Common GPT 5.5 pricing misconception.

holy shit... i just automated something i thought was impossible with ai : product tutorial videos

At this point I would not be shocked

GPT 5.5 scores 1.7% on OpenAI-proof Q&amp;A—an internal benchmark testing performance on real ML problems encountered during the process of research and engineering

Winning a Kaggle Competition with Generative AI–Assisted Coding

DeepSeek V4 Flash & Pro Now out on API

GPT 5.5 scores 1.7% on OpenAI-proof Q&A—an internal benchmark testing performance on real ML problems encountered during the process of research and engineering