The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

I benchmarked caveman against the prompt "be brief"

Reddit user benchmarks "caveman" prompt technique against simple "be brief" instruction across 24 dev prompts; finds comparable token/quality tradeoffs.

u/max-t-devv·2 months ago·21 pts / 15 comm

The Verge AI· PRESS

General Motors is adding Gemini to four million cars

General Motors is planning to bring Google's Gemini AI assistant to around four million vehicles across the US. Model year 2022 and newer Cadillac, Chevrolet, Buick, and GMC vehicles with Google built-in will be eligible for the AI upgrade, which will be rolled out via over-the-air software updates for GM's infotainment system "over several months," according to GM's announcement. GM says this update represents "one of the largest deployments of Gemini in the industry," and that "customers will notice an...

Jess Weatherbed·2 months ago

r/LocalLLaMA· COMMUNITY

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally

User reports Qwen 3.6 and Gemma 4 performing expert-level work locally on consumer hardware (RTX 3090), replacing skilled human labor.

u/GodComplecs·2 months ago·47 pts / 15 comm

r/ClaudeAI· COMMUNITY

Claude is my SEO strategist, content engine, and CTO. From 0 to 10,000 active users in 6 weeks, $0 on ads.

Founder describes using Claude for SEO, content strategy, and business operations to grow marketplace from 0 to 10K users in 6 weeks without paid ads.

u/BadMenFinance·2 months ago·57 pts / 17 comm

r/LocalLLaMA· COMMUNITY

Qwen3.6 27B on dual RTX 5060 Ti 16GB with vLLM: ~60 tok/s, 204k context working

Qwen 3.6 27B achieves 60 tok/s throughput and 204k context on dual RTX 5060 Ti 16GB with vLLM.

u/do_u_think_im_spooky·2 months ago·40 pts / 20 comm

r/LocalLLaMA· COMMUNITY

llama.cpp - NVFP4 native support on Blackwell from now - b8967

llama.cpp adds native NVFP4 quantization support for Blackwell GPUs with benchmark results on RTX 5090.

u/mossy_troll_84·2 months ago·40 pts / 32 comm·+ covered by others

r/Anthropic· COMMUNITY

How do i get unbanned

User reports Claude account ban without stated cause; discusses particle accelerators, neural systems, game engine coding.

u/tvtaseiland·2 months ago·12 pts / 24 comm

r/Anthropic· COMMUNITY

Opus 4.7: Are these first signs of model collapse?

I keep getting shocked by how bad the reasoning of Opus 4.7 is. It still seems fine for programming tasks, but when I ask it to advise me about things, it often produces illogical, nonsensical and flatout wrong responses and shows that it didn't understand simple concepts we had just discussed in the conversation. It is so much worse than previous models that I'm wondering whether we might be starting to see signs of model collapse: this term refers to more and more content on the internet being AI generated and how problematic it is to use such content as training data for new models. And ...

u/Flopperhop·2 months ago·13 pts / 5 comm

r/OpenAI· COMMUNITY

"Achieved escape velocity" sounds like a nice way of not saying "recursive self-improvement"

Reddit speculation that OpenAI's "escape velocity" language is a euphemism for recursive self-improvement.

u/EchoOfOppenheimer·2 months ago·52 pts / 21 comm

r/LocalLLaMA· COMMUNITY

DeepSeek has began grayscale testing for DeepSeek with Vision

DeepSeek begins grayscale testing of multimodal vision capabilities for its model.

u/MagicZhang·2 months ago·76 pts / 10 comm

r/singularity· COMMUNITY

Collecting training data for handling packages with a RobotEra L7

RobotEra L7 robot collecting package-handling training data; robotics application in logistics domain.

u/heart-aroni·2 months ago·101 pts / 20 comm

r/singularity· COMMUNITY

OpenAI's Sebastien Bubeck: [LLM] models are able to surpass humans [researchers] and ask [research] questions

Sebastien Bubeck discusses LLM capability in mathematical reasoning and autonomous research in OpenAI podcast episode.

u/Wadingwalter·2 months ago·132 pts / 21 comm

r/LocalLLaMA· COMMUNITY

I stumbled on a Gemma 4 chat template bug for tools and fixed it

Gemma 4 chat template bug identified: JSON Schema `anyOf` patterns render as empty `type` fields, breaking tool calling across inference engines.

u/EntertainmentBroad43·2 months ago·40 pts / 11 comm

r/ClaudeAI· COMMUNITY

How to make a Product Promo Video with Claude Design (Prompts inside)

I just made this product promo video completely with Claude code. Explaining the process here with the prompts. I also have a generic prompt at the bottom that you might want to use. # Step 1: Describe your video in scenes Don’t think in “design.” Think in scenes — like a director giving a shot list to a crew. This is the first prompt I used: Make a slick product intro video for my product https://claudevideoexport.com - Scene 1: Text animation — "How to get MP4 from Claude Design Animation" - Scene 2: Show a small browser window with "Claude Design" open. Pan to the t...

u/gnurpreet_·2 months ago·20 pts / 6 comm

r/LocalLLaMA· COMMUNITY

AMD has invented something that lets you use AI at home! They call it a "computer"

Satirical post mocking AMD hardware marketing; no substantive AI news.

u/9gxa05s8fa8sh·2 months ago·41 pts / 27 comm

r/LocalLLaMA· COMMUNITY

Hipfire dev update: full AMD arch validation incoming (RDNA 1 thru 4, plus Strix Halo and bc250)

Hipfire local LLM dev lab acquiring full AMD GPU stack (RDNA 1-4, Strix Halo) for architecture validation and performance optimization.

u/schuttdev·2 months ago·48 pts / 11 comm

r/OpenAI· COMMUNITY

GPT-6 Confirmed

Reddit post claims GPT-6 confirmation with no substantive details or official source.

u/DigSignificant1419·2 months ago·53 pts / 10 comm

r/LocalLLaMA· COMMUNITY

Deepseek v4 pricing is genuinely silly, did the math and now i am questioning my entire stack

DeepSeek v4-Pro priced at $0.145/M input tokens (35x cheaper than Claude Opus 4.7), with promotional rates reaching 138x cheaper on cached tokens through May.

u/Skid_gates_99·2 months ago·41 pts / 64 comm

OpenAI· FRONTIER

Cybersecurity in the Intelligence Age

OpenAI proposes five-part plan to democratize AI-powered cybersecurity defenses and protect critical infrastructure.

OpenAI·2 months ago

r/singularity· COMMUNITY

Sketch to HTML works now

User reports successfully implementing sketch-to-HTML conversion using GPT-4V image generation and downstream HTML generation.

u/withmagi·2 months ago·101 pts / 27 comm

r/LocalLLaMA· COMMUNITY

100M tokens for $2.65 (Deepseek V4 Pro)

DeepSeek V4 Pro offers 100M tokens for $2.65, dramatically undercutting API pricing across the industry.

u/Danny_Davitoe·2 months ago·43 pts / 26 comm

r/LocalLLaMA· COMMUNITY

Study: 2x+ coding performance of 7B model without touching the coding agent

Study demonstrates 2x coding performance gains on 7B models through prompt/agent optimization without model retraining.

u/9gxa05s8fa8sh·2 months ago·44 pts / 10 comm

r/LocalLLaMA· COMMUNITY

Xiami mimo-v2.5 pro MIT license surpasses Opus 4.5 on arena

Xiaomi's MiMo-V2.5 Pro (MIT license) ranks #9 on the Arena coding leaderboard, outperforming Anthropic's Claude Opus 4.5 (#10).

u/Terminator857·2 months ago·42 pts / 11 comm

r/ClaudeAI· COMMUNITY

Anthropic Joins Blender Development Fund as a Corporate Patron

u/massimo_nyc·2 months ago·21 pts / 8 comm

r/Anthropic· COMMUNITY

Opus 4.7 is somewhere between seriously clueless and stupidly dangerous. The worst frontier model I have used so far in the past 2 years. We were hoping to get at least our 4.6 back but 4.7 with so many critical logical failures mean you have to babysit it all the time. I'm losing hope in Anthropic.

Opus 4.7 on Max effort decided to create a new email template by itself (which is pretty stupid btw) and mass mailed it to the whole database (some emails were repeatedly sent 20x). Before you ask me - yes, CLAUDE.md has the exact rule for that, it's supposed to email the tester before any new email templates are to be used in production. I have created this safety rule a few months ago. I feel like the Opus 4.7 is a huge letdown the way it's been downgraded. If Anthropic is "pushing the boundaries", it's probably only in the meaning of how far they can push the...

u/DrHumorous·2 months ago·17 pts / 5 comm

Latent Space· ANALYST

[AINews] not much happened today

No substantive AI industry developments reported.

Latent Space·2 months ago

r/ClaudeAI· COMMUNITY

Suggestions For Making Claude Less Lazy?

User reports Claude Opus 4.6/4.7 showing reduced effort (avoiding research, providing outdated information, and deflecting tasks) beginning the week of the post.

u/Sad-Ticket5394·2 months ago·21 pts / 34 comm

r/LocalLLaMA· COMMUNITY

Why isn’t LLM reasoning done in vector space instead of natural language?

Reddit discussion questioning why LLMs use language-based chain-of-thought reasoning instead of latent vector space operations for faster, more compressed inference.

u/ZeusZCC·2 months ago·54 pts / 51 comm·+ covered by others

TechCrunch AI· PRESS

At his OpenAI trial, Musk relitigates an old friendship

It's a story Musk has told before -- in interviews and to author Walter Isaacson for his bestselling biography of Musk -- but Tuesday was the first time he said it under oath.

Connie Loizos·2 months ago

r/LocalLLaMA· COMMUNITY

llama.cpp's Preliminary SM120 Native NVFP4 MMQ Is Merged

llama.cpp merged SM120 native NVFP4 quantization support; community released GGUFs for Gemma-4-31B and Nemotron-Cascade models.

u/ggonavyy·2 months ago·42 pts / 19 comm

30 stories