The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


r/singularity· COMMUNITY

Benchmarks in 2024

Reddit discussion on 2024 AI benchmarks without substantive content or specific findings provided.

u/RetiredApostle·2 months ago·115 pts / 20 comm

r/Anthropic· COMMUNITY

How we never heard about this again is a crime.

u/brianjenkins94·2 months ago·14 pts / 3 comm

r/ClaudeAI· COMMUNITY

Why do a lot of programmers and technical people hate AI, vibecoding AI assisted coding?

Reddit discussion on programmer skepticism toward AI-assisted coding, arguing resistance stems from fear of disruption rather than technical merit.

u/Gullible-Angle4206·2 months ago·20 pts / 68 comm

TechCrunch AI· PRESS

SAP bets $1.16B on 18-month-old German AI lab and says yes to NemoClaw

SAP plans to buy German AI startup Prior Labs and invest heavily in it. It is also restricting customers' use of agents to a select few, such as Nvidia's NemoClaw.

Anna Heim·2 months ago

Simon Willison· ANALYST

datasette-referrer-policy 0.1

Simon Willison releases datasette-referrer-policy 0.1 to fix OpenStreetMap tile loading issues in Datasette.

Simon Willison·2 months ago

TechCrunch AI· PRESS

Altara secures $7M to bridge the data gap that’s slowing down physical sciences

Altara’s AI aims to diagnose failures and help speed up R&D by unifying data siloed across spreadsheets and legacy systems.

Marina Temkin·2 months ago

r/ClaudeAI· COMMUNITY

How does Claude (with access to the law) perform compared to law-specific AI systems (like Westlaw/Lexis)? We ran a series of head to head tests

We’re now a couple of years into the AI wave, and it seems like the available legal AI technology has begun splitting down two different tracks: In one direction, there are general purpose AI systems like Claude or ChatGPT; in the other direction you have purpose-built legal AI systems like Westlaw’s AI Deep Research and Lexis Protege. We’re two active litigators (Ding and Duff) who use both Claude and Westlaw regularly. Curious to see how well the various systems perform legal research, we decided to run a series of comparison tests consisting of five prompts across all three systems. We t...

u/deaexmachinae·2 months ago·22 pts / 7 comm

Ars Technica AI· PRESS

OpenAI president forced to read his personal diary entries to jury

Elon Musk argued the journals show the moment when OpenAI abandoned its mission.

Ashley Belanger·2 months ago

r/LocalLLaMA· COMMUNITY

MTP on strix halo with llama.cpp (PR #22673)

User reports successful MTP speculative decoding on AMD Strix Halo (AI Max 395) with llama.cpp achieving 60-80 tok/s on Qwen 3.6B GGUF.

u/Edenar·2 months ago·42 pts / 20 comm

Simon Willison· ANALYST

Our AI started a cafe in Stockholm

Andon Labs deploys AI agent (Mona) to manage cafe operations in Stockholm; illustrates real-world agent failures in inventory and decision-making.

Simon Willison·2 months ago

r/ClaudeAI· COMMUNITY

Spyware?

Reddit user reports suspicious behavior in Claude desktop app; claims Anthropic-signed files involved.

u/Devil694·2 months ago·38 pts / 27 comm

r/LocalLLaMA· COMMUNITY

US and tech firms strike deal to review AI models for national security before public release | Technology

US government and tech firms agree to pre-release AI model review process for national security assessment before public deployment.

u/Merchant_Lawrence·2 months ago·44 pts / 40 comm

The Verge AI· PRESS

Google Home’s Gemini AI can handle more complicated requests

Google Home users can now ask Gemini to complete more complex, multi-step tasks and combine multiple tasks in a single command. Google has updated Gemini for Home to Gemini 3.1, which it says will improve the smart home assistant's ability to interpret and act on requests. The upgrade will also make Gemini for Home better at handling recurring and all-day events and allow users to "move around" upcoming events. Last month, Google also updated Gemini for Home with improvements for understanding natural language and identifying devices correctly. The upgrades follow reports of bugs in Google's ...

Stevie Bonifield·2 months ago

Ars Technica AI· PRESS

Silicon Valley bets $200M on AI data centers floating in the ocean

Panthalassa aims to test floating AI computing nodes in the Pacific in 2026.

Jeremy Hsu·2 months ago

The Verge AI· PRESS

Apple agrees to pay iPhone owners $250 million for not delivering AI Siri

Apple has agreed to pay $250 million to settle a class action lawsuit that accused it of misleading customers about the availability of its Apple Intelligence features. The proposed settlement would apply to people in the US who purchased all models of the iPhone 16 and the iPhone 15 Pro between June 10th, 2024 and March 29th, 2025. The settlement will resolve a 2025 lawsuit, alleging Apple's advertisements created a "clear and reasonable consumer expectation" that Apple Intelligence features would be available with the launch of the iPhone 16. The lawsuit claimed Apple's products "offered a ...

Emma Roth·2 months ago

r/OpenAI· COMMUNITY

This is getting dangerous…

Reddit anecdote about Claude responding to comparative model criticism; no technical substance or novel information.

u/Chance-Address-6180·2 months ago·178 pts / 10 comm

r/LocalLLaMA· COMMUNITY

DeepSeek V4 being 17x cheaper got me to actually measure what I send to cloud vs what I could run locally. the results are stupid.

Developer benchmarked local Qwen 3.6 27B vs cloud models on 150 real coding tasks, finding local matched cloud 97% on 35% of workload, suggesting cost arbitrage opportunity.

u/spencer_kw·2 months ago·41 pts / 17 comm

r/LocalLLaMA· COMMUNITY

I know this isn’t technically an LLM but OmniVoice is FUCKING AMAZING.

Reddit user expresses enthusiasm for OmniVoice, a one-shot voice cloning tool, though the post lacks technical detail or verification.

u/Borkato·2 months ago·40 pts / 19 comm

Latent Space· ANALYST

🔬Doing Vibe Physics — Alex Lupsasca, OpenAI

Alex Lupsasca (OpenAI) details how GPT-5.x generated novel theoretical physics and quantum gravity results.

Latent Space·2 months ago

r/ClaudeAI· COMMUNITY

What do you think?

Reddit discussion post with no substantive content; insufficient information for professional analysis.

u/268allensteve·2 months ago·32 pts / 29 comm

r/LocalLLaMA· COMMUNITY

Why run local? Count the money

User quantifies cost savings from running local Qwen-397B with Hermes agent vs. API pricing: 200M tokens in 5 days ≈ $250 saved at API rates.

u/Badger-Purple·2 months ago·42 pts / 115 comm

TechCrunch AI· PRESS

ASML CEO Christophe Fouquet: No one is coming for us

Christophe Fouquet, who became ASML's CEO in 2024 after more than a decade at the company, sat down with this editor on the rooftop deck of his Beverly Hills hotel Tuesday morning ahead of his appearance at the Milken Institute Global Conference. Dressed in a blue suit and white shirt, he was relaxed — even when the conversation turned to the rivals.

Connie Loizos·2 months ago

The Verge AI· PRESS

Microsoft gives up on Xbox Copilot AI

Xbox is "winding down Copilot on mobile" and "will stop development of Copilot on console," new Xbox CEO Asha Sharma announced on Tuesday. The move follows Sharma's reorganization of the Xbox platform team earlier on Tuesday, which added executives from Microsoft's CoreAI team - where Sharma worked before taking over Xbox - to the Xbox side of the company. Sharma, on X: Xbox needs to move faster, deepen our connection with the community, and address friction for both players and developers. Today, we promoted leaders who helped build Xbox, while also bringing in new voices to help push us for...

Jay Peters·2 months ago

The Verge AI· PRESS

Apple could let you pick a favorite AI model in iOS 27

The next update to Apple's operating systems could allow users to choose their preferred AI model for running Apple Intelligence. According to Bloomberg's Mark Gurman, Apple is planning to allow third-party chatbots to power its AI features system-wide in iOS 27, iPadOS 27, and macOS 27, all expected for this fall. In addition to running Siri, compatible third-party AI models, called "Extensions," will also now be able to run other Apple Intelligence features like Writing Tools and Image Playground. According to Gurman, Apple will also allow users to choose different Siri voices for different...

Stevie Bonifield·2 months ago·+ covered by others

r/ClaudeAI· COMMUNITY

Opus 4.7 has a new favorite word

Reddit observation about a repeated word in Claude Opus 4.7 outputs; informal linguistic pattern-spotting.

u/RatherRoundDonut·2 months ago·22 pts / 12 comm

r/MachineLearning· COMMUNITY

NeurIPS Submission Number [D]

Reddit discussion about NeurIPS submission volume potentially exceeding 40k submissions.

u/StriderKing27·2 months ago·30 pts / 15 comm

r/LocalLLaMA· COMMUNITY

Dense Model Shoot-Off: Gemma 4 31B vs Qwen3.6/5 27B... Result is Slower is Faster.

Benchmark comparison shows Gemma 4 31B trades inference speed for token efficiency vs Qwen 3.6/5 27B; Qwen optimizes for metrics, Gemma for throughput.

u/MiaBchDave·2 months ago·51 pts / 11 comm

r/ClaudeAI· COMMUNITY

I turned Claude into a small claims court (with AI lawyers, a judge, and bribes)

Prompt engineering demo: multi-Claude adversarial roleplay with five lawyer archetypes, persistent case law, and emergent jurisprudence system.

u/etaheri·2 months ago·20 pts / 21 comm

r/ClaudeAI· COMMUNITY

10 things about Claude that took me way too long to figure out

User shares practical tips for Claude usage including system prompt design, file uploads, and critique workflows.

u/VidekVipPro·2 months ago·25 pts / 11 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

A Closed-Form Adaptive-Landmark Kernel for Certified Point-Cloud and Graph Classification

PALACE: kernel method for certified point-cloud/graph classification with adaptive landmarks and cover-theoretic guarantees.

Sushovan Majhi·2 months ago

30 stories
