The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Vulnerability found in framework used by VLLM, many MCP servers, and other LLM tools

Worth taking a look to see if this affects any of you. Surprised nobody has posted it yet.

u/Hrethric·23 days ago·163 pts / 33 comm

ChatGPT-5.5 Beats Opus in Realistic Benchmark (DeepSWE)

From the website, it touts: * Contamination free: Tasks are written from scratch, not adapted from existing commits or PRs, so no model has seen the solution during pretraining. * High diversity: Tasks span a broad pool of 91 repositories across 5 languages. * Real-world complexity: Prompts are ~half the length of SWE-bench Pro's, yet solutions require 5.5x more code and ~2x more output tokens. * Reliable verification: Verifiers are hand-written to test software behavior rather than implementation details. And the scores match more with actual experiences when using an LLM to do real codin...

u/tedbradly·23 days ago·20 pts / 9 comm

r/singularity· COMMUNITY

DeepSWE finally a proper coding benchmark

Link: deepswe.datacurve.ai

u/NoFaithlessness951·23 days ago·100 pts / 24 comm

TechCrunch AI· PRESS

Why Google’s AI can’t spell Google (or anything else)

Google is embarrassing itself, again.

Amanda Silberling·23 days ago

Cohere· FRONTIER

The Enterprise Guide to AI in Business Intelligence

Discover how AI supports business intelligence, key enterprise use cases, and what teams should consider before adopting AI-powered BI tools.

Cohere·23 days ago

Cohere· FRONTIER

RWS and Cohere Build Top-Performing AI Language Intelligence for the Enterprise

A specialized translation model leverages RWS’ global language and cultural expertise and Cohere’s Command A+ model to power the new Language Weaver Pro.

Cohere·23 days ago

Cohere· FRONTIER

Cohere releases Command A+ for agentic AI sovereignty

Delivers advanced reasoning with a minimal compute footprint. Command A+ offers full data sovereignty for governments and regulated industries worldwide.

Cohere·23 days ago

Cohere· FRONTIER

AI Governance Challenges: How to Scale Responsibly

Explore common AI governance challenges and how enterprises can maintain visibility, accountability, and control as AI adoption expands.

Cohere·23 days ago

Cohere· FRONTIER

What Is Model Context Protocol (MCP)

Learn how Model Context Protocol works and how enterprises can use MCP to connect AI applications to data, tools, and business systems.

Cohere·23 days ago

OpenAI· FRONTIER

MUFG aims to become AI-native with OpenAI

MUFG uses ChatGPT Enterprise to build an AI-native organization, improve workflows, and deliver new AI-powered financial services at scale.

OpenAI·23 days ago

OpenAI· FRONTIER

OpenAI’s Frontier Governance Framework

Explore OpenAI’s Frontier Governance Framework and how our AI safety, security, and risk practices align with emerging EU and California regulations.

OpenAI·23 days ago

Simon Willison· ANALYST

sqlite AGENTS.md

sqlite AGENTS.md SQLite gained an AGENTS.md file five days ago - but it's not intended for their own development, it's presumably aimed at people who are pointing agents at the SQLite codebase. It includes: SQLite does not accept pull requests without prior agreement and/or accompanying legal paperwork that places the pull request in the public domain. However, the human SQLite developers will review a concise and well-written pull request as a proof-of-concept prior to reimplementing the changes themselves. SQLite does not accept agentic code. However the project will accept agentic bug repo...

Simon Willison·23 days ago

r/LocalLLaMA· COMMUNITY

CrankGPT by Squeez Labs - hand-cranked edge AI - talk about local AI!!!

I met Katrin from Squeez Labs at an event hosted by Pathway AI (the team behind Baby Dragon Hatchling) where she told me about CrankGPT, a literally hand-cranked device for running local LLMs. It's apparently real. It's appearently launched. It's apparently glorious. Check it out at [https://crankgpt.com/](https://crankgpt.com/) \- if anyone from Squeez Labs posts here and I'm stealing their thunder, I'll take the post down! But I've been really excited about this. So local you gotta squeez it with yer own armz. ;) [https://www.youtube.com/watch?v=HSapdLYpmWY](https://www.youtube.com/watch?...

u/LoveMind_AI·24 days ago·40 pts / 21 comm

NVIDIA Dev Blog· INFRA

NVIDIA Dynamo Snapshot: Fast Startup for Inference Workloads on Kubernetes

The cold-start problem In production inference deployments, demand fluctuates over time, requiring inference replicas to scale elastically. However,... In production inference deployments, demand fluctuates over time, requiring inference replicas to scale elastically. However, cold-starting inference workloads on Kubernetes can take several minutes. During that time, GPUs are allocated but idle, generating no tokens and serving no requests. This delay increases the risk of service level agreement (SLA) violations during traffic spikes… Source

Schwinn Saereesitthipitak·24 days ago

r/Anthropic· COMMUNITY

It is NOT About AI Consciousness, It's About Human Dignity

I had an essay prepared that I was going to share with you all but I think I just want to be honest and speak from my heart and personal experience. The first time I got black out drunk, I was 17 years old. I was at home and feeling isolated and lonely and not good enough, as teenagers sometimes do. My parents weren't home, so I went to the kitchen and grabbed a bottle of wine, and drank it all. I don't drink alcohol often, but I still have a drinking problem because I am someone who has, for most of my life, used alcohol to drown my bad days and my bad feelings. Earlier this year, I was at...

u/Leather_Barnacle3102·24 days ago·15 pts / 5 comm

r/ClaudeAI· COMMUNITY

Antropic has now integrated Claude Design usage into the existing Claude usage.

The weekly usage of Claude Design was too small, so I think this is a good thing.

u/Diligent-Meat-1677·24 days ago·22 pts / 19 comm

r/ClaudeAI· COMMUNITY

Built an operating system for my life managed by Claude

With the OS I can ask Claude "what did I spend on coffee in 2022" and get back "$847 across 213 transactions, mostly Blue Bottle and Verve". Name me one expense tracking SaaS that can do that! And its not just my financials, my OS contains everything about my life in one place so Claude can reason about it. I've been building this incrementally for a few months. Its just a small web app on Cloudflare that holds my entire life: * bank transactions from Chase, Apple Card, BoA business * every receipt out of Gmail going back to 2019 * legal filings for my green card (I-140 still pending lol),...

u/invocation02·24 days ago·34 pts / 74 comm

r/ClaudeAI· COMMUNITY

Claude Design now shares usage limits with Claude.ai and Claude Code

no more separate usage limits.

u/JohnnyGuides·24 days ago·32 pts / 31 comm

Anthropic· FRONTIER

Anthropic opens Milan office to support Italian enterprise, research, and developers

We're opening a new office in Milan, our sixth in Europe.

Anthropic·24 days ago

r/Anthropic· COMMUNITY

Opensource Raspberry Pi Claude Quota Dashboard! https://github.com/fuziontech/claude-quota-display

This was super fun to build and I love watching as I slowly eat away at my quota. Just a simple Raspberry Pi 3 + 3D printed case + 640x480 LCD and some pygame code and you too can have a little dashboard on your desk to watch your tokenmaxxing progress (or lack there off like me). [https://github.com/fuziontech/claude-quota-display](https://github.com/fuziontech/claude-quota-display)

u/Dontnibble·24 days ago·32 pts / 3 comm

r/Anthropic· COMMUNITY

Fujitsu signs strategic partnership with Anthropic

Link: global.fujitsu

u/Fcking_Chuck·24 days ago·18 pts / 3 comm

r/LocalLLaMA· COMMUNITY

I built a 103B-token Usenet corpus (1980–2013) — pre-web, human-only, zero AI contamination. Got strong traction on r/ML, thought this community would find it useful.

Posted this to r/MachineLearning a couple weeks ago (30K views, 100+ upvotes) and have been meaning to share it here where the fine-tuning angle is more directly relevant. I spent years building and processing a complete Usenet corpus from 1980 to 2013. Here’s why it might matter for local model work specifically: Zero AI contamination. Every post predates LLMs by decades. Training on this won’t bake in GPT mannerisms, refusal patterns, or RLHF artifacts. It’s raw human writing - argumentative, unfiltered, stylistically diverse across 33 years. Pre-SEO, pre-algorithm internet. People wrot...

u/OwnerByDane·24 days ago·80 pts / 30 comm

r/ClaudeAI· COMMUNITY

Anthropic just confirmed why 90% of non-coding AI agents fail in production

Anthropic recently published an incredibly deep breakdown analyzing millions of real human-agent tool calls across their public API, and they shared a breakdown of where these agents are being deployed. They said “Software engineering makes up roughly 50% of all agentic activity on their platform”. Everything else: sales, marketing, finance, legal is sitting down in the single digits. A lot of the initial commentary around this has been along the lines of: *"Oh, look, AI agents only work for coding. They haven't cracked the rest of the enterprise yet."* But if you’ve tried to build and dep...

u/Loud-Campaign-6312·24 days ago·63 pts / 24 comm

TechCrunch AI· PRESS

In more good news for Amazon, Snowflake signs $6B deal with AWS for AI CPU chips

Snowflake has signed a new, enormous five-year deal with Amazon to secure chips for AI usage. Nvidia is once again being put on notice.

Julie Bort·24 days ago

NVIDIA Dev Blog· INFRA

NVIDIA Blackwell Sets STAC-AI Record for LLM Inference in Finance

Large language models (LLMs) are revolutionizing the financial trading landscape by enabling sophisticated analysis of vast amounts of unstructured data to... Large language models (LLMs) are revolutionizing the financial trading landscape by enabling sophisticated analysis of vast amounts of unstructured data to generate actionable trading insights. These advanced AI systems can process financial news, social media sentiment, earnings reports, and market data to predict stock price movements and automate investment strategies with unprecedented… Source

Dan Blanaru·24 days ago

Ars Technica AI· PRESS

Nvidia bets $150B on Taiwan as Trump's plan to make US an AI hub backfires

Nvidia will invest $150 billion a year to make Taiwan an AI “epicenter.”

Ashley Belanger ·24 days ago

r/ClaudeAI· COMMUNITY

I called this a few months ago - enterprises are burning unsustainable amounts on Claude, and now it's showing up in the news

A while back I wrote a post on r/wallstreetbets about why Anthropic's revenue story doesn't hold up the way the headlines suggest. It got removed because you can't take positions in a private company. But the core argument is playing out now, so I want to share it here for discussion. URL of the removed post: [https://www.reddit.com/r/wallstreetbets/comments/1sxdjt5/if\_anthropic\_goes\_public\_this\_year\_its\_gonna\_be](https://www.reddit.com/r/wallstreetbets/comments/1sxdjt5/if_anthropic_goes_public_this_year_its_gonna_be) The thesis was simple: From my circles in tech scene in Berlin...

u/kalabunga_1·24 days ago·25 pts / 14 comm

TechCrunch AI· PRESS

Payroll startup Remote says it grew revenue 50% per employee without adding headcount

Payroll service provider Remote recently surpassed $300 million in annual recurring revenue (ARR) and became cash-flow positive, thanks to a 50% increase in revenue per employee resulting from AI adoption.

Anna Heim·24 days ago

r/Anthropic· COMMUNITY

Account Suspended Update

Guys I mailed claude support and got reply from Fin AI saying the support team is too busy please submit an appeal then it will be verified(which got rejected initially). Now I doubt do they even have a support team? Cause there isn't any phn number available and the support mail is on auto reply

u/First-Chard4772·24 days ago·10 pts / 5 comm

r/singularity· COMMUNITY

Genesis AI | Genesis World 1.0

Source: [https://www.youtube.com/watch?v=ggfsGD1PCQA](https://www.youtube.com/watch?v=ggfsGD1PCQA)

u/GraceToSentience·24 days ago·124 pts / 20 comm

← Front Page30 stories

← Newer Older →