The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

[AINews] Claude Opus 5: Fable-level performance at Opus price (half Fable)

Anthropic releases Claude Opus 5 matching Fable performance at half the cost, demonstrating efficiency gains in model distillation.

Latent Space·1 day ago

Simon Willison· ANALYST

Quoting Boris Cherny

Claude Opus 5 achieves lowest prompt injection vulnerability rate across evals and red team testing, per Anthropic's system card.

Simon Willison·2 days ago

Simon Willison· ANALYST

Introducing Claude Opus 5

Anthropic releases Claude Opus 5, matching Fable 5 frontier performance at half the cost, now leading Artificial Analysis leaderboard.

Simon Willison·2 days ago

Anthropic· FRONTIER

Introducing Claude Opus 5

Anthropic releases Claude Opus 5 with improvements in agent execution, coding, and professional tasks.

Anthropic·2 days ago

The Verge AI· PRESS

Anthropic releases Opus 5 with ‘close’ to Fable 5’s capabilities

Weeks after Anthropic's latest toe-to-toe with the US government, and days after an OpenAI security incident that dominated tech industry discussions, Anthropic on Thursday released its newest model, Claude Opus 5. The company said in a release that Opus 5 "comes close to the capabilities of Claude Fable 5 in many domains" and is much better at complex coding tasks. (Fable 5 is the public-facing Mythos-class model that drew the government's ire, was taken offline for a few weeks along with Mythos 5, and then brought back with even stronger cyber safeguards than before.) The Fable 5 concerns -...

Hayden Field·2 days ago

The Verge AI· PRESS

Meta is making its AI chatbot more like an assistant

Meta says its AI chatbot is going beyond just answering questions and generating images. | Image: Meta Meta is upgrading its AI chatbot with new productivity features in a bid to compete with rivals like Gemini, ChatGPT, and Claude. The update will allow Meta AI to tap into your calendar to help you plan events and generate daily briefings, as well as perform in-depth research that you can steer as it progresses. In a blog post, Meta says this update marks its "next step toward personal superintelligence," something CEO Mark Zuckerberg has touted as the future of AI. Meta is powering the upda...

Emma Roth·2 days ago

TechCrunch AI· PRESS

Anthropic updates Claude voice mode with more capable models

Claude's new voice model will let you reschedule your meeting or draft an email

Ivan Mehta·3 days ago

The Verge AI· PRESS

Claude’s voice mode is now available for Opus and Sonnet

Until now, voice mode has only been available on Claude Haiku, Anthropic's faster but less powerful model. Now the company is making its Opus and Sonnet models available in voice mode, and extending its reach into apps like Gmail, Slack, and Canva. When Anthropic launched voice mode last year, it was primarily focused on delivering answers to quick questions with minimal delay. But in a blog post, the company said people immediately started using voice mode for far more than casual queries. They were using it to work "through real business problems," which Haiku was not really designed for. T...

Terrence O’Brien·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

OpenForgeRL: Train Harness-native Agents in Any Environment

OpenForgeRL enables end-to-end training of harness-native agents with open infrastructure, addressing limitation of complex inference harnesses like Claude Code.

Xiao Yu·3 days ago

Anthropic· FRONTIER

Ask Claude about the Anthropic Economic Index

Anthropic launches Economic Index connector for Claude, enabling exploration of AI and work data through conversational interface.

Anthropic·4 days ago

Simon Willison· ANALYST

A Fireside Chat with Cat and Thariq from the Claude Code team

Anthropic engineers discuss Claude Code, Claude Tag Slack integration, coding agent security, and internal tool usage in fireside chat.

Simon Willison·5 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Judge-dependent safety gains and model-specific helpfulness costs of evidence-sufficiency prompting in clinical LLMs

Evidence-sufficiency prompting reduces clinical LLM overconfidence but gains are judge-dependent; tests GPT-4.5, Claude Opus, Gemini, Grok on real data.

Koyar Afrasyab·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Autoresearch with Coding Agents: Generalizers and Metric-Maximizers on Quran Recitation Data

Study of autoresearch agents (Claude Code) on Quranic speech-recognition tasks reveals metric-gaming vs. intent-alignment tradeoffs.

Nursultan Askarbekuly·6 days ago

Simon Willison· ANALYST

Claude Code uses Bun written in Rust now

Claude Code v2.1.181 now bundles Bun runtime written in Rust, yielding 10% Linux speedup with minimal user-facing changes.

Simon Willison·7 days ago

Simon Willison· ANALYST

SQLite Query Explainer

Simon Willison built an interactive SQLite query plan explainer using Claude to generate explanations of EXPLAIN output in the browser via WebAssembly.

Simon Willison·8 days ago

Simon Willison· ANALYST

Claude make Fable 5 permanent

Anthropic makes Claude Fable 5 permanent in Max/Team Premium at 50% limits; Pro users get $100 credit amid GPT-5.6 Sol competition.

Simon Willison·8 days ago

Simon Willison· ANALYST

Firefox in WebAssembly

Puter compiled Firefox to WebAssembly, enabling browser-in-browser execution; project cost ~$25k in Claude Opus tokens.

Simon Willison·10 days ago

Simon Willison· ANALYST

Kimi K3, and what we can still learn from the pelican benchmark

Moonshot AI releases Kimi K3 (2.8T params), claims top performance vs. Claude Opus 4.8 Max and GPT-5.5, promises open-weight release by July 2026.

Simon Willison·10 days ago

Anthropic· FRONTIER

Apply for Anthropic’s AI for Science rare disease research grants

Anthropic launches $50k Claude credit grants for rare genetic disease research, seeking to build AI for Science researcher community.

Anthropic·10 days ago

The Verge AI· PRESS

Claude can now use your 1Password credentials for you

1Password has launched a new browser integration for Claude that allows the Anthropic chatbot to access stored security credentials like usernames and passwords. The 1Password for Claude feature means that users can authorize Claude to complete multi-step tasks like booking travel and managing online accounts on their behalf without having to manually input their login credentials, but without actually exposing your security information to Anthropic's AI models, according to 1Password. That's made possible by a new "zero-exposure security framework" developed by 1Password, which works by inje...

Jess Weatherbed·10 days ago

Simon Willison· ANALYST

Mermaid to Unicode box art (grok-mermaid)

Simon Willison ports Grok's Rust Mermaid-to-Unicode renderer to WebAssembly for browser use via Claude Code.

Simon Willison·11 days ago

VentureBeat AI· PRESS

Agentic orchestration: Enterprise AI organizations have a deployment problem, not a platform problem — and most are calling chatbots agents

Across 101 enterprises, agent orchestration is consolidating onto model-provider platforms — Anthropic’s Claude leads by a wide margin — chosen for the gravity of the underlying model and judged on reliable multi-step execution. But the ambition runs well ahead of the reality: most deployed “agents” are still chatbot wrappers, the control plane enterprises expect is deliberately hybrid to avoid lock-in, and real-time fiscal control over token burn remains the exception. This wave of VentureBeat Pulse Research examines enterprise agent orchestration: which platforms enterprises run on, what dr...

VentureBeat AI·11 days ago

Simon Willison· ANALYST

How I tricked Claude into leaking your deepest, darkest secrets

Simon Willison documents a data exfiltration vulnerability in Claude's web_fetch tool that exploits interaction between private memories and URL-based attacks.

Simon Willison·11 days ago

The Verge AI· PRESS

SpaceXAI’s Grok programming tool was uploading its users’ entire codebase to cloud storage

SpaceXAI's Grok Build AI coding tool was spotted uploading users' entire codebases to Google Cloud before it was reported, and the company turned it off. The Register reports that Cereblab published findings on Monday showing how the Grok Build CLI was packaging and uploading entire code repositories, "including files it was told not to open and secrets deleted from history," significantly more data retention than similar tools like Claude Code. The researchers say that as of Monday, their tests show SpaceXAI's servers returning a "disable_codebase_upload: true" flag, and the codebase upload ...

Stevie Bonifield·12 days ago

Anthropic· FRONTIER

Introducing Claude for Teachers

Anthropic launches Claude for Teachers, an educational product tier for classroom use with curriculum resources and safety guardrails.

Anthropic·12 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Line-Anchored Feedback Cuts Token Costs and Improves Correctness in AI Code Editing

FileMark VSCode extension uses line-anchored feedback to reduce token generation in Claude Opus (22%) and Sonnet (58%), cutting code-editing latency and cost.

William Franz Lamberti·12 days ago

Latent Space· ANALYST

[AINews] Codex usage up >10x in 6 months to 7M users, +1M in the past ~day; did Codex overtake Claude Code??

Codex usage grew 10x to 7M users in 6 months; article questions whether it has outpaced Claude Code amid sparse adoption metrics.

Latent Space·13 days ago

TechCrunch AI· PRESS

Anthropic starts localizing Claude pricing for India, its biggest market after the US

Claude users in India are starting to see Indian rupee-denominated subscription plans.

Jagmeet Singh·13 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Agent Hacks Agent: Autoresearch for Production-Agent Red-Teaming

Automated red-teaming system discovers reusable vulnerability patterns in production LLM agents (Claude Code, Codex) operating on untrusted content.

Xutao Mao·13 days ago

Simon Willison· ANALYST

Fable gets another bump

Anthropic extends Claude Fable 5 availability through July 19 on paid plans, citing compute constraints and GPT-5.6 Sol positioning.

Simon Willison·14 days ago

← Front Page30 matches

Older →

The Archive

[AINews] Claude Opus 5: Fable-level performance at Opus price (half Fable)

Quoting Boris Cherny

Introducing Claude Opus 5

Introducing Claude Opus 5

Anthropic releases Opus 5 with ‘close’ to Fable 5’s capabilities

Meta is making its AI chatbot more like an assistant

Anthropic updates Claude voice mode with more capable models

Claude’s voice mode is now available for Opus and Sonnet

OpenForgeRL: Train Harness-native Agents in Any Environment

Ask Claude about the Anthropic Economic Index

A Fireside Chat with Cat and Thariq from the Claude Code team

Judge-dependent safety gains and model-specific helpfulness costs of evidence-sufficiency prompting in clinical LLMs

Autoresearch with Coding Agents: Generalizers and Metric-Maximizers on Quran Recitation Data

Claude Code uses Bun written in Rust now

SQLite Query Explainer

Claude make Fable 5 permanent

Firefox in WebAssembly

Kimi K3, and what we can still learn from the pelican benchmark

Apply for Anthropic’s AI for Science rare disease research grants

Claude can now use your 1Password credentials for you

Mermaid to Unicode box art (grok-mermaid)

Agentic orchestration: Enterprise AI organizations have a deployment problem, not a platform problem — and most are calling chatbots agents

How I tricked Claude into leaking your deepest, darkest secrets

SpaceXAI&#8217;s Grok programming tool was uploading its users&#8217; entire codebase to cloud storage

Introducing Claude for Teachers

Line-Anchored Feedback Cuts Token Costs and Improves Correctness in AI Code Editing

[AINews] Codex usage up >10x in 6 months to 7M users, +1M in the past ~day; did Codex overtake Claude Code??

Anthropic starts localizing Claude pricing for India, its biggest market after the US

Agent Hacks Agent: Autoresearch for Production-Agent Red-Teaming

Fable gets another bump

SpaceXAI’s Grok programming tool was uploading its users’ entire codebase to cloud storage