The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

If you are also sick of renaming your chats like me

Reddit user reports Claude responds to requests to auto-name conversations, a minor UX workaround for chat organization.

u/SuccessfulTonight391·1 month ago·20 pts / 20 comm

r/Anthropic· COMMUNITY

Sonnet 4.6 incapable of writing different styles/attuning to style guides

I’ve used Claude for a long time and created various style guides with specific tones/voice and structure. Sonnet 4.6 can follow structure but that’s where it begins and ends. Claude models have always been able to emulate different styles of writing but with sonnet 4.6 it can no longer do that…what’s going on? Can any of the models emulate different kinds of writing styles anymore? Sonnet 4.5 can…

u/alwaysstaycuriouss·1 month ago·10 pts / 4 comm

r/LocalLLaMA· COMMUNITY

Computer build using Intel Optane Persistent Memory - Can run 1 trillion parameter model at over 4 tokens/sec

Builder demonstrates 1T parameter Kimi K2.5 inference at 4 tokens/sec using Intel Optane Persistent Memory on commodity hardware.

u/APFrisco·1 month ago·66 pts / 10 comm

Simon Willison· ANALYST

Quoting James Shore

James Shore argues that AI coding agents must cut maintenance costs in inverse proportion to their productivity gains or risk long-term debt: doubling output without halving maintenance costs yields a net negative ROI.

Simon Willison·1 month ago

r/OpenAI· COMMUNITY

i cannot go back to claude now

Reddit user expresses preference for OpenAI over Anthropic Claude; anecdotal product comparison without technical detail.

u/snafu_2020·1 month ago·157 pts / 11 comm

NVIDIA Dev Blog· INFRA

Introducing NVIDIA Fleet Intelligence for Real-Time GPU Fleet Visibility and Optimization

The compute capability of large GPU fleets presents unprecedented opportunities to innovate and provide value to customers in record time. Yet these advancements come with a variety of challenges. At scale, teams are juggling heterogeneous hardware, fast-moving software stacks, tight power envelopes, and spiky, multitenant workloads. A single hotspot, misconfigured driver, or subtle hardware fault…

Christian Shrauder·1 month ago

Simon Willison· ANALYST

Your AI Use Is Breaking My Brain

Jason Koebler argues AI-generated text proliferation creates "Zombie Internet" fatigue, degrading human writing quality and online discourse authenticity.

Simon Willison·1 month ago

r/ClaudeAI· COMMUNITY

I built an app with Claude Code that converts any text into high-quality audio. It works with PDFs, blog posts, Substack and Medium links, and even photos of text.

Developer built text-to-speech mobile app using Claude Code, supporting PDFs, web articles, and image text with privacy focus.

u/OneMoreSuperUser·1 month ago·22 pts / 10 comm

Simon Willison· ANALYST

Using LLM in the shebang line of a script

Simon Willison documents using LLM CLI tool in Unix shebang lines to enable natural-language executable scripts with tool calls and YAML templating.

Simon Willison·1 month ago

ELF: Embedded Language Flows

Variational Inference for Lévy Process-Driven SDEs via Neural Tilting

DECO: Sparse Mixture-of-Experts with Dense-Comparable Performance on End-Side Devices

Quantifying Concentration Phenomena of Mean-Field Transformers in the Low-Temperature Regime

Dynamic Skill Lifecycle Management for Agentic Reinforcement Learning

Optimal and Scalable MAPF via Multi-Marginal Optimal Transport and Schrödinger Bridges

Confidence-Guided Diffusion Augmentation for Enhanced Bangla Compound Character Recognition

Shepherd: A Runtime Substrate Empowering Meta-Agents with a Formalized Execution Trace

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Equivariant Reinforcement Learning for Clifford Quantum Circuit Synthesis

Revisiting Policy Gradients for Restricted Policy Classes: Escaping Myopic Local Optima with $k$-step Policy Gradients

Ban Wave

Engineering Robustness into Personal Agents with the AI Workflow Store

DataMaster: Towards Autonomous Data Engineering for Machine Learning

The interesting BDH question: What if LLM memory lived in the network weights instead of the ever-growing KV cache?

Beyond Red-Teaming: Formal Guarantees of LLM Guardrail Classifiers

RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

V4FinBench: Benchmarking Tabular Foundation Models, LLMs, and Standard Methods on Corporate Bankruptcy Prediction

Will there be any more Qwen3.6 series models?

Three things in AI to watch, according to a Nobel-winning economist

Grounded or Guessing? LVLM Confidence Estimation via Blind-Image Contrastive Ranking