The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Can't believe I got it working! Dual GPU - 48gb VRAM llama-cpp server - R7900 + 7800XT

User configures dual AMD RDNA GPUs (48GB VRAM) with llama-cpp via Vulkan for local inference.

u/Jorlen·30 days ago·42 pts / 39 comm

Claude has no way to navigate long conversations — this is a real productivity killer

Reddit user reports lack of in-session search/navigation in Claude, forcing manual scrolling or context loss in long conversations.

u/Indiranagara·1 month ago·20 pts / 56 comm

r/ClaudeAI· COMMUNITY

If you use the "Get Shit Done" (GSD) AI tool, you need to migrate immediately (Original creator rug-pulled)

The creator of the Get Shit Done npm tool executed a rug pull on the $GSD token; the community forked the project as get-shit-done-redux, and the post urges users to uninstall the original packages immediately.

u/linuxzinho·1 month ago·73 pts / 28 comm

Ars Technica AI· PRESS

US scrambles to stop Internet users re-creating dead pilots’ voices

Workaround flouts law that bans NTSB disclosures of cockpit audio recordings.

Jeremy Hsu·1 month ago

r/singularity· COMMUNITY

Anthropic Co-founder Jack Clark’s recent predictions: AI will help make a Nobel Prize-winning discovery within the next year, bipedal robots doing useful work in 2 years, RSI by end of 2028

Anthropic co-founder Jack Clark predicts that AI will help make a Nobel Prize-winning discovery within 1 year, that bipedal robots will do useful work within 2 years, and that recursive self-improvement (RSI) will arrive by the end of 2028.

u/socoolandawesome·1 month ago·115 pts / 80 comm

r/ClaudeAI· COMMUNITY

Appropriate use of ai...

u/pakprotector·1 month ago·24 pts / 6 comm

Anthropic· FRONTIER

Project Glasswing: An initial update

Anthropic shares initial findings from Project Glasswing.

Anthropic·1 month ago

Google AI (Gemma)· FRONTIER

Catch up on the Dialogues stage at Google I/O 2026.

Recap of Google I/O 2026 Dialogues stage covering AI, quantum computing, robotics, and creativity topics.

Google AI (Gemma)·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

SkillOpt applies gradient-based optimization to agent skill text as frozen external state, enabling systematic skill improvement under feedback.

Yifan Yang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws

Shannon Scaling Law models LLM training as noisy-channel information transmission, explaining non-monotonic phenomena like catastrophic overtraining.

Xu Ouyang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills

Comprehensive study of model-generated agent skills lifecycle: extraction, consumption, adaptation across experience generation and domain-level reuse.

Zisu Huang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SPACENUM: Revisiting Spatial Numerical Understanding in VLMs

SpaceNum benchmark tests whether VLMs genuinely ground numerical outputs in spatial perception via dynamic and static reasoning tasks.

Jianshu Zhang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ETCHR: Editing To Clarify and Harness Reasoning

ETCHR decouples image editing from understanding in MLLMs to improve visual reasoning without predefined toolkits or noisy generation.

Beichen Zhang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Complete-muE: Optimal Hyperparameter Transfer and Scaling for MoE Models

Complete-muE enables hyperparameter transfer between dense FFN and MoE architectures via normalized router scaling and active-width μP bridges.

Hongwu Peng·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Good Token Hunting: A Hitchhiker's Guide to Token Selection for Visual Geometry Transformers

Token selection strategy reduces quadratic attention cost in visual geometry transformers for 3D reconstruction by restricting key/value interactions.

Shuhong Zheng·1 month ago

r/ClaudeAI· COMMUNITY

I built an app with Claude Code that converts any text into high-quality audio. It works with PDFs, blog posts, Substack and Medium links, and even photos of text.

Developer built text-to-speech mobile app using Claude Code, supporting PDFs, web articles, and image text with privacy-first design.

u/OneMoreSuperUser·1 month ago·41 pts / 15 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

CHRONOS: Temporally-Aware Multi-Agent Coordination for Evolving Data Marketplaces

CHRONOS three-layer architecture handles temporal decay, dynamic Shapley pricing, and shared differential-privacy budgets in evolving knowledge-graph marketplaces.

Joydeep Chandra·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Multilingual Knowledge Transfer under Data Constraints via Lexical Interventions

Lexical intervention technique improves cross-lingual knowledge transfer for low-resource languages without parallel data or auxiliary models.

Anastasiia Sedova·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

PGT: Procedurally Generated Tasks for improving visual grounding in MLLMs

PGT generates procedural geometric tasks to improve MLLM fine-grained visual grounding and diagnose perception failure sources.

Rim Assouel·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

On the Stability of Spherical Hellinger-Kantorovich Flows and Their Implications for Differential Privacy

Perturbation theory for spherical Hellinger-Kantorovich gradient flows with dimension-free stability bounds.

Aratrika Mustafi·1 month ago

r/LocalLLaMA· COMMUNITY

BeeLlama v0.2.0 – major DFlash update. Single RTX 3090: Qwen 3.6 27B up to 164 tps (4.40x), Gemma 4 31B up to 177.8 tps (4.93x). Prompt processing speed near baseline.

BeeLlama v0.2.0 reports token throughput gains of up to 4.40x (Qwen 3.6 27B) and 4.93x (Gemma 4 31B) on a single RTX 3090 via DFlash optimizations, with prompt processing speed near baseline.

u/Anbeeld·1 month ago·99 pts / 74 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Training-Free Looped Transformers

Inference-time layer looping retrofit for frozen transformers improves efficiency without retraining or architecture changes.

Lizhang Chen·1 month ago

r/Anthropic· COMMUNITY

The butterfly effect in LLM social simulations. Relevant to how we write CLAUDE.md and system prompts.

Two persona prompts, identical content, same model (gpt-5.2). Only difference is formatting: one prose, one bullet points. In a 10-round Prisoner’s Dilemma the prose version cooperated ~96% of the time, the bullet version ~20%. A 76pp gap, p < 0.001. Same meaning, opposite behavior. Authors call it the butterfly effect in LLM simulations. The part that matters here: CLAUDE.md, system prompts, and memory are mostly declared self-description. If formatting alone moves behavior this much, two people with the same intent get different Claudes based on how they happened to write it up. Any...

u/silence-and-magic·1 month ago·13 pts / 9 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Move on Muon: A Hamiltonian probability gradient flow perspective of Muon optimizer

Hamiltonian gradient flow interpretation of Muon optimizer via regularized orthogonalization and Fenchel duality.

Aratrika Mustafi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Human Decision-Making with Persuasive and Narrative LLM Explanations

Large-scale human study on how LLM narrative explanations affect decision-making accuracy in classification tasks.

Laura R. Marusich·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Leveraging Foundation Models for Causal Generative Modeling

FM-CGM modular framework leverages foundation models for visual causal reasoning and counterfactual generation.

Aneesh Komanduri·1 month ago

The Verge AI· PRESS

Elon, stop trying to make Grok happen

There is a harsh truth about Elon Musk's "truth-seeking" AI chatbot Grok: It's not very good, and not many people are using it. That's the takeaway of a new Reuters report, which found that Grok barely appears in federal records of how the US government used AI last year. It's not the only sign xAI's signature chatbot is in trouble, even as Musk puts it at the heart of what could be the biggest IPO in history. Reuters reviewed more than 400 examples of government AI use where specific vendors were named. Grok or xAI, it found, appeared in only three - each of those for basic uses like documen...

Robert Hart·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Strong Teacher Not Needed? On Distillation in LLM Pretraining

LLM distillation empirical finding: weak teachers improve larger students with proper loss mixing, challenging strong-teacher assumption.

Taiming Lu·1 month ago

Stratechery· ANALYST

2026.21: The Data Center Veto

Stratechery weekly roundup covering data center policy tensions, agent economics models, and tangential topics from May 2026.

Ben Thompson·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Entrywise Error Bounds for Spectral Ranking with Semi-Random Adversaries

Entry-wise error analysis of spectral ranking (Bradley-Terry-Luce) under semi-random adversarial edge sampling.

Dongmin Lee·1 month ago

30 stories
