The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

History Anchors: How Prior Behavior Steers LLM Decisions Toward Unsafe Actions

Study shows frontier LLMs continue harmful actions when primed by prior unsafe steps in agent logs, revealing misalignment in long-horizon reasoning.

Alberto G. Rodríguez Salgado·1 month ago

Ars Technica AI· PRESS

Altman forced to confront claims at OpenAI trial that he's a prolific liar

"Very painful": Altman relives his Muskian reaction to losing control over OpenAI.

Ashley Belanger ·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Harnessing Agentic Evolution

Framework for agentic evolution integrates feedback organization and evidence management to improve program search and long-horizon agent planning.

Jiayi Zhang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Neurosymbolic Auditing of Natural-Language Software Requirements

VERIME combines LLMs with SMT solvers to audit natural-language specs, detecting ambiguity and safety violations in safety-critical requirements.

Bethel Hall·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Uncertainty-Driven Anomaly Detection for Psychotic Relapse Using Smartwatches: Forecasting and Multi-Task Learning Fusion

Transformer-based smartwatch framework for early psychotic relapse detection using uncertainty-driven anomaly detection on cardiac and motion signals.

Nikolaos Tsalkitzis·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Provable Quantization with Randomized Hadamard Transform

Dithered Hadamard quantization provides theoretical guarantees for vector compression in KV cache and federated learning with O(d log d) complexity.

Ying Feng·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Parallel Scan Recurrent Neural Quantum States for Scalable Variational Monte Carlo

Parallel-scan recurrent neural networks enable scalable variational Monte Carlo for quantum many-body systems via parallelizable RNN architectures.

Ejaaz Merali·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Min-Max Optimization Requires Exponentially Many Queries

Theoretical result: finding ε-approximate stationary points in nonconvex-nonconcave min-max optimization requires exponential query complexity.

Martino Bernasconi·1 month ago

r/ClaudeAI· COMMUNITY

A new monthly Agent SDK credit for Claude plans

Anthropic introduces dedicated monthly Agent SDK credits for paid Claude plans, separating programmatic usage limits from interactive chat starting June 15.

u/ClaudeOfficial·1 month ago·22 pts / 16 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Improving Reproducibility in Evaluation through Multi-Level Annotator Modeling

Multi-level annotator modeling framework improves reproducibility in LLM evaluations by accounting for human rater bias and subjective variance.

Deepak Pandita·1 month ago

r/LocalLLaMA· COMMUNITY

Efficient pretraining with token superposition by Nous Research

Nous Research publishes efficient pretraining method using token superposition, reducing compute requirements for model training.

u/de4dee·1 month ago·40 pts / 13 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

An LLM-Based System for Argument Reconstruction

Multi-stage LLM pipeline reconstructs arguments from natural language into directed acyclic graphs of premises, conclusions, and logical relations.

Paulo Pirozelli·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Di-BiLPS: Denoising induced Bidirectional Latent-PDE-Solver under Sparse Observations

Di-BiLPS neural framework solves PDEs under extremely sparse observations via denoising-induced bidirectional latent solvers with efficient inference.

Zhonghao Li·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ENSEMBITS: an alphabet of protein conformational ensembles

Ensembits tokenizes protein conformational ensembles for dynamics-aware protein language modeling.

Kaiwen Shi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Force-Aware Neural Tangent Kernels for Scalable and Robust Active Learning of MLIPs

Active learning framework for machine-learning interatomic potentials scales to 200k structures via neural tangent kernels.

Eszter Varga-Umbrich·1 month ago

r/LocalLLaMA· COMMUNITY

New models possibly from Baidu (ERNIE) this month?

Unconfirmed reports of new ERNIE models from Baidu possibly launching in 2026; details sparse, sourced from tweets and a long video.

u/pmttyji·1 month ago·41 pts / 14 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Interpretable Machine Learning for Antepartum Prediction of Pregnancy-Associated Thrombotic Microangiopathy Using Routine Longitudinal Laboratory Data

ML model predicts pregnancy-associated thrombotic microangiopathy from longitudinal lab data using interpretable methods.

Chuanchuan Sun·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Amplification to Synthesis: A Comparative Analysis of Cognitive Operations Before and After Generative AI

Study compares cognitive operation tactics in bot-driven amplification vs. generative-AI-enabled disinformation campaigns.

Liz Cho·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Attention Once Is All You Need: Efficient Streaming Inference with Stateful Transformers

Stateful transformer inference engine cuts streaming query latency from O(n) to O(|q|) via persistent KV cache and Flash Queries.

Victor Norgren·1 month ago

r/LocalLLaMA· COMMUNITY

DramaBox - Most Expressive Voice model ever based on LTX 2.3

Resemble AI releases DramaBox, a voice synthesis model built on LTX 2.3, with open weights on Hugging Face.

u/manmaynakhashi·1 month ago·60 pts / 16 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

LMPath: Language-Mediated Priors and Path Generation for Aerial Exploration

LMPath uses LLMs and vision models to generate semantic exploration priors for UAV search missions.

Jonathan A. Diller·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

MinT infrastructure scales LoRA post-training and serving across millions of adapted LLMs without merging checkpoints.

Mind Lab·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

(How) Do Large Language Models Understand High-Level Message Sequence Charts?

Empirical study evaluates whether LLMs correctly interpret formal semantics of High-Level Message Sequence Charts.

Mohammad Reza Mousavi·1 month ago

r/singularity· COMMUNITY

Figure AI livestream: watch a team of humanoid robots running a full 8-hour shift at human performance levels, fully autonomous.

Figure AI demonstrates humanoid robots completing 8-hour autonomous shifts at human performance levels in livestreamed deployment.

u/Distinct-Question-16·1 month ago·102 pts / 30 comm

r/ClaudeAI· COMMUNITY

Even the competition approves.

u/Russkiy_Muzhik·1 month ago·36 pts / 5 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Where Does Reasoning Break? Step-Level Hallucination Detection via Hidden-State Transport Geometry

Method detects step-level hallucinations in LLM reasoning by monitoring hidden-state trajectory geometry during inference.

Tyler Alvarez·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Dense vs Sparse Pretraining at Tiny Scale: Active-Parameter vs Total-Parameter Matching

Tiny-scale study compares dense vs. sparse (MoE) transformers under matched parameter budgets; sparse outperforms with top-2 routing.

Abdalrahman Wael·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

High-Rate Quantized Matrix Multiplication II

Quantization technique for weight-only post-training of LLMs using waterfilling to optimize rate distribution across coordinates.

Or Ordentlich·1 month ago

The Verge AI· PRESS

Mark Zuckerberg announces ‘completely private’ encrypted Meta AI chat

Meta CEO Mark Zuckerberg says its new Incognito Chat is "the first major AI product where there is no log of your conversations stored on servers." Messages in Incognito Chat aren't saved or stored in users' chat history, similar to incognito modes on other AI chatbots, but Meta says its version is different because it also uses end-to-end encryption, which Meta recently removed from Instagram DMs: "Other apps have introduced incognito-style modes, but they can still see the questions coming in and the answers going out. Incognito Chat with Meta AI is truly private, meaning no one - not even ...

Stevie Bonifield·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

VectorSmuggle: Steganographic Exfiltration in Embedding Stores and a Cryptographic Provenance Defense

Steganographic attack exfiltrating data from vector database embeddings; proposes cryptographic provenance defense for RAG systems.

Jascha Wanger·1 month ago

← Front Page30 stories

← Newer Older →

The Archive

History Anchors: How Prior Behavior Steers LLM Decisions Toward Unsafe Actions

Altman forced to confront claims at OpenAI trial that he's a prolific liar

Harnessing Agentic Evolution

Neurosymbolic Auditing of Natural-Language Software Requirements

Uncertainty-Driven Anomaly Detection for Psychotic Relapse Using Smartwatches: Forecasting and Multi-Task Learning Fusion

Provable Quantization with Randomized Hadamard Transform

Parallel Scan Recurrent Neural Quantum States for Scalable Variational Monte Carlo

Min-Max Optimization Requires Exponentially Many Queries

A new monthly Agent SDK credit for Claude plans

Improving Reproducibility in Evaluation through Multi-Level Annotator Modeling

Efficient pretraining with token superposition by Nous Research

An LLM-Based System for Argument Reconstruction

Di-BiLPS: Denoising induced Bidirectional Latent-PDE-Solver under Sparse Observations

ENSEMBITS: an alphabet of protein conformational ensembles

Force-Aware Neural Tangent Kernels for Scalable and Robust Active Learning of MLIPs

New models possibly from Baidu (ERNIE) this month?

Interpretable Machine Learning for Antepartum Prediction of Pregnancy-Associated Thrombotic Microangiopathy Using Routine Longitudinal Laboratory Data

Amplification to Synthesis: A Comparative Analysis of Cognitive Operations Before and After Generative AI

Attention Once Is All You Need: Efficient Streaming Inference with Stateful Transformers

DramaBox - Most Expressive Voice model ever based on LTX 2.3

LMPath: Language-Mediated Priors and Path Generation for Aerial Exploration

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

(How) Do Large Language Models Understand High-Level Message Sequence Charts?

Figure AI livestream: watch a team of humanoid robots running a full 8-hour shift at human performance levels, fully autonomous.

Even the competition approves.

Where Does Reasoning Break? Step-Level Hallucination Detection via Hidden-State Transport Geometry

Dense vs Sparse Pretraining at Tiny Scale: Active-Parameter vs Total-Parameter Matching

High-Rate Quantized Matrix Multiplication II

Mark Zuckerberg announces &#8216;completely private&#8217; encrypted Meta AI chat

VectorSmuggle: Steganographic Exfiltration in Embedding Stores and a Cryptographic Provenance Defense

Mark Zuckerberg announces ‘completely private’ encrypted Meta AI chat