The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

STRIDE: Training Data Attribution via Sparse Recovery from Subset Perturbations

Training Data Attribution (TDA) seeks to trace a model's predictions back to its training data. The gold standard for TDA relies on causal interventions, observing how a model changes when data is added or removed, but repeated retraining is computationally challenging for Large Language Models (LLMs). Consequently, most approaches approximate this effect in the parameter space using gradients. However, tracking gradients across billions of parameters is not only prohibitively expensive but relies on local approximations. In this work, we propose a shift: rather than estimating parameter chan...

Rishit Dagli·2 months ago

The Archive

STRIDE: Training Data Attribution via Sparse Recovery from Subset Perturbations

Beyond Text Following: Repairable Arbitration Reversals in Audio-Language Models

Streaming Communication in Multi-Agent Reasoning

Reinforcement Learning from Rich Feedback with Distributional DAgger

Multi-Column RBF Neural Network Using Adaptive and Non-Adaptive Particle Swarm Optimization

An Open-Source Two-Stage Computer Vision Pipeline for Fine-Grained Vehicle Classification using Vision Transformers

Failed Reasoning Traces Tell You What Is Fixable (But Not by Reading Them)

GeM-NR: Geometry-Aware Multi-View Editing for Nonrigid Scene Changes

BBOmix: A Tabular Benchmark for Hyperparameter Optimization of Unsupervised Biological Representation Learning

Generating Financial Time Series by Matching Random Convolutional Features

As AI gets better, it reveals an empty promise

Activation-Based Active Learning for In-Context Learning: Challenges and Insights

Deep Embedded Multiplicative DMD for Algebra-Preserving Koopman Learning

Towards Efficient and Evidence-grounded Mobility Prediction with LLM-Driven Agent

Preserving Data Privacy in Learning Causal Structure with Fully Homomorphic Encryption

Geometry Gaussians: Decoupling Appearance and Geometry in Gaussian Splatting

Self-Evaluation Is Already There: Eliciting Latent Judge Calibration in Base LLMs with Minimal Data

Audio Interaction Model

Graph Set Transformer

Continual Visual and Verbal Learning Through a Child's Egocentric Input

Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases

⚡️Satya Nadella: No Priors x Latent Space Crossover Special at Microsoft Build

RePercENT: Scaling Disentangled Representation Learning Beyond Two Modalities

Who Needs Labels? Adapting Vision Foundation Models With the Metadata You Already Have

Arithmetic Pedagogy for Language Models

Knowledge Index of Noah's Ark

Identifying Gems from Roman RAPIDly

FoeGlass: Simple In-Context Learning Is Enough for Red Teaming Audio Deepfake Detectors

Light or Full Verb? A Minimal-Pair Dataset for Probing Phraseological Competence in Language Models

Automatic Generation of Titles for Research Papers Using Language Models