The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

RobustToolBench benchmark exposes tool-use agent failures from deployment noise; domain-randomized RL improves robustness.

Xiaolin Zhou·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning

StepCodeReasoner supervises intermediate execution traces via RL to prevent reward hacking in code reasoning tasks.

Hao Wang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Domain Restriction via Multi SAE Layer Transitions

Sparse autoencoders on LLM layer transitions detect out-of-domain interactions without treating model as black box.

Elias Shaheen·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

STAGE: Tackling Semantic Drift in Multimodal Federated Graph Learning

STAGE framework addresses semantic drift across modality domains in federated graph learning with multimodal node attributes.

Zekai Chen·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Understanding Sample Efficiency in Predictive Coding

Study quantifies sample efficiency in Predictive Coding vs Backpropagation using target alignment metric, finding PC enables more efficient learning in small-scale experiments.

Gaspard Oliviers·1 month ago

r/ClaudeAI· COMMUNITY

🦀 Claude has crabs?! 🦀

Claude Haiku vulnerable to multi-turn prompt injection via fictional rule construction and word-filling technique.

u/BordairAPI·1 month ago·21 pts / 21 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Rethinking Positional Encoding for Neural Vehicle Routing

Paper formalizes positional encoding requirements for Transformer-based neural combinatorial optimization on vehicle routing, accounting for spatial structure unlike NLP.

Chuanbo Hua·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Delightful Gradients Accelerate Corner Escape

Theoretical analysis of Delightful Policy Gradient, which gates advantage-based updates to escape suboptimal policy corners faster than softmax policy gradient.

Jincheng Mei·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Procedural-skill SFT across capacity tiers: A W-Shaped pre-SFT Trajectory and Regime-Asymmetric Mechanism on 0.8B-4B Qwen3.5 Models

Empirical study of supervised fine-tuning on procedural skills across Qwen3.5 0.8B-4B models shows W-shaped pre-training trajectory and uniform SFT gains.

Igor Strozzi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

YFPO: A Preliminary Study of Yoked Feature Preference Optimization with Neuron-Guided Rewards for Mathematical Reasoning

YFPO leverages neuron activation patterns for preference optimization in mathematical reasoning, using model internals to guide reward signals instead of external preference data.

Yifan Le·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Rethinking Supervision Granularity: Segment-Level Learning for LLM-Based Theorem Proving

Proposes segment-level supervision for LLM-based Lean 4 theorem proving, balancing dense local signals of step-level training with coherence of whole-proof generation.

Shuo Xu·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond Point-wise Neural Collapse: A Topology-Aware Hierarchical Classifier for Class-Incremental Learning

Proposes Hierarchical-Cluster SOINN classifier for class-incremental learning that models classes as manifolds rather than point collapse, addressing Neural Collapse theory gaps.

Huiyu Yi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

AccLock: Unlocking Identity with Heartbeat Using In-Ear Accelerometers

Earphone-based passive biometric authentication system using in-ear accelerometers for heartbeat identification, unrelated to AI frontier topics.

Lei Wang·1 month ago

r/LocalLLaMA· COMMUNITY

Models and Quants quality test results - the chessboard svg (Qwen3.6 27B/35B-A3B/Zaya1)

Reddit user benchmarks open-weight LLMs (Qwen 3.6, Zaya1) on chess visualization task; Qwen 35B-A3B achieves near-perfect SVG chessboard generation.

u/Beamsters·1 month ago·40 pts / 17 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Toward Modeling Player-Specific Chess Behaviors

Model for emulating individualized chess player styles and decision-making, addressing limitations of skill-level generalization in superhuman chess engines.

Loris Sogliuzzo·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Proteus: A Self-Evolving Red Team for Agent Skill Ecosystems

Proteus red-teaming framework studies adaptive leakage of LLM agent skills via iterative adversarial revision, addressing real deployment risks beyond single-shot audits.

Zhaojiacheng Zhou·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Incentivizing Truthfulness and Collaborative Fairness in Bayesian Learning

Mechanism for collaborative ML ensures fair data valuation and incentivizes truthfulness via game-theoretic rewards.

Rachael Hwee Ling Sim·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models

Qwen-Scope: open-source suite of sparse autoencoders for mechanistic interpretability across 7 Qwen models.

Boyi Deng·1 month ago

Stratechery· ANALYST

SpaceX and Anthropic, xAI’s Two Companies, Elon Musk and SpaceXAI’s Future

Stratechery opinion: Elon Musk's dual involvement in SpaceX and xAI mirrors Anthropic's structure; argues Musk should focus xAI on B2B services.

Ben Thompson·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

From Clever Hans to Scientific Discovery: Interpreting EEG Foundational Transformers with LRP

Layer-wise relevance propagation extends attribution methods to EEG-based foundation models for interpretability verification.

Justus Meyer zu Bexten·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Sobolev Regularized MMD Gradient Flow

Sobolev-regularized MMD gradient flow with global convergence guarantees for distribution matching.

Chenyang Tian·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment

FATE: on-policy framework for agentic safety alignment via failure trajectory learning without safety-utility trade-off.

Bo Yin·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Adaptive TD-Lambda for Cooperative Multi-agent Reinforcement Learning

Adaptive TD-Lambda extends temporal difference learning to multi-agent RL with large joint action spaces.

Yue Deng·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Modulation Consistency-based Contrastive Learning for Self-Supervised Automatic Modulation Classification

Task-aware contrastive learning for automatic modulation classification via intra-instance consistency.

Chenxu Wang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

LOFT: Low-Rank Orthogonal Fine-Tuning via Task-Aware Support Selection

LOFT: parameter-efficient fine-tuning framework using low-rank orthogonal transformations with task-aware subspace selection.

Lanxin Zhao·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Information theoretic underpinning of self-supervised learning by clustering

Information-theoretic foundation for self-supervised clustering via KL-divergence optimization with mode-collapse constraints.

Josef Kittler·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

FIS-DiT: Breaking the Few-Step Video Inference Barrier via Training-Free Frame Interleaved Sparsity

FIS-DiT: training-free frame interleaving acceleration for video diffusion transformers in few-step inference regimes.

Jian Tang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

IPI-proxy: An Intercepting Proxy for Red-Teaming Web-Browsing AI Agents Against Indirect Prompt Injection

IPI-proxy toolkit enables red-teaming web-browsing AI agents against indirect prompt injection attacks embedded in whitelisted domain HTML.

Chia-Pei·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Variance-aware Reward Modeling with Anchor Guidance

Anchor-guided variance-aware reward modeling resolves non-identifiability in preference learning by augmenting pairwise comparisons with coarse response-level labels.

Shuxing Fang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Very Efficient Listwise Multimodal Reranking for Long Documents

ZipRerank achieves efficient listwise multimodal reranking for long documents by reducing visual tokens and eliminating autoregressive decoding.

Yiqun Sun·1 month ago

← Front Page30 stories

← Newer Older →

The Archive

When Simulation Lies: A Sim-to-Real Benchmark and Domain-Randomized RL Recipe for Tool-Use Agents

StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning

Domain Restriction via Multi SAE Layer Transitions

STAGE: Tackling Semantic Drift in Multimodal Federated Graph Learning

Understanding Sample Efficiency in Predictive Coding

🦀 Claude has crabs?! 🦀

Rethinking Positional Encoding for Neural Vehicle Routing

Delightful Gradients Accelerate Corner Escape

Procedural-skill SFT across capacity tiers: A W-Shaped pre-SFT Trajectory and Regime-Asymmetric Mechanism on 0.8B-4B Qwen3.5 Models

YFPO: A Preliminary Study of Yoked Feature Preference Optimization with Neuron-Guided Rewards for Mathematical Reasoning

Rethinking Supervision Granularity: Segment-Level Learning for LLM-Based Theorem Proving

Beyond Point-wise Neural Collapse: A Topology-Aware Hierarchical Classifier for Class-Incremental Learning

AccLock: Unlocking Identity with Heartbeat Using In-Ear Accelerometers

Models and Quants quality test results - the chessboard svg (Qwen3.6 27B/35B-A3B/Zaya1)

Toward Modeling Player-Specific Chess Behaviors

Proteus: A Self-Evolving Red Team for Agent Skill Ecosystems

Incentivizing Truthfulness and Collaborative Fairness in Bayesian Learning

Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models

SpaceX and Anthropic, xAI’s Two Companies, Elon Musk and SpaceXAI’s Future

From Clever Hans to Scientific Discovery: Interpreting EEG Foundational Transformers with LRP

Sobolev Regularized MMD Gradient Flow

On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment

Adaptive TD-Lambda for Cooperative Multi-agent Reinforcement Learning

Modulation Consistency-based Contrastive Learning for Self-Supervised Automatic Modulation Classification

LOFT: Low-Rank Orthogonal Fine-Tuning via Task-Aware Support Selection

Information theoretic underpinning of self-supervised learning by clustering

FIS-DiT: Breaking the Few-Step Video Inference Barrier via Training-Free Frame Interleaved Sparsity

IPI-proxy: An Intercepting Proxy for Red-Teaming Web-Browsing AI Agents Against Indirect Prompt Injection

Variance-aware Reward Modeling with Anchor Guidance

Very Efficient Listwise Multimodal Reranking for Long Documents