The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Algorithm for Contextual Queueing Bandits with Rate-Optimal Queue Length Regret

Contextual queueing bandits provide a framework for learning to schedule heterogeneous jobs under unknown context-dependent service rates. Under stochastic contexts, existing algorithms achieve $\widetilde{\mathcal{O}}(T^{-1/4})$ queue length regret, defined as the expected difference between the learner's and oracle's queue lengths at horizon $T$. In this paper, we improve this rate to $\widetilde{\mathcal{O}}(T^{-1/2})$. The key observation is that random exploration is needed only up to a carefully chosen cutoff round, rather than throughout the entire horizon. We propose CQB-$\eta$-2, a thre...

Seoungbin Bae · 2 months ago

Cross-Modal Masking for Robust Silent Speech Synthesis Using sEMG and Lipreading

Frequency-based Constrained Sampling for Interval Patterns

Amazon now lets you design custom merch using AI

In-Context Learning for Latent Space Bayesian Optimization

From 0-to-1 to 1-to-N: Reproducible Engineering Evidence for MetaAI Recursive Self-Design

When Built-in Thinking Helps and Hurts: Constraint-Level Error Shifts in Instruction Following

End-to-End Context Compression at Scale

Muon Learns More Robust and Transferable Features than Adam

Beyond Accuracy: Community Perspectives on Machine Translation

A Unifying Framework for Concept-Based Representational Similarity

ArtiFact: A Large-Scale Multi-Modal Cultural Heritage Dataset

Do Video Foundation Models Understand Intuitive Physics? A Layerwise Probing Analysis

Where Does the Answer Come From? Benchmarking View-Level Visual Evidence Identification in Multi-View MLLMs for Autonomous Driving

FMplex: Model Virtualization for Serving Extensible Foundation Models

Data-driven discovery of governing differential equations across physical systems

Gradient-Guided Reward Optimization for Inference-time Alignment

ATN3D: Density-Aware LiDAR-Radar Early 3D Object Detection Under Extreme Sparsity

Civil Court Simulation with Large Language Models

ReCoVLA: VLM-Guided Reward Compilation for Failure Recovery in Vision-Language-Action Policies

Constrained user-item allocation for e-commerce marketing campaigns

Powering the Future of AI: Navigating the Trade-offs for Europe's Energy Transition and Net-Zero Goals

AGENTSERVESIM: A Hardware-aware Simulator for Multi-Turn LLM Agent Serving

Shape Formation for the Cooperative Transportation of Arbitrary Objects Using Multi-Agent Reinforcement Learning

Closure-Validated Circuit Discovery in Attention Heads: Co-activation Proposes, Ablation Disposes

Next-Token Prediction Learns Generalisable Representations of Sleep Physiology

Automated IEP Generation from Traditional Chinese Parent-Teacher Interviews via Corpus-Grounded Feature Diffusion

Assessing Sample Quality in Conditional Generation under Compositional Shift

Clinically Grounded Privacy Evaluation of Medical LMs

I Was Scrolling and Then I Saw a Pregnant Strawberry