The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


Efficient ASR Training with Conversations that Never Happened

Conversational ASR for lower-resource languages and niche domains is limited by the scarcity of domain-matched multi-speaker training data. We propose an augmentation pipeline that generates scenario-level dialogues with participant metadata, maps speaker attributes to TTS voice profiles, and assembles synthesized utterances into speaker-aware simulated conversations. We evaluated five LLM families under single-generator, fixed-budget mixture, and scale-up settings using the same FastConformer-Large training recipe for each one. We ran comprehensive evaluations on the Hungarian BEA-Dialogue b...

Máté Gedeon·15 days ago


Efficient ASR Training with Conversations that Never Happened

VLESA: Vision-Language Embodied Safety Agent for Human Activity Monitoring

A Pocket Offline Model for Simultaneous Speech Translation as CUNI Submission to IWSLT 2026

MLSkip: Data Skipping for ML Filters via Lightweight Metadata

Microsoft’s Project Solara is an OS for AI agent gadgets

SEAOTTER: Sensor Embedded Autoencoding with One-Time Transcode for Efficient Reconstruction

FlashbackCL: Mitigating Temporal Forgetting in Federated Learning

q0: Primitives for Hyper-Epoch Pretraining

Entropy Is Not Enough: Unlocking Effective Reinforcement Learning for Visual Reasoning via Vision-Anchored Token Selection

Correcting Neural Operator Spectral Bias via Diffusion Posterior Sampling with Sparse Observations

Quadratic integrate-and-fire neurons exhibit less fragmented loss landscapes and outperform leaky integrate-and-fire neurons in spike-based gradient descent

Value-Aware Stochastic KV Cache Eviction for Reasoning Models

FFR: Forward-Forward Learning for Regression

DiffUNet^2: Bidirectional Prediction, Probabilistic Generation and Collaborative Visual Discovery for Scientific Data

Knowledge Editing in Masked Diffusion Language Models

Contrastive Neural Algorithmic Reasoning for Graph Coloring

Forecasting Conceptual Diffusion in Science: The Case of Quantum Computing

Hedge-Bench: Benchmarking Agents on Hard, Realistic Tasks Pertaining to Financial Reasoning

Beyond Gradient Descent: Adam for Analog Ising Machines

NetKV: Network-Aware Decode Instance Selection for Disaggregated LLM Inference

The Impact of Configuring Agentic AI Coding Tools on Build-vs-Buy Decisions: A Study Protocol

scTranslation: A Comprehensive Benchmark for Single-Cell Multi-Omics Modality Translation

MAdam: Metric-Aware Multi-Objective Adam

Denoise First, Orthogonalize Later: Understanding Momentum in Muon via Spectral Filtering

Agent libOS: A Library-OS-Inspired Runtime for Long-Running, Capability-Controlled LLM Agents

Synthesize and Reward -- Reinforcement Learning for Multi-Step Tool Use in Live Environments

RealClawBench: Live OpenClaw Benchmarks from Real Developer-Agent Sessions

CoralBay: A Self-Supervised CT Foundation Model

Attribution via Distributional Paths for Information Revelation

Reasoning Structure of Large Language Models
