The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Do Composed Image Retrieval Benchmarks Require Multimodal Composition?

Analysis shows CIR benchmarks can be solved with single-modality embeddings, questioning necessity of multimodal composition.

Matteo Attimonelli·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Known By Their Actions: Fingerprinting LLM Browser Agents via UI Traces

Websites can fingerprint LLM browser agents with a 96% F1 score via UI interaction traces, enabling targeted exploits.

William Lugoloobi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Understanding Imbalanced Forgetting in Rehearsal-Based Class-Incremental Learning

Study identifies systematic imbalanced forgetting patterns in rehearsal-based class-incremental learning.

Alberto Tamajo·1 month ago

r/LocalLLaMA· COMMUNITY

NVFP4 Kimi2.6 and Kimi 2.5 released by Nvidia

NVIDIA releases quantized NVFP4 versions of Moonshot AI's Kimi-K2.6 and Kimi-K2.5 models via Model Optimizer with benchmark results.

u/Opening-Broccoli9190·1 month ago·43 pts / 28 comm

r/Anthropic· COMMUNITY

Yesterday I was at 50% of my weekly usage. This morning I'm now at 30%. What happened?

User reports weekly Claude usage reading changing unexpectedly from 50% to 30% overnight; potential reset-timing, billing, or rate-limit bug.

u/Coconut-Agua·1 month ago·11 pts / 11 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Peng's Q($λ$) for Conservative Value Estimation in Offline Reinforcement Learning

Conservative Peng's Q(λ) algorithm for offline RL using multi-step value estimation in fixed behavior distributions.

Byeongchan Kim·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Identifying Culprits Through Deep Deterministic Policy Gradient Deep Learning Investigation

DDPG-based approach for criminal identification in complex datasets with reduced false positives.

Lata B T·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond What to Select: A Plug-and-play Oscillatory Data-Volume Scheduling for Efficient Model Training

Oscillatory data-volume scheduling method that dynamically adjusts training data selection ratios for efficiency.

Suorong Yang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

BioHuman: Learning Biomechanical Human Representations from Video

BioHuman10M dataset enables muscle activation inference from video via simulation-based biomechanical annotation.

Yujun Huo·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MediaClaw: Multimodal Intelligent-Agent Platform Technical Report

MediaClaw multimodal agent platform unifies fragmented AIGC capabilities with pluginized architecture and workflow orchestration.

Shaoan Zhao·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Composable Crystals: Controllable Materials Discovery via Concept Learning

Concept-based compositional framework for controllable de novo crystal generation via vector-quantized VAE.

Nian Liu·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Streaming Speech-to-Text Translation with a SpeechLLM

Real-time streaming speech-to-text translation system combining speech recognition and translation in SpeechLLM architecture.

Titouan Parcollet·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Persian MusicGen: A Large-Scale Dataset and Culturally-Aware Generative Model for Persian Music

Persian MusicGen adapts MusicGen to Persian tonalities and Dastgah systems using 900-hour culturally-specific dataset.

Mohammad Hossein Sameti·1 month ago

r/LocalLLaMA· COMMUNITY

Scenema Audio: Zero-shot expressive voice cloning and speech generation

Scenema Audio releases open-weights model for zero-shot expressive voice cloning, decoupling voice identity from emotional performance via separate control prompts.

u/a__side_of_fries·1 month ago·50 pts / 14 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Compositional Sparsity as an Inductive Bias for Neural Architecture Design

Information Filtering Networks and Homological Neural Networks combined to study compositional sparsity as structural prior for DNN design.

Hongyu Lin·1 month ago

r/ClaudeAI· COMMUNITY

Claude Certified Architect

Anthropic launches Claude Certified Architect exam covering evals, RAG, multi-agent orchestration, and LLM integration pitfalls.

u/invasionbarbare·1 month ago·25 pts / 20 comm

r/ClaudeAI· COMMUNITY

What do you actually use claude for every day that you'd miss if it disappeared?

Reddit thread on daily Claude usage patterns, from document analysis to agent building workflows.

u/OsinomaFunds·1 month ago·20 pts / 32 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

AI Outperforms Humans in Personalized Image Aesthetics Assessment via LLM-Based Interviews and Semantic Feature Extraction

LLM-based preference interviews paired with semantic feature extraction outperform human judges on personalized image aesthetic assessment.

Yoshia Abe·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Crys-JEPA: Accelerating Crystal Discovery via Embedding Screening and Generative Refinement

Crys-JEPA addresses stability-novelty trade-off in crystal generation via embedding screening and generative refinement for materials discovery.

Nian Liu·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Probabilistic Verification of Recurrent Neural Networks for Single and Multi-Agent Reinforcement Learning

RNN-ProVe probabilistically verifies RNN-based policies in partially observable RL without restrictive assumptions or coarse approximations.

Luca Marzari·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

XDomainBench: Diagnosing Reasoning Collapse in High-Dimensional Scientific Knowledge Composition

XDomainBench diagnostic benchmark stress-tests LLM compositional reasoning across interdisciplinary scientific knowledge with interactive workflows.

Gong Zhiren·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Cognitive-Uncertainty Guided Knowledge Distillation for Accurate Classification of Student Misconceptions

Two-stage knowledge distillation framework addresses student misconception classification via cognitive uncertainty guidance on edge devices.

Qirui Liu·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

EVA: Editing for Versatile Alignment against Jailbreaks

EVA model editing defense mitigates textual and visual jailbreak attacks on LLMs and VLMs without safety-utility trade-off via targeted edits.

Yi Wang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Non-linear Interventions on Large Language Models

Non-linear intervention framework extends LLM mechanistic understanding beyond Linear Representation Hypothesis to implicitly encoded features.

Sangwoo Kim·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Video2GUI extracts GUI interaction trajectories from unlabeled Internet videos for large-scale GUI agent pretraining without manual annotation.

Weimin Xiong·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Selective Safety Steering via Value-Filtered Decoding

Value-filtered decoding selectively applies safety steering at test-time, avoiding unnecessary interventions that degrade helpfulness and coherence.

Bat-Sheva Einbinder·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Mechanical Enforcement for LLM Governance: Evidence of Governance-Task Decoupling in Financial Decision Systems

Study shows LLM-based financial governance lacks behavioral compliance; proposes five rationale-level metrics and mechanical enforcement approaches.

José Manuel de la Chica Rodríguez·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Addressing Terminal Constraints in Data-Driven Demand Response Scheduling

Reinforcement learning method combining Goal-Space Planning and DDPG for demand response scheduling with terminal constraints.

Maximilian Bloor·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

TAPIOCA: Why Task-Aware Pruning Improves OOD model Capability

Task-aware layer pruning improves OOD generalization but not ID accuracy in LLMs; geometric explanation via norm/distance profile divergence.

Krish Sharma·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

IsoNet: Spatially-aware audio-visual target speech extraction in complex acoustic environments

Audio-visual speech extraction system IsoNet uses spatial cues and face embeddings on compact 4-microphone arrays with curriculum learning.

Dinanath Pathya·1 month ago

30 stories