The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Forward-Learned Discrete Diffusion: Learning how to noise to denoise faster

Forward-Learned Discrete Diffusion enables few-step generation by learning discrete diffusion forward process directly.

Grigory Bartosh·1 month ago

Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm)

Qwen 3.6 27B performance benchmarks across llama.cpp backends on RTX 3090: ik_llama.cpp achieved 1261 tok/s prefill, 72.9 tok/s decode with 156k context.

u/VolandBerlioz·1 month ago·43 pts / 16 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Concise and Logically Consistent Conformal Sets for Neuro-Symbolic Concept-Based Models

Integrates conformal prediction with neuro-symbolic concept models to quantify prediction confidence and improve trustworthiness.

Samuele Bortolotti·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

PIPER: Content-Based Table Search via profiling and LLM-Generated Pseudoqueries

PIPER uses LLM-generated pseudoqueries and profiling for content-based table search across data lakes.

Riccardo Terrenzi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

RGB-only Active 3D Scene Graph Generation for Indoor Mobile Robots

RGB-only active 3D scene graph generation for mobile robots without depth sensors, with learned viewpoint selection.

Giorgia Modi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond the Cartesian Illusion: Testing Two-Stage Multi-Modal Theory of Mind under Perceptual Bottlenecks

MLLMs fail at grounded 3D spatial reasoning and multi-agent Theory of Mind due to text-based probability distributions lacking topological understanding.

Yajing Zhou·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Pairwise Preference Reward and Group-Based Diversity Enhancement for Superior Open-Ended Generation

PPR-GDE method combines pairwise preference rewards with group-based diversity enhancement to reduce RL diversity collapse in open-ended generation.

Guining Cao·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Dual-Rate Diffusion: Accelerating diffusion models with an interleaved heavy-light network

Dual-Rate Diffusion accelerates inference by interleaving sparse heavy context encoder with light denoising model for efficient feature reuse.

Grigory Bartosh·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

UTOPYA: A Multimodal Deep Learning Framework for Physics-Informed Anomaly Detection and Time-Series Prediction

UTOPYA is a 15.2M-parameter multimodal framework for anomaly detection and time-series prediction in batch distillation using FiLM-based fusion.

Robson W. S. Pessoa·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Fixed External Cameras as Common Prior Maps for Active 3D Scene Graph Generation

Framework integrates fixed external RGB cameras as geometric priors for active 3D scene graph generation in robotic systems.

Giorgia Modi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Scalable Environments Drive Generalizable Agents

Position paper argues agent generalization requires scaling environment rule-sets and interaction interfaces, not just trajectories within fixed benchmarks.

Jiayi Zhang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Canonical Regularisation of Wide Feature-Learning Neural Networks

Theoretical analysis of canonical regularisation in wide feature-learning neural networks, extending kernel regime NN-GP correspondence insights.

George Whittle·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MARS: Technical Report for the CASTLE Challenge at EgoVis 2026

MARS system for CASTLE Challenge treats egocentric multimodal reasoning (4 days, 15 views, 8 modalities) as agentic evidence selection.

Haoyu Zhang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Ringmaster LMO: Asynchronous Linear Minimization Oracle Momentum Method

Ringmaster LMO enables asynchronous Linear Minimization Oracle training for heterogeneous distributed systems, improving on synchronous Muon.

Abdurakhmon Sadiev·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Visualizing the Invisible: Generative Visual Grounding Empowers Universal EEG Understanding in MLLMs

Generative Visual Grounding uses EEG-to-image models to ground brain signals visually instead of via text, improving MLLM-based brain foundation models.

Junyu Pan·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Buffer-Parameterized Machine Learning Surrogate Models for Cross-Technology Signal Integrity Analysis and Optimization

ML surrogate models for PCB signal integrity analysis that adapt to buffer parameter variations without retraining.

Julian Withöft·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Elastic-dLLM: Position Preserving Context Compression and Augmentation of Diffusion LLMs

Elastic-dLLM reduces computational redundancy in diffusion language models via context compression and position-aware augmentation.

Junyi Wu·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

TRACE: Trajectory Correction from Cross-layer Evidence for Hallucination Reduction

TRACE method corrects LLM hallucinations by analyzing cross-layer evidence to identify where factual information is suppressed.

Tej Sanibh Ranade·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Self-Evolving Spatial Reasoning in Vision Language Models via Geometric Logic Consistency

SAGE framework improves VLM spatial reasoning via geometric logic consistency and duality operations in GRPO training.

Junming Liu·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Vision Inference Former: Sustaining Visual Consistency in Multimodal Large Language Models

Vision Inference Former addresses visual consistency in MLLMs by elevating visual features above text-token parity in connector-based architectures.

Xinpeng Dong·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

FOL2NS: Generating Natural Sentences from First-Order Logic

FOL2NS neurosymbolic framework generates natural language from first-order logic formulas with deep nesting and varying quantifier depths.

Mei Jia·1 month ago

r/ClaudeAI· COMMUNITY

Claude in an Enterprise Environment

Reddit discussion on Claude deployment in enterprise, data security concerns, and comparison to Copilot for Enterprise.

u/kylehadfield1992·1 month ago·22 pts / 25 comm

Stratechery· ANALYST

Data Center Discontent, Understanding the Opposition, Fixing the Problem

Stratechery analyzes data center opposition and proposes financial compensation as the viable solution to community resistance.

Ben Thompson·1 month ago

OpenAI· FRONTIER

OpenAI and Dell partner to bring Codex to hybrid and on-premise enterprise environments

OpenAI and Dell partner to deploy Codex in hybrid/on-premise environments for enterprise coding automation with security controls.

OpenAI·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Whispers in the Noise: Surrogate-Guided Concept Awakening via a Multi-Agent Framework

Multi-agent framework exposes concept erasure vulnerabilities in diffusion models, showing suppressed concepts can be reactivated via black-box awakening.

Mengyu Sun·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Foundation Models for Credit Risk Prediction: A Game Changer?

Benchmarking study on foundation models vs. gradient-boosting for credit risk prediction with interpretability via SHAP.

Bart Baesens·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Evidence-Grounded Frontier Mapping and Agentic Hypothesis Generation in Nanomedicine

pArticleMap system uses embeddings and graph algorithms to map nanomedicine research frontiers and generate evidence-grounded hypotheses.

Christiaan G. A. Viviers·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Generative AI and the Productivity Divide: Human-AI Complementarities in Education

RCT shows LLM assistance improves learning task performance unevenly; gains correlate with domain knowledge, revealing productivity heterogeneity.

Lihi Idan·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

An Empirical Study of Privacy Leakage Chains via Prompt Injection in Black-Box Chatbot Environments

Empirical study of indirect prompt injection attacks enabling privacy leakage in black-box LLM chatbot agents with external tool access.

Hongjang Yang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Who Generated This 3D Asset? Learning Source Attribution for Generative 3D Models

First source attribution benchmark for generative 3D models across multi-view, geometric, and frequency-domain fingerprints.

Sihan Ma·1 month ago

← Front Page30 stories

← Newer Older →