The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

arXiv (cs.AI/CL/LG)· ACADEMIA

A Typed Tensor Language for Federated Learning

Typed tensor language formalizing federated learning via client-local computation and mergeable aggregation.

Theofilos Mailis·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ACL-Verbatim: hallucination-free question answering for research

VerbatimRAG system for hallucination-free QA over ACL Anthology via extractive retrieval and verbatim text spans.

Gábor Recski·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

WCXB: A Multi-Type Web Content Extraction Benchmark

WCXB: web content extraction benchmark with 2,008 pages across 7 content types for retrieval and LLM training.

Murrough Foley·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

UOTIP: Unbalanced Optimal Transport Map for Unpaired Inverse Problems

UOTIP: unpaired inverse problem solver using unbalanced optimal transport for image reconstruction.

Donggyu Lee·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Reviving Error Correction in Modern Deep Time-Series Forecasting

Error correction mechanisms for deep autoregressive time-series forecasting to mitigate long-term prediction drift.

Minh Hoang Nguyen·1 month ago

r/ClaudeAI· COMMUNITY

Coffee, Claude, and Remotion is all you need to make launch videos.

https://reddit.com/link/1tik0qe/video/9bh6ypr3ca2h1/player A few hours, [Claude Code](https://www.claude.com/product/claude-code) + [Remotion](https://www.remotion.dev/), 4 black coffees, no design tools, no After Effects, no editor. **The whole trick:** Remotion is React for video. You write JSX, you get an mp4. Every animation is `interpolate(frame, [start, end], [from, to])`. That means **Claude Code can write the entire video for you**: it already knows React, animation is just numbers, and you can iterate the same way you iterate on a landing page. Change a value, re-render, see w...

u/Top_Commission_8567·1 month ago·22 pts / 8 comm
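
The "animation is just numbers" point in the post above can be sketched without any video tooling. The snippet below is a minimal illustration, not Remotion's implementation: `lerpFrame` and `styleAt` are hypothetical names that only mirror the call shape quoted in the post, and clamping outside the input range is a choice made for this sketch rather than a statement about Remotion's defaults.

```typescript
// Hypothetical stand-in for the call shape quoted in the post; not Remotion's code.
type Range = [number, number];

function lerpFrame(frame: number, [start, end]: Range, [from, to]: Range): number {
  if (end <= start) {
    throw new Error("input range must be increasing");
  }
  // Progress through the input range, clamped to 0..1 (an assumption of this sketch).
  const t = Math.min(1, Math.max(0, (frame - start) / (end - start)));
  return from + t * (to - from);
}

// Fade in over frames 0 to 30, then slide in from 100px left over frames 30 to 60.
function styleAt(frame: number): { opacity: number; translateX: number } {
  return {
    opacity: lerpFrame(frame, [0, 30], [0, 1]),
    translateX: lerpFrame(frame, [30, 60], [-100, 0]),
  };
}
```

Changing a single number, such as the end frame of the fade, changes the animation, which is the iteration loop the post describes.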

arXiv (cs.AI/CL/LG)· ACADEMIA

LoCar: Localization-Aware Evaluation of In-Vehicle Assistants through Fine-Grained Sociolinguistic Control

LoCar introduces an evaluation framework for in-vehicle LLM assistants, focusing on Korean honorific stability and localization.

Seogyeong Jeong·1 month ago

r/ClaudeAI· COMMUNITY

I don't think Claude Design likes my idea

u/Distinct-Bag6507·1 month ago·27 pts / 8 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Decoupling Communication from Policy: Robust MARL under Bandwidth Constraints

Decouples latent representations in MARL to enable robust multi-agent coordination under bandwidth constraints.

Alexi Canesse·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

AIMBio-Mat: An AI-Native FAIR Platform for Closed-Loop Materials Discovery and Biomedical Translation

AIMBio-Mat proposes an AI-native FAIR framework linking materials provenance, knowledge graphs, and active learning for biomedical discovery.

D.-M. Mei·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

AutoRPA: Efficient GUI Automation through LLM-Driven Code Synthesis from Interactions

AutoRPA distills LLM reasoning into efficient code synthesis for repetitive GUI automation tasks, bridging ReAct and traditional RPA.

Minghao Chen·1 month ago

r/ClaudeAI· COMMUNITY

Can we talk about how annoying Claude chat's question popup is?

User feedback on Claude Chat UI/UX: question popup blocks content and disrupts workflow.

u/inconspicuous_object·1 month ago·20 pts / 17 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Musical Attention Transformer: Music Generation Using a Music-Specific Attention Model

Musical Attention Transformer incorporates meta-information (bar, key, tempo) into the attention mechanism to reduce repetition in music generation.

Shinnosuke Taksuka·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

GradeLegal: Automated Grading for German Legal Cases

GradeLegal evaluates LLM capability to automatically grade German legal exam solutions in criminal and public law domains.

Abdullah Al Zubaer·1 month ago

r/LocalLLaMA· COMMUNITY

[WIP] Gemma 4 MTP

Early-stage Gemma 4 MTP compilation work-in-progress shared on LocalLLaMA.

u/jacek2023·1 month ago·52 pts / 14 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

SpectralEarth-FM: Bringing Hyperspectral Imagery into Multimodal Earth Observation Pretraining

SpectralEarth-FM integrates hyperspectral imagery with multisensor Earth observation data via a hierarchical transformer with spectral tokenization.

Nassim Ait Ali Braham·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Fine-grained Claim-level RAG Benchmark for Law

Benchmark for granular, claim-level evaluation of legal RAG systems, aimed at detecting hallucinations.

Souvick Das·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Towards Understanding Self-Pretraining for Sequence Classification

Self-Pretraining analysis investigates why masked token prediction pretraining on Transformers improves sequence classification without external data.

Omar Coser·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Robust Personalized Recommendation under Hidden Confounding in MNAR

Mitigates hidden confounding in MNAR observational data via a novel causal inference approach for recommender systems.

Zongyu Li·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

APM: Evaluating Style Personalization in LLMs with Arbitrary Preference Mappings

APM benchmark for evaluating style personalization in LLMs using arbitrary preference mappings without reference responses.

Philipp Spohn·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Grounding Driving VLA via Inverse Kinematics

Driving VLA redesigned via inverse kinematics framework to improve trajectory prediction by grounding visual tokens in dual boundary conditions.

Junsung Park·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Divide et Calibra: Multiclass Local Calibration via Vector Quantization

Vector quantization-based multiclass calibration method for ML models addressing heterogeneous calibration errors across latent space.

Cesare Barbera·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Multimodal LLMs under Pairwise Modalities

Theoretical framework for training multimodal LLMs using only pairwise modality alignments instead of full joint multimodal datasets.

Yan Li·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Dialogue between Causal and Traditional Representation Learning: Toward Mutual Benefits in a Unified Formulation

Position paper bridging causal representation learning and traditional representation learning via unified problem formulation.

Yan Li·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Genetic Programming with Transformer-Based Mutation for Approximate Circuit Design

Transformer-based mutation operator for Cartesian genetic programming applied to approximate circuit design optimization.

Ondrej Galeta·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Cross-lingual robustness of LLM-brain alignment and its computational roots

Multilingual whole-brain encoding study confirms LLM-brain alignment for language comprehension across Mandarin, English, and French.

Ni Yang·1 month ago

r/LocalLLaMA· COMMUNITY

RTX 5080 16GB: Qwen3.6 35B MoE at 128k context — 56 tok/s, and why MTP doesn't help

Qwen 3.6 35B MoE benchmark on RTX 5080: 56 tok/s at 128k context; Multi-Token Prediction offers no speed gain at scale.

u/gaztrab·1 month ago·48 pts / 46 comm

OpenAI· FRONTIER

Introducing OpenAI for Singapore

OpenAI announces multi-year partnership in Singapore for AI deployment, talent development, and enterprise/public sector adoption.

OpenAI·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Conditioning Gaussian Processes on Almost Anything

Equivalence between Gaussian processes and linear diffusion models enabling likelihood-guided conditioning beyond conjugate settings.

Henry Moss·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Efficient Banzhaf-Based Data Valuation for $k$-Nearest Neighbors Classification

Data valuation, the task of quantifying the contribution of individual data points to model performance, has emerged as a fundamental challenge in machine learning. Game-theoretic approaches, such as the Banzhaf value, offer principled frameworks for fair data valuation; however, they suffer from exponential computational complexity. We address this challenge by developing efficient algorithms specifically tailored for computing Banzhaf values in $k$-nearest neighbor ($k$NN) classifiers. We first establish the theoretical hardness of the problem by proving that it is \#P-hard. Despite this in...

Guangyi Zhang·1 month ago
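
For readers unfamiliar with the term in the abstract above, the standard game-theoretic Banzhaf value can be written as follows. The symbols $N$ (the set of $n$ data points), $v$ (a utility function such as classifier accuracy), and $\phi_i$ are notation assumed for this note; the paper's own $k$NN-specific formulation may differ.

```latex
\phi_i(v) \;=\; \frac{1}{2^{\,n-1}} \sum_{S \subseteq N \setminus \{i\}} \bigl[\, v(S \cup \{i\}) - v(S) \,\bigr]
```

The sum ranges over all $2^{n-1}$ subsets that exclude point $i$, which is where the exponential cost mentioned in the abstract comes from when the value is computed directly.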

30 stories