The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Greening AI Inference with Accuracy and Latency-aware User Incentives

The widespread use of AI services has raised concerns for its environmental sustainability, towards which recent studies have identified carbon emissions of AI inference as the major contributor. This paper introduces a framework for designing AI inference incentives based on the users' valuation for inference quality and latency, together with their environmental consciousness, while accounting for the tradeoff between carbon emissions and the two QoE parameters. Our approach can accommodate different tradeoffs, that depend on the size and complexity of the AI models and the allocation of re...

Vasilios A. Siris·25 days ago

The Archive

Greening AI Inference with Accuracy and Latency-aware User Incentives

Normal Guidance is what Attention Needs

3D-printable humanoid legs let robotics experiments run wild

Risk Averse Alert Prioritization for IDS Using Subnormal Gaussian Fuzzy Models

Self-Ensembling Vision-Language Models for Chart Data Extraction

Probing Cultural Awareness in LLMs: A Case Study of Cross-Culture Aesthetic Stylistics

Separating Semantic Competition from Context Length in RAG Reading

BASIS: Batchwise Advantage Estimation from Single-Rollout Information Sharing for LLM Reasoning

Detectability in Diversity: Improved Canary Crafting for Privacy Auditing in One Run

It's Not Always Sycophancy: Measuring LLM Conformity as a Function of Epistemic Uncertainty

Falcon-X: A Time Series Foundation Model for Heterogeneous Multivariate Modeling

FineVLA: Fine-Grained Instruction Alignment for Steerable Vision-Language-Action Policies

Causal Risk Minimization for High-Dimensional Treatments

SIA: Self Improving AI with Harness & Weight Updates

me at hour 3 of prompting claude to verify something i could've just checked myself

Transfer Learning using 66 Diseases for Disease Forecasting Applications

Lost in Sampling: Assessing Lexical Reachability in LLMs via the Word Coverage Score (WCS)

Kan Extension Transformers: A Categorical Unification of Attention, Diffusion, and Predict-Detach Self-Conditioning

PilotTTS: A Disciplined Modular Recipe for Competitive Speech Synthesis

Pair-In, Pair-Out: Latent Multi-Token Prediction for Efficient LLMs

LUCoS: Latent Unsupervised Context Selection for Tabular Foundation Models

Gumbel Machine: Counterfactual Student Writing Generation via Gumbel Noise Steering

Claude keeps telling me to do something

Many Logics, One Methodology: A Plea for Logical Pluralism in Formalised Reasoning (preprint)

Demis Hassabis now says AGI could arrive in just 3 years in 2029

Symbolic Regression via Latent Iterative Refinement

ENPMR-Bench: Benchmarking Proactive Memory Retrieval for Emotional Support Agents

Temporal Simultaneity Predicts Annotation Quality in Sentiment Corpora

Explainable Comparison of Feature-Based and Deep Learning Models for TROPOMI Methane Plume Screening

The Coverage Illusion: From Pre-retrieval Routing Failure to Post-retrieval Cascades in a Production RAG System