The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Plants from OpenAI, Meta, xAI, and Microsoft could emit more than 129M tons annually.

Molly Taft, wired.com ·2 months ago

Agentic AI-assisted coding offers a unique opportunity to instill epistemic grounding during software development

Proposes GROUNDING.md field-scoped epistemic grounding document to improve agentic AI coding reliability in scientific domains.

Magnus Palmblad·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Bridging the Training-Deployment Gap: Gated Encoding and Multi-Scale Refinement for Efficient Quantization-Aware Image Enhancement

Mobile image enhancement model with gated encoding and multi-scale refinement bridging training-deployment gap for quantized inference.

Dat To-Thanh·2 months ago

r/singularity· COMMUNITY

Spud time is nigh!

Unsubstantiated post with no context or verifiable claim.

u/Alex__007·2 months ago·267 pts / 80 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Enabling and Inhibitory Pathways of University Students' Willingness to Disclose AI Use: A Cognition-Affect-Conation Perspective

Mixed-methods study on university students' psychological willingness to disclose AI use via Cognition-Affect-Conation framework.

Yiran Du·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Ramen: Robust Test-Time Adaptation of Vision-Language Models with Active Sample Selection

Ramen framework enables test-time adaptation of vision-language models under mixed-domain distribution shifts via active sample selection.

Wenxuan Bao·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

AEL: Agent Evolving Learning for Open-Ended Environments

Agent Evolving Learning framework enables LLM agents to accumulate and leverage experience across open-ended episodes via two-timescale Thompson sampling.

Wujiang Xu·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond N-gram: Data-Aware X-GRAM Extraction for Efficient Embedding Parameter Scaling

X-GRAM proposes frequency-aware dynamic token injection and hybrid hashing to improve embedding parameter efficiency for large lookup tables.

Yilong Chen·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

From If-Statements to ML Pipelines: Revisiting Bias in Code-Generation

Code LLMs generate ML pipelines with 87.7% inclusion of sensitive attributes in feature selection, revealing underestimated bias beyond simple conditional statements.

Minh Duc Bui·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Fairness under uncertainty in sequential decisions

Framework for fairness in sequential decision-making under uncertainty via counterfactual inference, addressing online ML applications with cascading decisions.

Michelle Seng Ah Lee·2 months ago

r/LocalLLaMA· COMMUNITY

Tencent Releases Hy3 preview - Open Source 295B 21B Active MoE

Tencent releases Hy3 preview: 295B MoE model with 21B active parameters on Hugging Face.

u/TKGaming_11·2 months ago·130 pts / 33 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Phonological Subspace Collapse Is Aetiology-Specific and Cross-Lingually Stable: Evidence from 3,374 Speakers

Cross-lingual dysarthria severity assessment across 3,374 speakers from 12 languages shows aetiology-specific phonological degradation patterns in SSL speech models.

Bernard Muller·2 months ago

r/LocalLLaMA· COMMUNITY

An Overnight Stack for Qwen3.6–27B: 85 TPS, 125K Context, Vision — on One RTX 3090 | by Wasif Basharat | Apr, 2026

Technical walkthrough: Qwen 3.6 27B achieves 85 TPS, 125K context on single RTX 3090 using llama.cpp.

u/AmazingDrivers4u·2 months ago·216 pts / 70 comm

r/LocalLLaMA· COMMUNITY

Been using PI Coding Agent with local Qwen3.6 35b for a while now and its actually insane

User reports success running Qwen 3.6 35B with local coding agent using structured planning skill file on production code.

u/SoAp9035·2 months ago·195 pts / 95 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers

BadStyle backdoor attack framework uses natural language style triggers against LLMs with reliable payload injection in long-form generation.

Jiali Wei·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Fixation Sequences as Time Series: A Topological Approach to Dyslexia Detection

Topological data analysis via persistent homology applied to eye-tracking fixation sequences for dyslexia detection using hybrid statistical-topological features.

Marius Huber·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Towards Universal Tabular Embeddings: A Benchmark Across Data Tasks

TEmBed benchmark evaluates tabular foundation models across four representation levels for table retrieval, semantic search, and prediction tasks.

Liane Vogel·2 months ago

r/Anthropic· COMMUNITY

Did claude just change their token window to be 5h from exactly when you start?

Usually the Claude session window always starts and ends on the full hour but today I just noticed it's not matching up with the full hour. Another small change is they removed the refresh button from their UI so you can't see your real-time usage without refreshing the whole page. Is this happening for everyone else? Are they changing some things about their usage limits?

u/StarFlower0429·2 months ago·21 pts / 11 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Efficient Logic Gate Networks for Video Copy Detection

Logic Gate Networks replace floating-point feature extractors with binary representations for compact, efficient video copy detection at scale.

Katarzyna Fojcik·2 months ago

The Verge AI· PRESS

THE PEOPLE DO NOT YEARN FOR AUTOMATION

Today on Decoder, I want to lay out an idea that’s been banging around my head for weeks now as we’ve been reporting on AI and having conversations here on this show. I’ve been calling it software brain, and it’s a particular way of seeing the world that fits everything into algorithms, databases and loops — software. Software brain is powerful stuff. It’s a way of thinking that basically created our modern world. Marc Andreessen, the literal embodiment of software brain, called it in 2011 when he wrote the piece “Why software is eating the world” as an op-ed in The Wall Street Journal. But s...

Nilay Patel·2 months ago

TechCrunch AI· PRESS

Grab a ticket today: The first StrictlyVC of 2026 kicks off in just a week in San Francisco

StrictlyVC San Francisco is in just a week. Now’s the time to grab yourself a ticket. Join VCs and founders at Sentro Filipino Cultural Center on April 30.

TechCrunch Events·2 months ago

TechCrunch AI· PRESS

Another customer of troubled startup Delve suffered a big security incident

TechCrunch has confirmed that Delve was the compliance company that performed the security certifications for Context AI, the AI agent training startup that last week disclosed a security incident.

Julie Bort·2 months ago

r/OpenAI· COMMUNITY

Chinese Workers Horrified as Bosses Direct Them to Train Their AI Replacements

Reddit anecdote claims Chinese workers at OpenAI contractor training AI models that may displace them; unverified sourcing and secondhand reporting.

u/EchoOfOppenheimer·2 months ago·92 pts / 12 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

There Will Be a Scientific Theory of Deep Learning

Opinion piece identifying five research strands—solvable settings, tractable limits, feature analysis, loss landscapes, implicit regularization—converging toward unified deep learning theory.

Jamie Simon·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2

AttnLRP explanation method applied to DNABERT-2 genome language model reveals whether Transformer attention captures relevant genomic patterns versus CNNs.

Isabel Kurth·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A-IC3: Learning-Guided Adaptive Inductive Generalization for Hardware Model Checking

A-IC3 augments IC3 hardware model checking with learning-guided inductive generalization to accelerate counterexample generalization and clause synthesis.

Xiaofeng Zhou·2 months ago

r/Anthropic· COMMUNITY

Jensen Huang basically said US chip export controls might be creating the problem they are trying to solve.

He said it on the [Dwarkesh Podcast ](https://mrkt30.com/anthropic-mythos-triggers-chinas-ai-arms-frenzy/)this week and I have not been able to stop thinking about it. His argument was not that China is not a threat. It was that cutting them off and treating them as an enemy is probably not the smartest long term play. His actual words were that victimising them and turning them into an enemy likely is not the best answer. The context here is Huawei targeting 750,000 AI chip shipments this year. It is nowhere near Nvidia's compute but the direction of travel is clear. And if DeepSeek ends u...

u/Odd_Row1657·2 months ago·10 pts / 14 comm

The Verge AI· PRESS

You’re about to feel the AI money squeeze

Earlier this month, millions of OpenClaw users woke up to a sweeping mandate: The viral AI agent tool, which this year took the worldwide tech industry by storm, had been severely restricted by Anthropic. Anthropic, like other leading AI labs, was under immense pressure to lessen the strain on its systems and start turning a profit. So if the users wanted its Claude AI to power their popular agents, they'd have to start paying handsomely for the privilege. "Our subscriptions weren't built for the usage patterns of these third-party tools," wrote Boris Cherny, head of Claude Code, on X. "We wa...

Hayden Field·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Geometric Monomial (GEM): a family of rational 2N-differentiable activation functions

GEM: smooth rational activation functions matching ReLU performance with C^2N differentiability for deep networks.

Eylon E. Krause·2 months ago

Simon Willison· ANALYST

Quoting Maggie Appleton

Maggie Appleton on social signaling benefits of public learning via blogging and podcasting.

Simon Willison·2 months ago

← Front Page30 stories

← Newer Older →

The Archive

Greenhouse gases from data center boom could outpace entire nations

Agentic AI-assisted coding offers a unique opportunity to instill epistemic grounding during software development

Bridging the Training-Deployment Gap: Gated Encoding and Multi-Scale Refinement for Efficient Quantization-Aware Image Enhancement

Spud time is nigh!

Enabling and Inhibitory Pathways of University Students' Willingness to Disclose AI Use: A Cognition-Affect-Conation Perspective

Ramen: Robust Test-Time Adaptation of Vision-Language Models with Active Sample Selection

AEL: Agent Evolving Learning for Open-Ended Environments

Beyond N-gram: Data-Aware X-GRAM Extraction for Efficient Embedding Parameter Scaling

From If-Statements to ML Pipelines: Revisiting Bias in Code-Generation

Fairness under uncertainty in sequential decisions

Tencent Releases Hy3 preview - Open Source 295B 21B Active MoE

Phonological Subspace Collapse Is Aetiology-Specific and Cross-Lingually Stable: Evidence from 3,374 Speakers

An Overnight Stack for Qwen3.6–27B: 85 TPS, 125K Context, Vision — on One RTX 3090 | by Wasif Basharat | Apr, 2026

Been using PI Coding Agent with local Qwen3.6 35b for a while now and its actually insane

Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers

Fixation Sequences as Time Series: A Topological Approach to Dyslexia Detection

Towards Universal Tabular Embeddings: A Benchmark Across Data Tasks

Did claude just change their token window to be 5h from exactly when you start?

Efficient Logic Gate Networks for Video Copy Detection

THE PEOPLE DO NOT YEARN FOR AUTOMATION

Grab a ticket today: The first StrictlyVC of 2026 kicks off in just a week in San Francisco

Another customer of troubled startup Delve suffered a big security incident

Chinese Workers Horrified as Bosses Direct Them to Train Their AI Replacements

There Will Be a Scientific Theory of Deep Learning

Evaluating Post-hoc Explanations of the Transformer-based Genome Language Model DNABERT-2

A-IC3: Learning-Guided Adaptive Inductive Generalization for Hardware Model Checking

Jensen Huang basically said US chip export controls might be creating the problem they are trying to solve.

You’re about to feel the AI money squeeze

Geometric Monomial (GEM): a family of rational 2N-differentiable activation functions

Quoting Maggie Appleton