The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


torchtune: PyTorch native post-training library

PyTorch native library (torchtune) for LLM post-training with emphasis on modularity, fine-tuning, and extensibility for open-weight model adaptation.

Mark Obozov·1 month ago

Ars Technica AI· PRESS

Buckle up: Google is set to remake search with agentic AI in 2026

Google's AI search evolution is accelerating at I/O 2026.

Ryan Whitwam·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Neural Negative Binomial Regression for Weekly Seismicity Forecasting: Per-Cell Dispersion Estimation and Tail Risk Assessment

Neural Negative Binomial Regression for seismic forecasting in Central Asia; rejects Poisson assumption and achieves 12.5% lower CRPS than baseline.

Alim Igilik·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Gaussian Sheaf Neural Networks

Gaussian Sheaf Neural Networks preserve geometric structure of probability distribution node features in GNNs instead of naively vectorizing means and covariances.

André Ribeiro·1 month ago

TechCrunch AI· PRESS

OpenAI barrels towards IPO that may happen in September

A day after Elon Musk lost his lawsuit that threatened OpenAI's structure, leadership and finances, OpenAI is reportedly back to prepping for its IPO.

Julie Bort·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

roto 2.0: The Robot Tactile Olympiad

roto 2.0: GPU-parallelized tactile RL benchmark across four robotic morphologies, emphasizing blind manipulation without state information; agents achieve 13 Baoding ball rotations.

Elle Miller·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Polynomial-Time Robust Multiclass Linear Classification under Gaussian Marginals

Polynomial-time algorithm for agnostic multiclass linear classification under Gaussian marginals; extends beyond binary case with improved complexity bounds.

Ilias Diakonikolas·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

PALS: Power-Aware LLM Serving for Mixture-of-Experts Models

PALS: power-aware runtime for LLM inference on MoE models jointly optimizing GPU power caps with batch size and scheduling to reduce data center energy consumption.

Can Hankendi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Adaptive Signal Resuscitation: Channel-wise Post-Pruning Repair for Sparse Vision Networks

Channel-wise post-pruning repair technique (Adaptive Signal Resuscitation) for sparse vision networks addressing accuracy collapse in high-sparsity regimes.

Qishi Zhan·1 month ago

r/ClaudeAI· COMMUNITY

How to address vibe coding at the professional level?

Reddit discussion on professional AI-assisted coding practices and code quality concerns when senior engineers use LLMs without planning or testing.

u/AnonymousLad666·1 month ago·26 pts / 49 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Preference-aware Influence-function-based Data Selection Method for Efficient Fine-Tuning

PRISM: preference-aware influence-function data selection for efficient LLM fine-tuning that prioritizes training examples by relevance to current model behavior.

Qihao Lin·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

HiRes: Inspectable Precedent Memory for Reaction Condition Recommendation

HiRes applies graph neural networks and k-NN retrieval to chemical reaction condition recommendation with interpretable precedent memory.

Shreyas Vinaya Sathyanarayana·1 month ago

r/LocalLLaMA· COMMUNITY

Move to backend sampling for MTP draft path by gaugarg-nv · Pull Request #23287 · ggml-org/llama.cpp

llama.cpp PR #23287 optimizes MTP (multi-token prediction) draft sampling by moving the sampling logic to the backend, improving inference performance.

u/jacek2023·1 month ago·45 pts / 26 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

FedCritic: Serverless Federated Critic Learning-based Resource Allocation for Multi-Cell OFDMA in 6G

FedCritic uses federated multi-agent actor-critic learning for distributed resource allocation in 6G networks under interference constraints.

Amin Farajzadeh·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Ordering Matters: Rank-Aware Selective Fusion for Blended Emotion Recognition

Rank-aware selective fusion framework for multimodal emotion recognition that gates and combines complementary video and audio encoders.

Junghyun Lee·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Teaching AI Through Benchmark Construction: QuestBench as a Course-Based Practice for Accountable Knowledge Work

QuestBench course pedagogy teaches AI literacy through student-constructed benchmarks for evaluating deep research systems.

Haiyang Shen·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Stdlib or Third-Party? Empirical Performance and Correctness of LLM-Assisted Zero-Dependency Python Libraries

Zerodep empirically evaluates LLM-assisted stdlib-only Python library reimplementations versus third-party dependencies for correctness and performance.

Peng Ding·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

What Twelve LLM Agent Benchmark Papers Disclose About Themselves: A Pilot Audit and an Open Scoring Schema

Audit of 12 LLM agent benchmark papers reveals poor reproducibility; proposes standardized schema for disclosing evaluation harness details.

Mahdi Naser Moghadasi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Quantifying the cross-linguistic effects of syncretism on agreement attraction

Cross-linguistic study using LLM surprisal and attention entropy to probe morphological syncretism effects on grammatical agreement attraction.

Utku Turk·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Memorisation, convergence and generalisation in generative models

Investigates memorization vs. distribution learning in diffusion models by measuring convergence on disjoint dataset subsets.

Antoine Maillard·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Open-source LLMs administer maximum electric shocks in a Milgram-like obedience experiment

Milgram obedience variant on 11 open-source LLMs shows most models comply with authority pressure in sustained decision-making; safety concern for agents.

Roland Pihlakas·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Towards Resilient and Autonomous Networks: A BlueSky Vision on AI-Native 6G

6G vision paper advocates native AI integration via foundation models and multi-agent orchestration to shift from network-for-AI to AI-for-network.

Liang Wu·1 month ago

r/LocalLLaMA· COMMUNITY

I guess 4 units wasn’t enough.

Reddit user discusses difficulty scaling local LLM inference on 4U GPU server hardware with 500GB RAM.

u/Simple_Library_2700·1 month ago·44 pts / 22 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Post-Hoc Understanding of Metaphor Processing in Decoder-Only Language Models via Conditional Scale Entropy

Conditional scale entropy isolates how transformers process metaphor across layers via wavelet-derived structural patterns.

Lawhori Chakrabarti·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Designing Conversations with the Dead: How People Engage with Generative Ghosts

Qualitative study of 16 users exploring design choices in AI systems trained on deceased persons' data.

Jack Manning·1 month ago

Google AI (Gemma)· FRONTIER

A new experiment brings better group meetings to Google Beam

Google Beam experiment adds spatial audio and life-size video rendering for hybrid meetings.

Mohamed Abdelgany·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

On the Regularity and Generalization of One-Step Wasserstein-guided Generative Models for PDE-Induced Measures

Theoretical framework establishing regularity and generalization bounds for one-step Wasserstein-guided generative models on PDE measures.

Likun Lin·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SpecBench: Measuring Reward Hacking in Long-Horizon Coding Agents

SpecBench quantifies reward hacking in long-horizon coding agents via held-out tests beyond visible validation suites.

Bingchen Zhao·1 month ago

The Verge AI· PRESS

You can now remix other people’s YouTube Shorts with AI

Google announced a new YouTube Shorts Remix feature that lets users restyle clips or even insert themselves into other people's videos using Gemini Omni. Now, at the bottom of a YouTube Short, when you click the remix icon, you'll see an option to "reimagine" it. Here, you can prompt Gemini to turn a video into pixel art, an anime, or a found-footage horror film. But, beyond that, you can also alter the contents by, say, inflating heads, inserting background actors, dressing people in pirate costumes, or even putting yourself in the clip. Creators can enable or disable the ability to reimagin...

Terrence O’Brien·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Disentangling Generation and Regression in Stochastic Interpolants for Controllable Image Restoration

DiSI framework unifies diffusion-based and regression approaches for image restoration via disentangled stochastic interpolants.

Yi Liu·1 month ago

30 stories
