The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

100 things we announced at I/O 2026

Google I/O 2026: Gemini Omni and 99 other announcements; focus on multimodal AI and platform expansions.

Keyword Team·1 month ago

r/MachineLearning· COMMUNITY

How competitive are PhD admissions currently [D]

Reddit discussion on ML PhD admissions competitiveness and networking requirements across regions.

u/strammerrammer·1 month ago·32 pts / 31 comm

r/singularity· COMMUNITY

A glimpse of Level 4? OpenAI model helps challenge an 80-year-old math assumption

OpenAI model contributes to proof challenging 80-year-old mathematical conjecture, demonstrating general-purpose reasoning for novel knowledge production.

u/Eyeswideshut_91·1 month ago·222 pts / 20 comm

r/OpenAI· COMMUNITY

An OpenAI model has disproved a central conjecture in discrete geometry

OpenAI model disproved a longstanding conjecture in discrete geometry, demonstrating AI capability in mathematical research and theorem discovery.

u/Anxious_Woodpecker52·1 month ago·104 pts / 12 comm·+ covered by others

r/singularity· COMMUNITY

Midjourney says their research was set back by a year by using TPU, regrets not sticking purely with nvidia

Midjourney reports that its choice of TPU infrastructure delayed research by ~1 year; regrets not committing exclusively to NVIDIA.

u/Charuru·1 month ago·148 pts / 24 comm

r/ClaudeAI· COMMUNITY

Excuse me, viewing what?

Reddit post with unclear title and no content; appears to be incomplete or user error.

u/sacman73·1 month ago·21 pts / 11 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Variance Reduction for Expectations with Diffusion Teachers

CARV framework reduces variance in Monte Carlo gradient estimation for diffusion-model-based pipelines via hierarchical resampling.

Jesse Bettencourt·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Equilibrium Reasoners: Learning Attractors Enables Scalable Reasoning

Equilibrium Reasoners enable test-time compute scaling via learned task-conditioned attractors without external verifiers.

Benhao Huang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Quantifying Hyperparameter Transfer and the Importance of Embedding Layer Learning Rate

Framework quantifies hyperparameter transfer across model scales, revealing embedding layer learning rate criticality.

Dayal Singh Kalra·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

EvoStruct: Bridging Evolutionary and Structural Priors for Antibody CDR Design via Protein Language Model Adaptation

EvoStruct integrates protein language models with equivariant GNNs to fix vocabulary collapse in antibody CDR design.

Mansoor Ahmed·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Velocityformer: Broken-Symmetry-Matched Equivariant Graph Transformers for Cosmological Velocity Reconstruction

Velocityformer applies equivariant graph transformers to cosmological kinematic Sunyaev-Zel'dovich velocity reconstruction.

Tilman Tröster·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

DeepWeb-Bench: A Deep Research Benchmark Demanding Massive Cross-Source Evidence and Long-Horizon Derivation

DeepWeb-Bench introduces harder evaluation for frontier LLMs on deep research requiring massive cross-source evidence and reasoning.

Sixiong Xie·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

AiraXiv: An AI-Driven Open-Access Platform for Human and AI Scientists

AiraXiv proposes AI-era publishing platform enabling human and AI authors with continuous feedback-driven iteration.

Junshu Pan·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

WikiVQABench: A Knowledge-Grounded Visual Question Answering Benchmark from Wikipedia and Wikidata

WikiVQABench benchmark combines Wikipedia images with Wikidata for knowledge-grounded visual question answering evaluation.

Basel Shbita·1 month ago

Simon Willison· ANALYST

How fast is 10 tokens per second really?

Interactive tool visualizing LLM token generation speeds from 5 to 800 tokens/second for practical latency understanding.

Simon Willison·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Is Fixing Schema Graphs Necessary? Full-Resolution Graph Structure Learning for Relational Deep Learning

FROG enables learnable graph structure for relational deep learning on RDBs without fixed schema constraints.

Yi Huang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Agent JIT Compilation for Latency-Optimizing Web Agent Planning and Scheduling

Agent JIT compilation compiles task descriptions into executable code for web agents, reducing latency vs. sequential fetch-execute loops.

Caleb Winston·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

RLVR training exhibits rank-1 weight trajectory structure; minimal training captures performance gains via linear parameter evolution.

Zhepei Wei·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

DelTA interprets RLVR updates as linear discriminators over token gradients, explaining token-level probability changes in reasoning model training.

Kaiyi Zhang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Leveraging LLMs for Grammar Adaptation: A Study on Metamodel-Grammar Co-Evolution

LLM-based grammar adaptation for metamodel evolution in domain-specific languages; evaluated on Xtext DSLs.

Weixing Zhang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Mem-π: Adaptive Memory through Learning When and What to Generate

Mem-π framework generates context-specific agent guidance on-demand via dedicated model rather than static retrieval-based memory.

Xiaoqiang Wang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Machine Learning Framework for Weighted Least Squares GNSS Positioning based on Activation Functions

ML framework for GNSS positioning error mitigation in urban environments using activation function-based weighted least squares.

Pin-Hsun Lee·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

HITL-D: Human In The Loop Diffusion Assisted Shared Control

HITL-D combines diffusion policies with human control for shared autonomous manipulation, conditioning on scene point clouds.

Riley Zilka·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Mind the Sim-to-Real Gap & Think Like a Scientist

Framework for optimal simulator-experiment allocation when deploying pre-trained simulators; decomposes value error into calibration drift and parametric residual.

Harsh Parikh·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Mitigating Label Bias with Interpretable Rubric Embeddings

Rubric embeddings mitigate label bias in high-stakes prediction (hiring, admissions) by replacing black-box embeddings with interpretable representations.

Calvin Isley·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Quality and Security Signals in AI-Generated Python Refactoring Pull Requests

Empirical study of AI-generated Python refactoring PRs from AIDev dataset; assesses maintainability, code quality, and security impact.

Mohamed Almukhtar·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Approximation Theory for Neural Networks: Old and New

Survey of approximation theory for neural networks covering universal approximation, quantitative rates, depth/width efficiency over four decades.

Soumendu Sundar Mukherjee·1 month ago

The Verge AI· PRESS

Vibe coding is coming to your phone

"There's an app for that" was the promise of the App Store from the very beginning. The app that will get your phone to do the thing you want it to? It's just a few taps away. The tagline wasn't strictly true - I'm still waiting for that one perfect grocery list app. Still, apps shaped the modern smartphone into what it is today. We spend all day, every day inside of apps - scrolling, listening, and tapping until we find what we want. But your next favorite app might just be one that you made yourself. If you w...

Allison Johnson·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Lost in Fog: Sensor Perturbations Expose Reasoning Fragility in Driving VLAs

Study of Vision-Language-Action model robustness under sensor degradation in autonomous driving; Alpamayo R1 tested across 18K trials with noise, lighting, fog perturbations.

Abhinaw Priyadershi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

TempGlitch: Evaluating Vision-Language Models for Temporal Glitch Detection in Gameplay Videos

Benchmark distinguishing temporal vs. spatial glitch detection in VLMs for game quality assurance; finds temporal glitches substantially harder than frame-level anomalies.

Yakun Yu·1 month ago

← Front Page · 30 stories
