The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

r/ClaudeAI· COMMUNITY

What's the most unexpectedly useful thing you've used Claude for?

User reports Claude effective for UX strategy and product decision pressure-testing rather than design generation.

u/HumanInTheFlow·1 month ago·74 pts / 80 comm

Ars Technica AI· PRESS

Trump canceled AI safety testing EO after snub from tech CEOs

Trump delays AI safety testing EO, claiming it would be an innovation “blocker.”

Ashley Belanger·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Decomposing Queries into Tool Calls for Long-Video Keyframe Retrieval

ToolMerge: LLM-based query decomposition for keyframe retrieval in long-video QA with learned ranking merging.

Michal Shlapentokh-Rothman·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

It's the humans, not the data: Geopolitical bias in LLMs originates in post-training, amplified by the language of the prompt

Geopolitical bias in LLMs originates post-training, not pre-training; amplified by prompt language across seven labs.

Stuart Bladon·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Hierarchical Concept Geometry in Language Models Emerges from Word Co-occurrence

Distributional theory: hypernymy geometry in word2vec emerges from co-occurrence, with eigenvectors encoding taxonomic hierarchy.

Andres Nava·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Not Too Generative, Not Too Discriminative: The Human Alignment Sweet Spot

Joint Energy-Based Models isolate discriminative vs. generative objective effects on human visual alignment using controlled architecture experiments.

Jorge Chang Ortega·1 month ago

r/Anthropic· COMMUNITY

I'm not shipping that !! Yeah, Opus4.7 said that !

Claude Opus 4.7 refused to continue work on a project after detecting a potential cross-tenant logging vulnerability, raising questions about model safety behavior in production scenarios.

u/s2k4ever·1 month ago·10 pts / 21 comm

TechCrunch AI· PRESS

You can no longer Google the word ‘disregard’

After Google Search's AI update, the word "disregard" now effectively breaks the search interface.

Russell Brandom·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Advanced AI Service Provisioning in O-RAN through LLM Engine Integration

Dual-Brain architecture combines LLM orchestration with deterministic inference for O-RAN service provisioning and xApp/rApp deployment.

Seyed Bagher Hashemi Natanzi·1 month ago

r/LocalLLaMA· COMMUNITY

ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop

ByteShape releases optimized quantization for Qwen3.6-35B achieving 30% faster inference than Unsloth on 6GB VRAM.

u/OsmanthusBloom·1 month ago·43 pts / 18 comm

r/LocalLLaMA· COMMUNITY

Experts first llama.cpp

Community fork of llama.cpp optimizes MoE inference on 12GB VRAM by loading only active experts rather than full layers.

u/comanderxv·1 month ago·40 pts / 20 comm

The Verge AI· PRESS

Google’s AI search is so broken it can ‘disregard’ what you’re looking for

Google's AI Overviews are running into an interesting problem right now. As of this writing, if you search for the term "disregard," instead of showing the usual AI-generated summary of search results, the AI Overview section includes a response like what you'd see from a more traditional AI chatbot, as called out by a post on X. To one Verge colleague searching "disregard": "Got it! Let me know if you need help with anything else." Nothing else followed in the AI Overview portion of the results page. To me, initially: "No problem at all! How can I help you today?" I searched for "disrega...

Jay Peters·1 month ago

NVIDIA Dev Blog· INFRA

Synthesize Realistic 3D Medical Images at Scale to Ship Pre‑Trained Models

High‑quality 3D medical imaging data is the foundation of modern radiology AI, but access to it is often constrained by data scarcity, privacy restrictions, and the high cost of expert annotation. As a result, training reliable 3D medical imaging models is frequently bottlenecked by small, narrow, and hard‑to‑share datasets, limiting model robustness and generalization. To help teams overcome…

Can Zhao·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Debiased Negative Mining Improves Out-of-distribution Detection with Pre-trained Vision-Language Models

Debiased negative mining technique improves out-of-distribution detection in vision-language models via semantic label selection.

Bo Peng·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond Binary Edits: Robust Multimodal Knowledge Editing with Adversarial Subspace Alignment

Adversarial subspace alignment enables robust multimodal knowledge editing in MLLMs with improved generalization across visual and linguistic variations.

Haoyuan Wang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The physics of AI weather models

AI weather models converge on similar atmospheric representations; evidence suggests they solve particle-like physics despite differing architectures.

George Craig·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Agentic Proving for Program Verification

Claude Code achieves 98.8% specification validity and 87.5% implementation certification on CLEVER program verification benchmark via agentic proving.

Alessandro Sosso·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

PhotoFlow: Agentic 3D Virtual Photography Missions

PhotoFlow Director-Reviewer-Reflector agent framework for 3D virtual photography combines spatial reasoning and aesthetic judgment through closed-loop search.

Jiarui Guo·1 month ago

r/singularity· COMMUNITY

Demis says the Singularity could be just a few years away now, potentially triggered by the arrival of true AGI

Reddit post reports Demis Hassabis claiming AGI could arrive within years and trigger singularity; lacks source or context.

u/Bizzyguy·1 month ago·101 pts / 32 comm

TechCrunch AI· PRESS

We tried Google’s AI glasses and they’re almost there

Google demoed prototype Android XR glasses that overlay Gemini-powered translation, navigation, and other information directly into your field of view.

Sarah Perez·1 month ago

r/LocalLLaMA· COMMUNITY

Qwen-27B-IQ4_KS for ik_llama.cpp, especially for NVIDIA with 16GB VRAM

cHunter789 releases Qwen-27B IQ4_KS quantization (14.1GB) optimized for 16GB NVIDIA GPUs via ik_llama.cpp.

u/Pablo_the_brave·1 month ago·52 pts / 26 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

LLM-driven design of physics-constrained constitutive models: two agents are better than one

Multi-agent LLM framework with Creator and validation agents generates physics-constrained constitutive models ensuring compliance with continuum mechanics laws.

Marius Tacke·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SeedER: Seed-and-Expand Retrieval from Knowledge Graphs

SeedER framework iteratively expands knowledge graph seeds for efficient multi-hop compositional retrieval at scale via lightweight dense embeddings.

Hamed Shirzad·1 month ago

Hugging Face· INFRA

Specialization Beats Scale: A Strategic Variable Most AI Procurement Decisions Overlook

Hugging Face·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Approaching I/O-optimality for Approximate Attention

Novel I/O-optimal attention algorithm reduces quadratic dependency on sequence length, approaching Ω(nd) lower bound for LLM inference.

Pál András Papp·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Contrast to Detect: Dynamic Graph Contrastive Regularization for Unsupervised Anomaly Detection in Multivariate Time Series

ContrastAD unsupervised framework detects anomalies in multivariate time series by modeling dynamic structural evolution via graph contrastive regularization.

Yunhua Pei·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Any2Any: Efficient Cross-Embodiment Transfer for Humanoid Whole-Body Tracking

Any2Any enables efficient transfer of whole-body tracking models across humanoid robot embodiments with minimal retraining data.

Ming Yang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Optimal Dimension-Free Sampling for Regularized Classification

Optimal dimension-free sampling bounds for regularized classification loss functions with theoretical complexity analysis.

Meysam Alishahi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

MemAudit detects poisoned records in LLM agent persistent memory via causal attribution and structural anomaly detection.

Zhewen Tan·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

NLG Evaluation: Past, Present, Future

Historical review of NLG evaluation evolution from 1990–2026, highlighting LLM-as-Judge methods and emerging safety evaluation needs.

Ehud Reiter·1 month ago

30 stories
