The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

China Sought Access to Anthropic’s Newest A.I. The Answer Was No.

Link: nytimes.com

u/ThereWas·1 month ago·10 pts / 3 comm

TabPFN-3 just released: a pre-trained tabular foundation model for up to 1M rows [R][N]

TabPFN-3 was released today, the next iteration of the tabular foundation model, originally published in Nature. Quick recap for anyone new to TabPFN: TabPFN predicts on tabular data in a single forward pass - no training, no hyperparameter search, no tuning. Built on TabPFN-2.5 (Nov 2025) and TabPFNv2 (Nature, Jan 2025), which together crossed 3M downloads and 200+ published applications. What's new: * Scale: 1M rows on a single H100 (10x larger than 2.5). A reduced KV cache (~8GB per million rows per estimator) and row-chunked inference make this practical on a single GPU. * Speed: 10x-10...

u/rsesrsfh·1 month ago·30 pts / 5 comm

r/ClaudeAI· COMMUNITY

Keep losing great answers in long Claude chats

Reddit user describes friction in Claude's UI for retrieving specific answers from long conversations; suggests workaround of manual copying.

u/Embarrassed-Slip8094·1 month ago·29 pts / 10 comm

r/ClaudeAI· COMMUNITY

Git push ftw

u/Outrageous_Zone3242·1 month ago·44 pts / 5 comm

r/Anthropic· COMMUNITY

oh lovely anthropic

Reddit user complains about Claude API plan limits and the rate-cap implementation, and claims that marketing misrepresented compute capacity gains.

u/Perfect-Lab-1791·1 month ago·10 pts / 21 comm

Ars Technica AI· PRESS

Amazon employees are "tokenmaxxing" due to pressure to use AI tools

Workers are using an internal AI tool to automate non-essential tasks.

Rafe Rosner-Uddin, Financial Times·1 month ago

r/LocalLLaMA· COMMUNITY

Gemma 4 MTP vs DFlash on 1x H100: dense vs MoE results

Benchmark comparing Gemma 4 multi-token prediction vs. DFlash speculative decoding on H100 using vLLM and SPEED-Bench dataset.

u/LayerHot·1 month ago·42 pts / 18 comm

TechCrunch AI· PRESS

Dessn raises $6M for its production focused design tool

A new startup called Dessn has raised $6M to build AI-powered design tools that work directly with production codebases.

Ivan Mehta·1 month ago

r/LocalLLaMA· COMMUNITY

examples : add llama-eval by ggerganov · Pull Request #21152 · ggml-org/llama.cpp

llama.cpp adds llama-eval benchmarking tool supporting AIME, GSM8K, GPQA for local quantized model evaluation.

u/jacek2023·1 month ago·41 pts / 13 comm

r/ClaudeAI· COMMUNITY

I built a Claude Code plugin that actually enforces your rules instead of hoping the model follows them

Been using Claude Code heavily and kept running into the same thing everyone here talks about: the model ignores your rules. You tell it to write tests first, it writes the implementation. You give it coding standards, it cherry-picks which ones to follow. And as your rulebook grows, you're burning more and more tokens stuffing everything into context when only a handful of rules are relevant to what you're working on. So I built Writ. Two pieces: A retrieval engine that picks only the relevant rules and skills for the current task. It runs a five stage pipeline over a Neo4j knowledge graph...

u/InfinriDev·1 month ago·20 pts / 7 comm

A Transfer Learning Evaluation of Deep Neural Networks for Image Classification

Random-Set Graph Neural Networks

On the Limitations of Large Language Models for Conceptual Database Modeling

QDSB: Quantized Diffusion Schrödinger Bridges

High-lift Wing Separation Control via Bayesian Optimization and Deep Reinforcement Learning

On Predicting the Post-training Potential of Pre-trained LLMs

Stop wasting electricity

Stochastic Minimum-Cost Reach-Avoid Reinforcement Learning

Towards Order Fairness: Mitigating LLMs Order Sensitivity through Dual Group Advantage Optimization

AI voice startup Vapi hits $500M valuation after winning Amazon Ring over 40 rivals

Cooperative Robotics Reinforced by Collective Perception for Traffic Moderation

NOFE -- Neural Operator Function Embedding

Assessment of cloud and associated radiation fields from a GAN stochastic cloud subcolumn generator

Can we acknowledge that Anthropic watches open sourcers and copies them?

Enhancing Target-Guided Proactive Dialogue Systems via Conversational Scenario Modeling and Intent-Keyword Bridging

‘It’s here’: Google issues dire warning after catching hackers using AI to break into computers

Multimodal Abstractive Summarization of Instructional Videos with Vision-Language Models

Assessing and Mitigating Miscalibration in LLM-Based Social Science Measurement

Counterfactual Trace Auditing of LLM Agent Skills

From Noise to Diversity: Random Embedding Injection in LLM Reasoning