The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


Stability and Generalization for Decentralized Markov SGD

Stability analysis of decentralized SGD/SGDA under Markov chain sampling characterizes generalization with dependent data.

Jiahuan Wang·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Probe-Geometry Alignment: Erasing the Cross-Sequence Memorization Signature Below Chance

Probe-Geometry Alignment surgically removes memorization traces from unlearned LLMs via cross-sequence detection without capability loss.

Anamika Paul Rupa·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

BIM Information Extraction Through LLM-based Adaptive Exploration

LLM-based agent uses adaptive code exploration to extract information from heterogeneous BIM models at runtime, evaluated on ifc-bench v2.

Sylvain Hellin·2 months ago

r/ClaudeAI· COMMUNITY

How can I use Claude as a project manager?

Reddit discussion on using Claude for project management and meeting tracking over multi-year timelines.

u/No_Bite_Kite·2 months ago·20 pts / 18 comm

r/LocalLLaMA· COMMUNITY

Qwen3.6-27B vs Coder-Next

Empirical comparison of Qwen3.6-27B and Coder-Next models across 40 test cases shows statistical parity with task-dependent tradeoffs.

u/Signal_Ad657·2 months ago·48 pts / 13 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Latent State Design for World Models under Sufficiency Constraints

Functional taxonomy of world models organizes latent state design by downstream task (prediction, control, planning, grounding) rather than architecture.

Keon Woo Kim·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Complex Diffusion Maps with $ω$-Parameterized Kernels Revealing Inherent Harmonic Representations

Complex Diffusion Maps framework uses ω-parameterized kernels to extract harmonic structure from high-dimensional data.

Tongzhen Dang·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

GRAVITY: Architecture-Agnostic Structured Anchoring for Long-Horizon Conversational Memory

GRAVITY module injects relational, temporal, and thematic structure into conversational memory retrieval for long-horizon agents.

Yushi Sun·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MultiBreak: A Scalable and Diverse Multi-turn Jailbreak Benchmark for Evaluating LLM Safety

MultiBreak benchmark evaluates LLM safety via scalable multi-turn jailbreaks using active learning to generate diverse adversarial prompts.

Jialin Song·2 months ago

r/ClaudeAI· COMMUNITY

Are there privacy concerns regarding Cowork or connecting Claude to your cloud or emails?

Reddit user expresses privacy concerns about Claude's Cowork feature and data handling when connecting to cloud/email services.

u/NavXIII·2 months ago·20 pts / 31 comm

r/ClaudeAI· COMMUNITY

I left my Agent OS running overnight and it built 4 new tools I didn't even ask for

Engineer describes autonomous agent system with self-generating tools that can write, test, and register new capabilities without user intervention.

u/TheOnlyVibemaster·2 months ago·25 pts / 27 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Benchmarking Single-Pose Docking, Consensus Rescoring, and Supervised ML on the LIT-PCBA Library: A Critical Evaluation of DiffDock, AutoDock-GPU, GNINA, and DiffDock-NMDN

Large-scale evaluation of DiffDock, AutoDock-GPU, GNINA, and DiffDock-NMDN on the LIT-PCBA library (15 targets, 578K pairs) for molecular docking.

Youssef Abo-Dahab·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Class-Aware Adaptive Differential Privacy in Deep Learning for Sensor-Based Fall Detection

Class-Aware Adaptive Differential Privacy framework for sensor-based fall detection using 3D CNN and BiLSTM with per-class noise tuning.

Joydeb Kumar Sana·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Missingness-aware Data Imputation via AI-powered Bayesian Generative Modeling

MissBGM method imputes missing data via Bayesian generative modeling with explicit missingness mechanism and posterior uncertainty quantification.

Qiao Liu·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CP-SynC: Multi-Agent Zero-Shot Constraint Modeling in MiniZinc with Synthesized Checkers

CP-SynC multi-agent workflow translates natural language to MiniZinc constraint models using synthesized checkers for semantic validation.

Yuliang Song·2 months ago

r/LocalLLaMA· COMMUNITY

Karpathy's MicroGPT running at 50,000 tps on an FPGA

FPGA-based inference of 4,192-parameter MicroGPT achieves 50k tokens/sec throughput using onboard ROM weight storage.

u/jawondo·2 months ago·52 pts / 11 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

PRCD-MAP: Learning How Much to Trust Imperfect Priors in Causal Discovery

PRCD-MAP assigns per-edge trust weights to heterogeneous priors (physics vs. LLM) in causal discovery via soft prior-consumption layer.

Xihang Shan·2 months ago

r/LocalLLaMA· COMMUNITY

GPT 5.5 just leaked its chain of thought to me in codex, and it looks like an idea from 5 months ago in this sub.

Reddit post claiming GPT-5.5 output resembles a community suggestion from 5 months prior; unverified anecdote without official announcement or evidence.

u/Homeschooled316·2 months ago·45 pts / 26 comm

r/singularity· COMMUNITY

Software engineering jobs hit their highest posting since november 2023

Software engineering job postings reach their highest level since Nov 2023; Reddit commentary suggests continued demand for prompt engineering and model operation roles.

u/artemisgarden·2 months ago·100 pts / 50 comm

r/singularity· COMMUNITY

Ok This is Trippy

Reddit post with no substantive content; insufficient for professional analysis.

u/scoobydobydobydo·2 months ago·208 pts / 20 comm

r/LocalLLaMA· COMMUNITY

Qwen3.6-27B vs 35B, I prefer 35B but more people here post about 27B...

User reports preferring Qwen 35B over 27B for coding/research pipelines on local hardware despite 27B popularity.

u/Snoo_27681·2 months ago·53 pts / 53 comm

r/LocalLLaMA· COMMUNITY

I made a visualizer for Hugging Face models

I built hfviewer.com, a small tool for visually exploring Hugging Face model architectures. You can paste a Hugging Face URL and get an interactive visualization of the architecture, which can make it easier to understand how different models are structured and compare them at a glance. Here is the recent Qwen3.6-27B model as an example: https://hfviewer.com/Qwen/Qwen3.6-27B And here is a side-by-side view of the Gemma 4 family: https://hfviewer.com/family/gemma-4 Feel free t...

u/Course_Latter·2 months ago·89 pts / 10 comm

r/LocalLLaMA· COMMUNITY

Tinygrad Driver testing!

Tinygrad driver testing on Blackwell + M3 Ultra RDMA cluster; seeks benchmark suggestions from community.

u/Street-Buyer-2428·2 months ago·43 pts / 29 comm

r/Anthropic· COMMUNITY

Do you remember when they said prompt engineering was a thing of the past?

Not that long ago, the pitch was that newer models would make prompt engineering mostly obsolete. You would not need elaborate prompting to get optimal performance. You could just ask for what you wanted, and the model would understand the task well enough to do it properly. Now, with Claude, it feels like the opposite. You often need to build hard rails around the task just to stop it from doing the laziest technically defensible version of what you asked for. To be clear, you can still get good results. But it often needs constant preemptive reminders to be thorough. Not just one reminder...

u/n_of_1234·2 months ago·10 pts / 6 comm

r/ClaudeAI· COMMUNITY

spent way too long manually steering claude code every session until i stopped doing that

Developer describes using persistent configuration to reduce Claude setup overhead across sessions, improving workflow efficiency and code quality.

u/CodinDev·2 months ago·21 pts / 17 comm

r/Anthropic· COMMUNITY

Opus 4.7 refuses to follow style guides

User reports Claude Opus 4.7 inconsistently refuses to follow custom style guides across conversations.

u/Used-Nectarine5541·2 months ago·12 pts / 28 comm

TechCrunch AI· PRESS

AI-generated actors and scripts are now ineligible for Oscars

Bad news for Tilly Norwood.

Anthony Ha·2 months ago

r/OpenAI· COMMUNITY

Courtroom sketch of Sam Altman

Musk testifies in OpenAI lawsuit, alleges company abandoned non-profit mission; courtroom sketch circulates on social media.

u/Outside-Iron-8242·2 months ago·59 pts / 14 comm

r/ClaudeAI· COMMUNITY

Non-business uses for Claude Cowork

Reddit user describes personal use cases for Claude with local HTML visualization and data aggregation from email, health, and files.

u/ilikethestuff·2 months ago·26 pts / 12 comm

r/LocalLLaMA· COMMUNITY

Ban phrases on llama.cpp with this script.

Community tool adds phrase-filtering capability to llama.cpp inference engine via GitHub script.

u/Total-Resort-3120·2 months ago·41 pts / 25 comm

30 stories
