The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


I can't get with 4.7

Reddit user reports Claude Opus 4.7 exhibits reduced effort, defensive reasoning, and response padding compared to 4.5/4.6.

u/Hiro_of_Lunar·1 month ago·20 pts / 25 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Agentifying Patient Dynamics within LLMs through Interacting with Clinical World Model

SepsisAgent augments LLM with learned Clinical World Model to ground sepsis treatment decisions via propose-simulate-refine workflow.

Minghao Wu·1 month ago

r/Anthropic· COMMUNITY

Anthropic’s Claude Helps Recover Lost Bitcoin Wallet Holding $400K After 11 Years

Link: blocknow.com

u/andix3·1 month ago·10 pts / 9 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

On Strong Equivalence Notions in Logic Programming and Abstract Argumentation

Analysis of strong equivalence properties in logic programming and abstract argumentation frameworks under dynamic update semantics.

Giovanni Buraglio·1 month ago

r/MachineLearning· COMMUNITY

Would a 2000-2021 ML paper even get accepted today? [D]

Discussion of whether ML papers from 2000-2021 would meet current acceptance standards, exploring if field rigor has increased or just competition.

u/Hope999991·1 month ago·30 pts / 38 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Towards Label-Free Single-Cell Phenotyping Using Multi-Task Learning

Multi-task deep learning framework for label-free single-cell phenotyping via WBC classification and protein-expression regression from DPC images.

Saqib Nazir·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

AnchorRoute: Human Motion Synthesis with Interval-Routed Sparse Control

AnchorRoute uses sparse anchor scaffolds and interval-routed diffusion for full-body human motion synthesis from partial user specifications.

Pengcheng Fang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

IntentVLA: Short-Horizon Intent Modeling for Aliased Robot Manipulation

IntentVLA encodes visual history to disambiguate multimodal robot imitation data with variable short-horizon intents, reducing replanning conflicts.

Shijie Lian·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Vision-Core Guided Contrastive Learning for Balanced Multi-modal Prognosis Prediction of Stroke

Vision-core guided contrastive learning framework for tri-modal stroke prognosis integrating medical images, clinical data, and text.

Liren Chen·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization

SceneFunRI benchmark tests vision-language models on inferring locations of occluded objects from task context using SceneFun3D dataset.

Posheng Chen·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

NeuroAtlas: Benchmarking Foundation Models for Clinical EEG and Brain-Computer Interfaces

NeuroAtlas: largest EEG benchmark (42 datasets, 260k hours) evaluating foundation model generalization across clinical neurophysiology tasks.

Konstantinos Kontras·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Rate-Distortion-Polysemanticity Tradeoff in SAEs

Paper characterizes Rate-Distortion-Polysemanticity tradeoff in Sparse Autoencoders, showing monosemantic interpretability requires reconstruction loss.

Tommaso Mencattini·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ReMIA: a Powerful and Efficient Alternative to Membership Inference Attacks against Synthetic Data Generators

ReMIA: efficient membership inference attack alternative for synthetic data generators avoiding shadow modeling and auxiliary data requirements.

Davide Scassola·1 month ago

Ars Technica AI· PRESS

Desperate Trump taps "Tim Apple," Jensen Huang, Elon Musk to attend Xi summit

Xi meeting may force Trump to pivot on semiconductor tariffs and Taiwan.

Ashley Belanger·1 month ago

The Verge AI· PRESS

You can make an app for that

The tyranny of software is almost over. Since the first computer programmers wrote the first computer programs, we, the users of that software, have been forced to live in the worlds those programs create. The features are the features. The design is the design. Want something else, something better? Learn to code, I guess. Until now, the people making a given piece of software - mostly well-paid professional developers - have rarely been the same as the ones using it: lawyers, doctors, churches, schools, me. (Where they overlap most directly is with developer tools, which are often the best ...

David Pierce·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Spontaneous symmetry breaking and Goldstone modes for deep information propagation

Study applies Goldstone modes from physics to analyze stable information propagation in equivariant deep networks across depth.

Nabil Iqbal·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

AI-assisted cultural heritage dissemination: Comparing NMT and glossary-augmented LLM translation in rock art documents

Compares machine translation approaches (DeepL, Gemini) for terminology-dense rock art documents, emphasizing glossary augmentation over model modification.

Vicent Briva-Iglesias·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

π-Bench evaluates proactive personal assistant agents on identifying hidden user intents in long-horizon multi-turn workflows.

Haoran Zhang·1 month ago

r/LocalLLaMA· COMMUNITY

Automated AI researcher running locally with llama.cpp

Hugging Face releases ml-intern, an agent framework for local LLM research automation supporting llama.cpp/ollama with Qwen and Claude models.

u/lewtun·1 month ago·48 pts / 13 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

AQKA: Active Quantum Kernel Acquisition Under a Shot Budget

AQKA: active acquisition method for quantum kernel estimation under measurement shot budgets with regime decomposition framework.

Jian Xu·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Agentic Design of Compositional Descriptors via Autoresearch for Materials Science Applications

Automat: autoresearch framework using LLM coding agents to automatically design and optimize composition-based chemical descriptors for materials science.

Matteo Cobelli·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

How Sensitive Are Radiomic AI Models to Acquisition Parameters?

Framework quantifies radiomic AI model sensitivity to acquisition parameters across multicentre protocols, identifying robustness-critical parameter regions.

D. Gil·1 month ago

Stratechery· ANALYST

An Interview with Ben Thompson at the MoffettNathanson Media, Internet & Communications Conference

Ben Thompson discusses compute shortage impacts on aggregation theory and consumer AI at MoffettNathanson conference.

Ben Thompson·1 month ago

r/LocalLLaMA· COMMUNITY

Built an open-source one-prompt-to-cinematic-reel pipeline on a single GPU — FLUX.2 [klein] for character keyframes, Wan2.2-I2V for animation, vision critic with auto-retry, music + 9-language narration in the same pipeline

Shipped this for the AMD x lablab hackathon. Attached video is one of the actual reels the pipeline produced - one English sentence in, finished mp4 with characters, story, music, and voice-over out (fast demo video, not the best quality). ~45 minutes end-to-end on a single AMD Instinct MI300X. Every model is Apache 2.0 or MIT. **Pipeline (8 stages, all sequential on the same GPU):** 1. **Director Agent** - Qwen3.5-35B-A3B (vLLM + AITER MoE) plans 6 shots from one sentence, returns structured JSON with character bibles, shot prompts, music brief, per-shot voice-over script, narration langua...

u/Inevitable-Log5414·1 month ago·46 pts / 13 comm

MIT Tech Review· PRESS

The shock of seeing your body used in deepfake porn

When Jennifer got a job doing research for a nonprofit in 2023, she ran her new professional headshot through a facial recognition program. She wanted to see if the tech would pull up the porn videos she’d made more than 10 years before, when she was in her early 20s. It did in fact return…

Jessica Klein·1 month ago

r/ClaudeAI· COMMUNITY

Claude Opus 4.7 just revealed its System prompt, without beeing asked for it

I just had a Chat with Claude and for no reason and without any question in that direction, it added a disclaimer with the system prompt in the answer. (after answering my initial question) https://pastebin.com/C0s47rjV After I asked why it shared that I got: >You'll have to help me out a little here — this is the start of our conversation, so I haven't actually shared any information with you yet. There's nothing before your message for me to be referring back to. >Is it possible you're thinking of a different conversation, or that a message didn't ...

u/rudiXOR·1 month ago·31 pts / 17 comm

r/LocalLLaMA· COMMUNITY

Anyone actually using a local LLM as their daily knowledge base? Not for coding, for life stuff. What's your setup?

Reddit discussion on personal knowledge management with local LLMs; mostly user anecdotes, no novel technical insight or product announcement.

u/InformationSweet808·1 month ago·52 pts / 68 comm

r/LocalLLaMA· COMMUNITY

The "the future is fictional" problem of many local LLMs

Reddit discussion identifies knowledge-cutoff hallucination failure mode in local LLMs and some API models even with tool use enabled.

u/PromptInjection_·1 month ago·52 pts / 26 comm

r/ClaudeAI· COMMUNITY

You're abusing your subscription with agentic 24/7 workflows and that's why we all get restrictions and limits

Reddit discussion argues autonomous agent workflows strain Claude subscription economics, suggesting separate billing for agentic vs. interactive use.

u/iveroi·1 month ago·46 pts / 58 comm

r/ClaudeAI· COMMUNITY

I tested GPT-5.5 Codex against Opus 4.7 Claude Code, and it's about time Anthropic bros take pricing seriously.

Developer compares GPT-5.5 Codex to Claude Opus 4.7 on coding agent tasks (PR triage, code review UI), argues Anthropic needs aggressive pricing.

u/geekeek123·1 month ago·51 pts / 18 comm
