The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

On the Properties of Feature Attribution for Supervised Contrastive Learning

Analyzes feature attribution properties in supervised contrastive learning versus cross-entropy classification approaches.

Leonardo Arrighi·2 months ago

TechCrunch AI· PRESS

DeepSeek previews new AI model that ‘closes the gap’ with frontier models

DeepSeek says both models are more efficient and performant than DeepSeek V3.2 due to architectural improvements, and have almost "closed the gap" with current leading models, both open and closed, on reasoning benchmarks.

Ram Iyer·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

An Integrated Framework for Explainable, Fair, and Observable Hospital Readmission Prediction: Development and Validation on MIMIC-IV

Framework for hospital readmission prediction on MIMIC-IV with explainability (SHAP), fairness evaluation, and deployment reliability.

Isaac Tosin Adisa·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

FeatEHR-LLM: Leveraging Large Language Models for Feature Engineering in Electronic Health Records

FeatEHR-LLM uses LLMs to generate clinically meaningful features from irregular EHR time series while limiting privacy exposure.

Hojjat Karami·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment

RouteLMT learns to route machine translation requests between small and large LLMs based on comparative quality improvement, reducing deployment costs.

Yingfeng Luo·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Aggregate vs. Personalized Judges in Business Idea Evaluation: Evidence from Expert Disagreement

PBIG-DATA dataset with 3K expert scores on LLM-generated business ideas tests whether evaluation judges should model consensus or individual evaluator preferences.

Wataru Hirota·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Different Strokes for Different Folks: Writer Identification for Historical Arabic Manuscripts

Establishes baselines for writer identification in historical Arabic manuscripts using the Muharaf dataset with line-level and page-disjoint protocols.

Hamza A. Abushahla·2 months ago

r/OpenAI· COMMUNITY

Appreciations for work mode in Codex. On track to becoming the first real super app

Reddit user praises Codex work mode, speculates OpenAI building a super app platform.

u/py-net·2 months ago·52 pts / 10 comm

r/singularity· COMMUNITY

This is getting insane (image gen 2)

Reddit post sharing OpenAI image generation samples without technical details, benchmarks, or release announcement.

u/duselkay·2 months ago·103 pts / 30 comm

r/LocalLLaMA· COMMUNITY

Anthropic admits to have made hosted models more stupid, proving the importance of open weight, local models

Anthropic reduced Claude Sonnet 4.6 and Opus 4.6 reasoning effort and pruned session memory for latency, then reverted after user feedback.

u/spaceman_·2 months ago·76 pts / 28 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Measuring and Mitigating Persona Distortions from AI Writing Assistance

Large-scale study (N=14K) shows AI writing assistance distorts perceived writer persona across 29 dimensions including politics, personality, and identity.

Paul Röttger·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Decoding High-Dimensional Finger Motion from EMG Using Riemannian Features and RNNs

End-to-end deep learning framework for continuous finger motion estimation from forearm EMG using Riemannian features and RNNs for prosthetic control.

Martin Colot·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CGC: Compositional Grounded Contrast for Fine-Grained Multi-Image Understanding

CGC framework improves MLLMs' fine-grained multi-image understanding by addressing spatial hallucination and attention leakage through compositional grounding.

Lihao Zheng·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Deep Learning for Model Calibration in Simulation of Itaconic Acid Production

Compares deep learning strategies (DDL and conditional flow matching) for kinetic parameter estimation in itaconic acid fermentation simulation.

Daria Fokina·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

FedSPDnet: Geometry-Aware Federated Deep Learning with SPDnet

FedSPDnet introduces geometry-preserving federated learning aggregation strategies for symmetric positive definite matrices with Stiefel constraints.

Thibault Pautrel·2 months ago

r/ClaudeAI· COMMUNITY

Claude limits no longer round to the nearest hour

Claude's usage limits no longer reset on hourly boundaries, preventing strategic timing exploits.

u/Shipposting_Duck·2 months ago·66 pts / 11 comm

The Verge AI· PRESS

Elon Musk and Sam Altman’s court showdown will dish the dirt

Might as well jump, as the poet David Lee Roth once said. | Image: Cath Virginia / The Verge Elon Musk cofounded OpenAI, and then flounced off in a huff when he wasn't anointed CEO, leaving Sam Altman as the last power-hungry man standing. Now, Musk is back with a lawsuit, and a trial is scheduled to start in Oakland, California, on April 27th. Theoretically, it's a legal case about whether OpenAI defrauded Musk. But that's not really what we're all doing here. This is about mess. Over the past couple of years, Musk's legal theories for punishing OpenAI have run the gamut from breach of contr...

Elizabeth Lopatto·2 months ago

TechCrunch AI· PRESS

In another wild turn for AI chips, Meta signs deal for millions of Amazon AI CPUs

Meta has commandeered a big chunk of Amazon's homegrown CPUs (not GPUs) for AI agentic workloads, signaling that a new kind of chip race has begun.

Julie Bort·2 months ago

r/LocalLLaMA· COMMUNITY

DeepSeek V4 is built different...

Reddit discussion of DeepSeek V4 capabilities; original Chinese content translated, lacks substantive technical details.

u/Alternative-Duty-532·2 months ago·170 pts / 26 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Contrastive Semantic Projection: Faithful Neuron Labeling with Contrastive Examples

Two-stage contrastive semantic projection method sharpens neuron-level interpretability labels in deep networks using contrastive examples.

Oussama Bouanani·2 months ago

r/MachineLearning· COMMUNITY

Is the ds/ml slowly being morphed into an AI engineer? [D]

Agents are amazing. Harnesses are cool. But the fundamental role of a data scientist is not to use a generalist model in an existing workflow; it's a completely different field. AI engineering is the body of the vehicle, whereas the actual brain/engine behind it is the data scientist's playground. I feel like I am not alone in this realisation that my role somehow got silently morphed into that of an AI engineer, with the engine's development becoming a complete afterthought. Based on industry requirements and ongoing research, most of the work has quietly shifted from building the engine t...

u/The-Silvervein·2 months ago·32 pts / 8 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

All Eyes on the Workflow: Automated and Efficient Event Discovery from Video Streams

SnapLog extracts event data from video streams via image embeddings and temporal segmentation for business process mining and workflow analysis.

Marco Pegoraro·2 months ago

r/singularity· COMMUNITY

Exactly 1 year ago, Anthropic said fully AI employees were just 1 year away

Reddit post recalling Anthropic's 1-year timeline claim for fully autonomous AI employees from a year prior; no new announcement.

u/Distinct-Question-16·2 months ago·251 pts / 57 comm

r/ClaudeAI· COMMUNITY

I'm somewhat of a coder myself

u/Flope·2 months ago·48 pts / 9 comm

r/MachineLearning· COMMUNITY

[New Optimizer] 🌹 Rose: low VRAM, easy to use, great results, Apache 2.0 [P]

Rose: stateless PyTorch optimizer with low VRAM footprint and fast convergence, released under Apache 2.0.

u/ECF630·2 months ago·31 pts / 16 comm

r/singularity· COMMUNITY

DeepSeek V4 Pro underwhelms on Arena (crowdsourced user preference benchmark, not a capability benchmark)

DeepSeek V4 Pro shows weaker-than-expected performance on LMSYS Arena user preference voting, a crowdsourced benchmark distinct from capability measurement.

u/Hemingbird·2 months ago·100 pts / 84 comm

r/LocalLLaMA· COMMUNITY

Takeaways & discussion about the DeepSeek V4 architecture

Technical deep-dive on DeepSeek V4 architecture: hybrid sparse attention, manifold-constrained connections, and FP4 quantization innovations vs. V3.

u/benja0x40·2 months ago·45 pts / 26 comm

r/LocalLLaMA· COMMUNITY

My New AI build - please be kind!

User shares local hardware build specs for AI workloads including CPU, GPU setup, and thermal management configuration.

u/Ell2509·2 months ago·41 pts / 40 comm

r/LocalLLaMA· COMMUNITY

DS4-Flash vs Qwen3.6

Reddit comparison thread between DS4-Flash and Qwen3.6 models lacking substantive analysis or benchmark data.

u/flavio_geo·2 months ago·101 pts / 36 comm

Anthropic· FRONTIER

An update on our election safeguards

Anthropic outlines safeguards for Claude during US midterms and global elections to mitigate disinformation and manipulation risks.

Anthropic·2 months ago

← Front Page30 stories

← Newer Older →

The Archive

On the Properties of Feature Attribution for Supervised Contrastive Learning

DeepSeek previews new AI model that ‘closes the gap’ with frontier models

An Integrated Framework for Explainable, Fair, and Observable Hospital Readmission Prediction: Development and Validation on MIMIC-IV

FeatEHR-LLM: Leveraging Large Language Models for Feature Engineering in Electronic Health Records

RouteLMT: Learned Sample Routing for Hybrid LLM Translation Deployment

Aggregate vs. Personalized Judges in Business Idea Evaluation: Evidence from Expert Disagreement

Different Strokes for Different Folks: Writer Identification for Historical Arabic Manuscripts

Appreciations for work mode in Codex. On track to becoming the first real super app

This is getting insane (image gen 2)

Anthropic admits to have made hosted models more stupid, proving the importance of open weight, local models

Measuring and Mitigating Persona Distortions from AI Writing Assistance

Decoding High-Dimensional Finger Motion from EMG Using Riemannian Features and RNNs

CGC: Compositional Grounded Contrast for Fine-Grained Multi-Image Understanding

Deep Learning for Model Calibration in Simulation of Itaconic Acid Production

FedSPDnet: Geometry-Aware Federated Deep Learning with SPDnet

Claude limits no longer round to the nearest hour

Elon Musk and Sam Altman’s court showdown will dish the dirt

In another wild turn for AI chips, Meta signs deal for millions of Amazon AI CPUs

DeepSeek V4 is built different...

Contrastive Semantic Projection: Faithful Neuron Labeling with Contrastive Examples

Is the ds/ml slowly being morphed into an AI engineer? [D]

All Eyes on the Workflow: Automated and Efficient Event Discovery from Video Streams

Exactly 1 year ago, Anthropic said fully AI employees were just 1 year away

I'm somewhat of a coder myself

[New Optimizer] 🌹 Rose: low VRAM, easy to use, great results, Apache 2.0 [P]

DeepSeek V4 Pro underwhelms on Arena (crowdsourced user preference benchmark, not a capability benchmark)

Takeaways &amp; discussion about the DeepSeek V4 architecture

My New AI build - please be kind!

DS4-Flash vs Qwen3.6

An update on our election safeguards

Takeaways & discussion about the DeepSeek V4 architecture