The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

CogScale: Scalable Benchmark for Sequence Processing

CogScale: 14-task synthetic benchmark for evaluating sequence processing in novel architectures at reduced computational cost.

Yannis Bendi-Ouis·1 month ago

Anthropic· FRONTIER

KPMG integrates Claude across its core business and workforce of more than 276,000 in strategic alliance

KPMG deploys Claude across 276,000+ employees via Digital Gateway platform in multi-year strategic partnership with Anthropic.

Anthropic·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MSAlign: Aligning Molecule and Mass Spectra Foundation Models for Metabolite Identification

MSAlign aligns molecule and mass spectra foundation models for improved metabolite identification in drug discovery and clinical research.

Paul Krzakala·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Memory-Augmented Reinforcement Learning Agent for CAD Generation

Memory-augmented RL framework for CAD generation agents handling long operation sequences and geometric constraints with error correction.

Yin Xiaolong·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

EngiAI: A Multi-Agent Framework and Benchmark Suite for LLM-Driven Engineering Design

EngiAI benchmark suite for multi-agent LLM engineering design with workflow, RAG, and simulation evaluation across seven prompt styles.

Gioele Molinari·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

TERGAD: Structure-Aware Text-Enhanced Representations for Graph Anomaly Detection

TERGAD detects graph anomalies by combining text and structure-aware representations to identify inconsistencies between node content and topology.

Wen Shi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ContextRAG: Extraction-Free Hierarchical Graph Construction for Retrieval-Augmented Generation

ContextRAG constructs graph topology for RAG without LLM-based extraction using k-means and formal concept analysis for multi-hop QA.

Roman Prosvirnin·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Graph Neural Networks for Community Detection in Graph Signal Analysis

Survey of GNN architectures for community detection in graphs, reviewing clustering performance on large high-dimensional networks.

Roberto Cavoretto·1 month ago

r/LocalLLaMA· COMMUNITY

bytedance released an open source model that attempts to do just about anything with only 3b parameters

ByteDance released Lance, a 3B-parameter open-weight multimodal model supporting image/video understanding, generation, and editing.

u/uxl·1 month ago·117 pts / 20 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

LIFT and PLACE: A Simple, Stable, and Effective Knowledge Distillation Framework for Lightweight Diffusion Models

LIFT and PLACE: coarse-to-fine knowledge distillation framework for lightweight diffusion models via linear fitting and adaptive coefficient estimation.

Hyunsoo Han·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Mathematical Reasoning in Large Language Models: Benchmarks, Architectures, Evaluation, and Open Challenges

Survey of 120+ studies on mathematical reasoning in LLMs: datasets, architectures, training strategies, evaluation protocols for AI benchmarking.

Husnain Amjad·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Measuring Safety Alignment Effects in Autonomous Security Agents

Trace-based benchmark measuring safety-aligned LLM behavior (Gemma 4) as autonomous security agents vs. uncensored derivatives on 30 vulnerability-analysis tasks.

Isaac David·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Projecting Latent RL Actions: Towards Generalizable and Scalable Graph Combinatorial Optimization

Projection agents: RL-GNN approach for graph combinatorial optimization with improved generalization and scalability across diverse problem instances.

Franco Terranova·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CAIT: A Syntactic Parsing Toolkit for Child-Adult InTeractions

CAIT: dependency parser and POS tagger for CHILDES child-adult interaction data, outperforming SpaCy and Stanza on syntactic structure.

Francesca Padovani·1 month ago

r/ClaudeAI· COMMUNITY

Anthropic just bought the company that generates most production MCP servers

Anthropic acquires Stainless for $300M+, gaining control of major MCP server generation platform serving OpenAI, Google, Meta, and Cloudflare SDKs.

u/Ok-Constant6488·1 month ago·64 pts / 38 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

LLM-Based Financial Sentiment Analysis in Arabic: Evidence from Saudi Markets

Arabic NLP framework for financial sentiment analysis using Transformer-based NER on Saudi market news and social media data.

Mona H. Albaqawi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Can Large Language Models Reliably Correct Errors in Low-Resource ASR? A Contamination-Aware Case Study on West Frisian

LLM-based generative error correction for low-resource West Frisian ASR with data contamination analysis and offline dataset construction.

Yun Hao·1 month ago

r/LocalLLaMA· COMMUNITY

Meet the Fleet of BlackBeard

Home lab setup post showcasing multi-GPU infrastructure for running 35B+ parameter models locally.

u/BlackBeardAI·1 month ago·42 pts / 46 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Awakening the Hydra: Stabilizing Multi-Concept Backdoor Injection in Text-to-Image Diffusion Models

Multi-concept backdoor injection vulnerability in text-to-image diffusion models: semantic conflicts from sequential fine-tuning and redistributed checkpoints.

Kai Wang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Probabilistic Multivariate Time Series Forecasting with Diffusion Copulas

Diffusion-Copula framework decouples marginal distributions from dependence structures for multivariate financial time-series forecasting with tail-risk calibration.

David Huk·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Agentic Discovery of Cryomicroneedle Formulations

Closed-loop AI workflow (Gaussian process, Bayesian optimization) for cryomicroneedle cryoprotectant discovery from 198 mesenchymal stem-cell formulations.

Hao Li·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond Rational Illusion: Behaviorally Realistic Strategic Classification

Strategic classification framework incorporating behavioral biases and cognitive deviations from rational agent assumptions.

Xinpeng Lv·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Transforming Constraint Programs to Input for Local Search

Automated neighborhood generation for local search via constraint symmetry analysis in the IDP system.

Jo Devriendt·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Convergence of Consensus-Based Particle Methods for Nonconvex Bi-Level Optimization

Convergence analysis for consensus-based particle optimization in nonconvex bi-level problems.

Yutong Chao·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Cross-View Attention Fusion Net: A Prior-Guided Dual-View Representation Learning for Cardiac Output Estimation from Short-Term PPG Signals

Cross-View Attention Fusion network for cardiac output estimation from photoplethysmography signals.

Yaowen Zhang·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CriterAlign: Criterion-Centric Rationale Alignment for Code Preference Judging

CriterAlign: criterion-centric LLM judge for pairwise code preference evaluation with task-specific rubric alignment.

Zhenyu Li·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Pseudocode-Guided Structured Reasoning for Automating Reliable Inference in Vision-Language Models

Pseudocode-guided structured reasoning framework reducing hallucinations in vision-language models for robotic automation.

Weicong Ni·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

When Tabular Foundation Models Meet Strategic Tabular Data: A Prior Alignment Approach

Prior alignment approach enabling tabular foundation models to generalize under strategic feature manipulation post-deployment.

Xinpeng Lv·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

OScaR: KV cache quantization technique addressing token norm imbalance for extreme compression in long-context LLMs.

Zunhai Su·1 month ago

r/OpenAI· COMMUNITY

make no mistakes jarvis

Reddit post expressing opinion about 'Jarvis' with minimal substantive content or claims.

u/irelatetolevin·1 month ago·269 pts / 10 comm

← Front Page30 stories

← Newer Older →