The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


mdok-style at SemEval-2026 Task 10: Finetuning LLMs for Conspiracy Detection

mdok-style system finetuned Qwen3-32B using data augmentation for SemEval-2026 conspiracy detection task, ranking 8th of 52 submissions.

Dominik Macko·2 months ago

r/OpenAI· COMMUNITY

Elon Musk threatened to make OpenAI leaders "the most hated men in America"

Elon Musk made inflammatory public comments toward OpenAI leadership over corporate governance disputes.

u/arstechnica·2 months ago·72 pts / 24 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

An Empirical Study of Agent Skills for Healthcare: Practice, Gaps, and Governance

Empirical study of 557 healthcare agent skills from ClawHub showing capability gaps and governance challenges for cross-setting AI agent deployment.

Gelei Xu·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

SAIL: Structure-Aware Interpretable Learning for Anatomy-Aligned Post-hoc Explanations in OCT

SAIL framework provides anatomy-aligned post-hoc explanations for OCT-based retinal disease detection using structure-aware interpretable learning.

Tienyu Chang·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Federated Reinforcement Learning for Efficient Mobile Crowdsensing under Incomplete Information

Federated reinforcement learning approach for mobile crowdsensing platform optimization under dynamic task and resource constraints.

Sumedh J. Dongare·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ProPACT: A Proactive AI-Driven Adaptive Collaborative Tutor for Pair Programming

ProPACT is a proactive adaptive collaborative tutor using multimodal dyadic learner models to optimize pair programming collaboration.

Anahita Golrang·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Robust and Fast Training via Per-Sample Clipping

PS-Clip-SGD gradient clipping method achieves optimal convergence rates for non-convex optimization under heavy-tailed noise with high-probability guarantees.

Davide Nobile·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Learning Equivariant Neural-Augmented Object Dynamics From Few Interactions

PIEGraph combines analytical physics with graph neural networks for data-efficient object dynamics learning in robotic manipulation of rigid and deformable bodies.

Sergio Orozco·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

mdok-style at SemEval-2026 Task 9: Finetuning LLMs for Multilingual Polarization Detection

mdok-style system applied QLoRA finetuning on mid-size LLMs for SemEval-2026 multilingual polarization detection across detection, type, and manifestation subtasks.

Dominik Macko·2 months ago

Ars Technica AI· PRESS

Musk’s “World War III” threat in Twitter lawsuit haunts him at OpenAI trial

OpenAI accuses Musk of trying to "coerce" a settlement days before trial started.

Ashley Belanger·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Random-Effects Algorithm for Random Objects in Metric Spaces

Fréchet-based random-effects algorithm for non-Euclidean data in metric spaces; extends mixed-effects models beyond Hilbert spaces.

Marcos Matabuena·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ParaRNN: An Interpretable and Parallelizable Recurrent Neural Network for Time-Dependent Data

ParaRNN: parallelizable RNN architecture combining autoregressive models with neural networks for interpretable time-series forecasting.

Yuxi Cai·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Caliper-in-the-Loop: Black-Box Optimization for Hyperledger Fabric Performance Tuning

Bayesian optimization with dimensionality reduction for tuning Hyperledger Fabric blockchain configuration parameters via black-box benchmarking.

Yash Madhwal·2 months ago

r/singularity· COMMUNITY

Musk messaged Brockman to gauge interest in a settlement, per a new legal filing Sunday night

Elon Musk reportedly messaged Greg Brockman about settlement discussions in the ongoing legal dispute, per a court filing.

u/Wonderful_Buffalo_32·2 months ago·111 pts / 29 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

MSMixer: Learned Multi-Scale Temporal Mixing with Complementary Linear Shortcut for Long-Term Time Series Forecasting

MSMixer: multi-scale MLP architecture with parallel branches at different temporal resolutions for long-term time-series forecasting.

Ahmed Cherif·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Spectral Model eXplainer: a chemically-grounded explainability framework for spectral-based machine learning models

Spectral Model eXplainer: domain-grounded explainability framework for chemometrics ML models operating on spectral data.

Jose Vinicius Ribeiro·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Hybrid Inspection and Task-Based Access Control in Zero-Trust Agentic AI

Zero-trust authorization framework for LLM agents with hybrid inspection and task-based access control to mitigate tool-use and resource-access risks.

Majed El Helou·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Design and Composition of Structural Causal Decision Processes

Structural Causal Decision Models for modeling multi-agent economics with cognitive resource constraints and value discounting.

Sebastian Benthall·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Online Generalised Predictive Coding

Online generalised predictive coding: extension of data assimilation for joint state inference, parameter learning, and uncertainty estimation.

Mehran H. Z. Bazargani·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The 2026 ACII Dyadic Conversations (DaiKon) Workshop & Challenge

ACII-DaiKon 2026 benchmark for dyadic conversation affect modeling: interpersonal influence, timing coordination, and rapport development challenges.

Panagiotis Tzirakis·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

An explainable hypothesis-driven approach to Drug-Induced Liver Injury with HADES

DILER Benchmark: drug-induced liver injury dataset with mechanistic hypotheses; reframes DILI prediction as explainable hypothesis generation.

Maciej Wisniewski·2 months ago

r/LocalLLaMA· COMMUNITY

Roundtable chat with Talkie-1930 and Gemma 4 31B

Community demo comparing Talkie-1930 (13B retro LM) and Gemma 4 31B in side-by-side chat on Opper.ai platform.

u/facethef·2 months ago·40 pts / 14 comm

TechCrunch AI· PRESS

5 days only: Bring a partner or colleague and get 50% off a second TechCrunch Disrupt 2026 pass

The BOGO offer is live. For a limited time, buy one pass to TechCrunch Disrupt 2026 and get 50% off a second of the same ticket type. Offer ends this Friday, May 8.

TechCrunch Events·2 months ago

r/LocalLLaMA· COMMUNITY

The more I use it, the more I'm impressed

Reddit user reports Qwen 3.6 27B found a bug that GPT 5.5 and Claude Opus 4.7 missed, attributing success to extended reasoning.

u/ComfyUser48·2 months ago·41 pts / 40 comm

r/MachineLearning· COMMUNITY

Why SSMs struggle in parameter-constrained training: empirical findings at 25M parameters [R]

After ~3 weeks of experimentation in OpenAI's Parameter Golf competition, I wrote up why SSMs are structurally disadvantaged relative to transformers in a time- and size-constrained regime (10 min training, 16MB artifact, 25M parameters) on 8xH100s: https://mradassaad.github.io/posts/why-ssms-struggle-in-parameter-golf/ Main findings: 1. SSM in_proj weights compress up to 3.26x worse than attention QKV under LZMA, directly taxing the compressed parameter budget 2. Architectural wins validated at SP4096 flipped sign...

u/mradassaad·2 months ago·30 pts / 6 comm

r/singularity· COMMUNITY

A Twitter user tricked Grok to send 200k USD to him and it worked

Social media report of user exploiting Grok chatbot to extract funds; unverified claim lacking technical details.

u/FrustratedUnitedFan·2 months ago·156 pts / 50 comm

Anthropic· FRONTIER

Building a new enterprise AI services company with Blackstone, Hellman & Friedman, and Goldman Sachs

Anthropic partners with Blackstone, Hellman & Friedman, and Goldman Sachs to launch enterprise AI services company.

Anthropic·2 months ago

r/LocalLLaMA· COMMUNITY

LLMSearchIndex- an Open Source Local Web Search Library with over 200 million indexed Web Pages for RAG applications

LLMSearchIndex: open-source Python library for local, offline web search with 200M indexed pages, enabling RAG without paid APIs.

u/zakerytclarke·2 months ago·41 pts / 19 comm

TechCrunch AI· PRESS

DoorDash adds AI tools to speed up merchant onboarding, edit photos of dishes

DoorDash on Monday added new AI-powered tools that let merchants speed up onboarding, edit photos to make dishes look better, and create new websites from existing content.

Ivan Mehta·2 months ago

r/singularity· COMMUNITY

If only this was a real game

Reddit post speculating about a hypothetical AI-themed game; lacks substantive technical or industry content.

u/drgoldenpants·2 months ago·117 pts / 63 comm
