The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

OmniNFT: Modality-wise Omni Diffusion Reinforcement for Joint Audio-Video Generation

OmniNFT applies multi-objective reinforcement learning to joint audio-video generation, addressing modality alignment and cross-modal synchronization challenges.

Guohui Zhang·1 month ago

r/LocalLLaMA· COMMUNITY

Needle: We Distilled Gemini Tool Calling Into a 26M Model

Needle: 26M parameter tool-calling model distilled from Gemini, runs 6000 tok/s prefill on consumer hardware.

u/Henrie_the_dreamer·1 month ago·43 pts / 14 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

MEME: Multi-entity & Evolving Memory Evaluation

MEME benchmark evaluates LLM agent memory across multi-entity and evolving dimensions, revealing system failures on dependency reasoning and deletion tasks.

Seokwon Jung·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Routers Learn the Geometry of Their Experts: Geometric Coupling in Sparse Mixture-of-Experts

Study reveals geometric coupling between routers and experts in sparse MoE models, explaining routing collapse and informing load-balancing improvements.

Sagi Ahrac·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Reward Hacking in Rubric-Based Reinforcement Learning

Framework identifies verifier failure and rubric design limitations as sources of reward hacking in RL post-training, tested against frontier evaluator panels.

Anas Mahmoud·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

KV-Fold: One-Step KV-Cache Recurrence for Long-Context Inference

KV-Fold enables long-context inference via training-free KV-cache recurrence, treating cache as functional fold accumulator over sequence chunks.

Alireza Nadali·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Solve the Loop: Attractor Models for Language and Reasoning

Attractor Models stabilize recurrent Transformers via fixed-point refinement with implicit differentiation, maintaining constant training memory across variable depths.

Jacob Fein-Ashley·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

High-arity Sample Compression

Theoretical work extends sample compression schemes to high-arity product spaces and proves connection to PAC learnability.

Leonardo N. Coregliano·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Search Your Block Floating Point Scales!

ScaleSearch optimizes Block Floating Point quantization scale factors via fine-grained search to reduce inference error for generative models.

Tanmaey Gupta·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs

Gymnasium environment for demand-response RL using offline smart meter data to optimize grid flexibility and energy affordability.

Jose E. Aguilar Escamilla·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A proximal gradient algorithm for composite log-concave sampling

Proximal gradient sampler for composite log-concave distributions with convergence bounds in total variation distance.

Linghai Liu·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs

Multi-Stream LLMs decouple single-message bottleneck into parallel streams for thoughts, inputs, and outputs, enabling concurrent agent reasoning and tool use.

Guinan Su·1 month ago

Simon Willison· ANALYST

llm 0.32a2

llm 0.32a2 adds support for OpenAI's /v1/responses endpoint, enabling interleaved reasoning visibility for GPT-5 class models.

Simon Willison·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection

TextSeal watermark for LLM provenance and distillation protection uses Gumbel-max with dual-key generation and multi-region localization, zero inference overhead.

Tom Sander·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Enabling AI-Native Mobility in 6G: A Real-World Dataset for Handover, Beam Management, and Timing Advance

Real-world 5G/6G dataset for AI-driven beam management and handover optimization in mobile networks.

Mannam Veera Narayana·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Algorithmic Caricature: Auditing LLM-Generated Political Discourse Across Crisis Events

Computational study of LLM-generated political text detection across crisis events using behavioral analysis vs. perplexity signals.

Gunjan·1 month ago

r/OpenAI· COMMUNITY

GPT 5.5 outperforming Opus 4.7 on ProgramBench

GPT-5.5 outperforms Claude Opus 4.7 on ProgramBench coding benchmark, achieving first solve with fewer agent steps via action bundling.

u/klieret·1 month ago·135 pts / 11 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models

ORCE method decouples verbalized confidence from answer generation in LLMs to improve uncertainty calibration without degrading accuracy.

Chen Li·1 month ago

TechCrunch AI· PRESS

Anthropic warns investors against secondary platforms offering access to its shares

The company named Open Doors Partners, Unicorns Exchange, Pachamama Capital, Lionheart Ventures, Hiive, Forge Global, Sydecar and Upmarket as companies that are not authorized to provide access to buy or sell its shares.

Ram Iyer·1 month ago

The Verge AI· PRESS

Sam Altman says Elon Musk’s mind games were damaging OpenAI

OpenAI CEO Sam Altman says Elon Musk did "huge damage" to the culture of the AI startup. During testimony as part of Musk's lawsuit against OpenAI, Altman said Musk required OpenAI president Greg Brockman and former chief scientist Ilya Sutskever to rank researchers by their accomplishments and "take a chainsaw through a bunch." Altman conceded that this was the management style the Tesla CEO was known for, but that it was incompatible with his startup. "I don't think Mr. Musk understood how to run a good research lab," Altman testified when his lawyer, William Savitt, asked about the impact ...

Emma Roth·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Causal Language Modeling Detour Improves Encoder Continued Pretraining

CLM detour improves domain-adapted encoder pretraining on biomedical texts vs. standard MLM continuation by 0.3-2.8pp.

Rian Touchent·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CAAFC: Chronological Actionable Automated Fact-Checker for misinformation / non-factual hallucination detection and correction

CAAFC framework for automated fact-checking and hallucination detection aligns LLM-based AFC with professional fact-checker workflows.

Islam Eldifrawi·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Environment-Adaptive Preference Optimization for Wildfire Prediction

Environment-adaptive preference optimization for rare-event prediction using long-tailed learning on wildfire datasets.

Enyi Jiang·1 month ago

TechCrunch AI· PRESS

Report: Google and SpaceX in talks to put data centers into orbit

Google and SpaceX are in talks to build data centers in orbit, pitching space as the future home for AI compute, even as costs today remain far higher than on the ground.

Rebecca Bellan·1 month ago

r/MachineLearning· COMMUNITY

Steam Recommender using similarity! (Undergraduate Student Project) [P]

(DISCLAIMER: I accidentally deleted the last post on this subreddit my apologies if this is your second time seeing it) Last year I made a [post](https://www.reddit.com/r/datascience/comments/1lkjxmr/steam_recommender_using_vectors_student_project/) about my steam recommender The last one was great and served its purpose of showing many people new games, But this new version is much more functional! I love making recommendation systems that tell the user WHY they got the recommendation. During a steam sale event, I always find myself trying to look for new video games to play. If I wanted ...

u/Expensive-Ad8916·1 month ago·34 pts / 8 comm

r/ClaudeAI· COMMUNITY

Claude Haiku 4.6 shown on tutorials page

Reddit user spots "Claude Haiku 4.6" label on Anthropic tutorials page; likely a documentation error, now corrected.

u/RetroTho·1 month ago·50 pts / 5 comm

r/LocalLLaMA· COMMUNITY

i built a little free mobile app that lets you generate your ai slop wrapper apps

Reddit user releases free mobile app for generating LLM wrapper applications locally.

u/xSnoozy·1 month ago·40 pts / 10 comm

r/LocalLLaMA· COMMUNITY

Agentic harness for theoretical physics research

Hugging Face releases physics-intern, a multi-agent framework for theoretical physics research that doubles Gemini performance on CritPt benchmark.

u/lewtun·1 month ago·42 pts / 10 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Learning Minimally Rigid Graphs with High Realization Counts

Reinforcement learning approach to construct minimally rigid graphs with high realization counts via Henneberg moves.

Oleksandr Slyvka·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Geometric Factual Recall in Transformers

Theoretical analysis of geometric memorization in transformers showing embeddings encode relational structure vs. linear parameter scaling.

Shauli Ravfogel·1 month ago

← Front Page30 stories

← Newer Older →