The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


Adaptive Inverted-Index Routing for Granular Mixtures-of-Experts

AIR-MoE uses vector quantization for efficient routing in granular mixture-of-experts, reducing computational overhead of token-to-expert assignment.

Klaus-Rudolf Kladny·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Adapting Large Language Models to a Low-Resource Agglutinative Language: A Comparative Study of LoRA and QLoRA for Bashkir

Comparative study of LoRA and QLoRA fine-tuning on Bashkir, a low-resource Turkic language, using models from DistilGPT2 to Qwen2.5-7B.

Mullosharaf K. Arabov·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Training-Time Batch Normalization Reshapes Local Partition Geometry in Piecewise-Affine Networks

Theoretical analysis of batch normalization's effect on geometry of piecewise-affine networks during training via hyperplane switching.

Xuan Qi·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

DART: A Vision-Language Foundation Model for Comprehensive Rope Condition Monitoring

DART, a vision-language foundation model for synthetic fiber rope condition monitoring, provides severity estimates, maintenance recommendations, and automated reports.

Anju Rani·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

UFAL-CUNI at SemEval-2026 Task 11: An Efficient Modular Neuro-symbolic Method for Syllogistic Reasoning

Neuro-symbolic system combining LLM parser with automated theorem prover for syllogistic reasoning in SemEval-2026 Task 11.

Ivan Kartáč·2 months ago

r/LocalLLaMA· COMMUNITY

Qwen3.6 27B NVFP4 + MTP on a single RTX 5090: 200k context working in vLLM

User demonstrates Qwen3.6 27B running 200k context on single RTX 5090 with NVFP4 quantization in vLLM, sharing exact configuration and parameters.

u/Maheidem·2 months ago·41 pts / 11 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Modular Reinforcement Learning For Cooperative Swarms

Modular multi-agent reinforcement learning approach for cooperative robot swarms with limited communication and local interaction.

Erel Shtossel·2 months ago

r/singularity· COMMUNITY

Religious robots are coming: South Korea's first autonomous humanoid robot converts to Buddhism

South Korean humanoid robot programmed with Buddhist practices; novelty claim lacks technical substance or robotics advancement details.

u/GeneReddit123·2 months ago·127 pts / 62 comm·+ covered by others

TechCrunch AI· PRESS

3 days left to lock in 50% off a second ticket to TechCrunch Disrupt 2026

Three days remain to buy one TechCrunch Disrupt 2026 ticket and get a second at 50% off. Gain more visibility in the tech industry. Offer ends May 8 at 11:59 p.m. PT.

TechCrunch Events·2 months ago·+ covered by others

arXiv (cs.AI/CL/LG)· ACADEMIA

Jacobian-Velocity Bounds for Deployment Risk Under Covariate Drift

Drift-aligned tangent regularization (DTR) bounds deployment risk under covariate shift using Jacobian-velocity theorem and Poincaré inequalities.

Jonathan R. Landers·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

When Does Gene Regulatory Network Inference Break? A Controlled Diagnostic Study of Causal and Correlational Methods on Single-Cell Data

Controlled benchmark study diagnosing when causal vs. correlational methods fail for gene regulatory network inference from single-cell RNA-seq.

Miguel Fernandez-de-Retana·2 months ago

TechCrunch AI· PRESS

AI boom pushes Samsung to $1T

Samsung crossed the $1 trillion valuation mark after shares surged on AI-driven chip demand, making it only the second Asian company after TSMC to hit the milestone.

Kate Park·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Unintended Negative Impacts of Promotional Language in Patent Evaluation

Large-scale USPTO study finds promotional language in patents negatively correlates with approval probability, contrary to science communication norms.

Bingkun Zhao·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Evolving Idea Graphs with Learnable Edits-and-Commits for Multi-Agent Scientific Ideation

Evolving Idea Graphs (EIG), a multi-agent LLM framework using learnable graph edits for scientific ideation with novelty, feasibility, clarity metrics.

Jiangwen Dong·2 months ago

r/ClaudeAI· COMMUNITY

Kindergarten-grade nouns

Reddit user reports Claude Opus struggles to distinguish word obscurity via corpus frequency vs. human recognition familiarity.

u/babelphishy·2 months ago·58 pts / 5 comm

r/OpenAI· COMMUNITY

Anyone else hate reading AI generated text?

Reddit user expresses frustration with detectability and stylistic uniformity of AI-generated text across news and government documents.

u/Connect-Painter-4270·2 months ago·52 pts / 76 comm

r/singularity· COMMUNITY

The Blue Collar Delusion: Why the machines don’t have to climb up to where we are, because the work will descend to meet them

Mechanic argues blue-collar work faces AI displacement risk through task simplification rather than machine capability escalation, challenging consensus on trade job resilience.

u/_noise-complaint·2 months ago·164 pts / 53 comm

The Verge AI· PRESS

Google’s AI search summaries will now quote Reddit

Want real human feedback related to your search results? Google’s AI now fetches it for you. Google is updating its AI Search features to make it easier for users to find information from sources they know and trust. One of the more notable changes introduces "a preview of perspectives" from firsthand sources like social media, Reddit, and other web forums, effectively linking your search queries with online conversations around similar topics. Google says this update aims to address that "people are increasingly seeking out advice from others" when searching for...

Jess Weatherbed·2 months ago

r/LocalLLaMA· COMMUNITY

An Open Benchmark for Testing RAG on Realistic Company-Internal Data

EnterpriseRAG-Bench: 500k-document synthetic dataset benchmarking RAG systems on realistic internal company data (Slack, email, tickets, PRs) vs. public corpora.

u/Weves11·2 months ago·41 pts / 14 comm

r/ClaudeAI· COMMUNITY

Voice + Claude my daily workflow for building stuff

Developer describes workflow using Claude voice for brainstorming during walks, then Claude Code for implementation.

u/dspv·2 months ago·24 pts / 31 comm

r/ClaudeAI· COMMUNITY

Dictation is the fastest way to work now, but how do you deal with the awkwardness of using it in an open office?

I'm a fast typer, but I find my projects go a lot better when I'm able to really dictate with Claude. I appreciate this won't be the case for all of you. At the moment I'm much more productive if I'm working from home or in a quiet space. There is a sensitivity setting on FluidVoice so I try to whisper, but so far it just ends up feeling too awkward and I go immediately back to typing. Also someone inevitably starts talking louder somewhere else in the office and the acoustics can impact what I'm saying. You can't express your questions and theories as freely as you'd like, because you'...

u/snowliondev·2 months ago·21 pts / 58 comm

r/MachineLearning· COMMUNITY

Stop letting LLMs edit your .bib [D]

Research community reports frequent LLM hallucinations in bibliography generation, with incorrect author attributions despite correct titles, raising integrity concerns.

u/Pure-Ad9079·2 months ago·40 pts / 10 comm

r/LocalLLaMA· COMMUNITY

Qwen3.6-27B with MTP grafted on Unsloth UD XL: 2.5x throughput via unmerged llama.cpp PR

Qwen3.6-27B with Multi-Token Prediction achieves 2.5x throughput via Unsloth quantization and llama.cpp integration.

u/havenoammo·2 months ago·48 pts / 28 comm

The Verge AI· PRESS

Microsoft’s Office and LinkedIn chief now runs Teams in latest reshuffle

Microsoft's LinkedIn CEO, Ryan Roslansky, took on an expanded role at the company as head of Office last year, and he's now getting more responsibilities as part of the latest leadership reshuffle inside Microsoft. Sources tell me that the Microsoft Teams organization is moving to report to Roslansky, who will now lead a new Work Experiences Group at Microsoft. The changes are part of a broader reshuffle triggered by Rajesh Jha, executive vice president of Microsoft's experiences and devices group, retiring from Microsoft after more than 35 years. Jha was responsible for the teams behind Wind...

Tom Warren·2 months ago

r/LocalLLaMA· COMMUNITY

Bad news: Apple drops high-memory Mac Studio configs

Apple discontinues high-memory Mac Studio configurations (256GB, 512GB), limiting local LLM inference options to 96GB max.

u/jzn21·2 months ago·47 pts / 20 comm

Google DeepMind· FRONTIER

AlphaEvolve: How our Gemini-powered coding agent is scaling impact across fields

Google DeepMind releases AlphaEvolve, a Gemini-powered coding agent demonstrating applications across business, infrastructure, and scientific domains.

Google DeepMind·2 months ago

r/OpenAI· COMMUNITY

"Water wars."

Reddit discussion about water consumption and waste impacts of AI model training, lacking specifics or novel data.

u/Total-Squirrel4634·2 months ago·58 pts / 42 comm

The Verge AI· PRESS

Chrome’s AI features may be hogging 4GB of your computer storage

Google Chrome may be taking up more of your storage than expected thanks to a large on-device AI model file that, in some cases, is being automatically downloaded to the browser's system folders. Users who have noticed unexplained drops in their available desktop device storage are now discovering that Chrome is installing a 4GB weights.bin file inside their browser directory when certain AI features are enabled. The weights.bin file in question is connected to Google's Gemini Nano AI model, which powers Chrome AI tools like scam detection, writing assistance, autofill, and suggestion feature...

Jess Weatherbed·2 months ago

r/Anthropic· COMMUNITY

Let's talk about ban policy

Should users be banned? If Anthropic wants to be the next Google, meaning to revolutionize the internet and the way computers are used, should users be banned? I've been reading a lot of horror stories lately about people getting banned for stupid things like "research work," standard usage, or simply security research. Who decides? Exactly, the model. Then you get banned without the possibility of appeal because the same model reads the appeals. Sure, people create new accounts, but it's only a matter of time before Claude Code collects device fingerprints. Perhaps it's already doing so. Should C...

u/Beginning_Ad2239·2 months ago·10 pts / 22 comm

Stratechery· ANALYST

Microsoft Earnings, Apple Earnings

Microsoft announces agentic business model shift; Apple faces chip/memory constraints despite Mac AI gains.

Ben Thompson·2 months ago

30 stories
