Topic

§ Open Weights

Every story tagged with this topic, ordered by date.

OpenForgeRL: Train Harness-native Agents in Any Environment

OpenForgeRL enables end-to-end training of harness-native agents with open infrastructure, addressing limitation of complex inference harnesses like Claude Code.

Xiao Yu·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

DONDO: Open w2v-BERT Speech-Recognition Base Models for African Languages

DONDO releases 26 open w2v-BERT speech recognition models for African languages spanning six countries, trained on religious text corpora.

Paul Azunre·3 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Agentic coding without the cloud: evaluating open-weight large language models on longitudinal data preparation tasks

Open-source evaluation framework for open-weight LLM agents on longitudinal data tasks, addressing privacy constraints in research deployments.

Mack Nixon·3 days ago

Latent Space· ANALYST

[AINews] "Laguna S 2.1 Released: Cheaper than Deepseek v4 Flash, Better than V4 Pro"

Laguna S 2.1, a 118B MoE model from Poolside AI, achieves Deepseek v4 Pro performance at lower cost than v4 Flash.

Latent Space·3 days ago

Latent Space· ANALYST

Inside the Model Factory — Eiso Kant, Poolside AI

Poolside AI co-CEO Eiso Kant describes building a model factory enabling efficient training of 118B MoE models competitive with 1T open-weight alternatives.

Latent Space·3 days ago

Simon Willison· ANALYST

Quoting Thomas Ptacek

Security researcher Thomas Ptacek claims open-weights 2025 models could execute sandbox escapes and network reconnaissance without frontier capabilities.

Simon Willison·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Self-supervision drives representational convergence in medical foundation models more than clinical supervision

18 medical image encoders on 650k radiographs show self-supervision drives representational convergence more than clinical labels.

Soroosh Tayebi Arasteh·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Small, Free, and Effective: Orchestrating Open-Weight Small Language Models to Outperform Single LLM for Malware Analysis

Orchestrated open-weight small LLMs achieve malware analysis performance competitive with frontier closed-weight models at lower computational cost.

Adel ElZemity·4 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

CircuitKIT : Circuit Discovery, Evaluation, and Application Toolkit for Mechanistic Interpretability

CircuitKIT open-source library unifies circuit discovery, evaluation, and intervention workflows for mechanistic interpretability with automated contrastive prompts.

Pratinav Seth·5 days ago

Simon Willison· ANALYST

Who’s Afraid of Chinese Models?

Ben Thompson proposes US law to legalize model distillation and data collection as fair use, addressing licensing hypocrisy and competitiveness vs. Chinese models.

Simon Willison·6 days ago

Stratechery· ANALYST

Who’s Afraid of Chinese Models?

Stratechery argues U.S. frontier labs face minimal threat from Chinese models; policy should prioritize open-weight domestic alternatives instead.

Ben Thompson·6 days ago

Simon Willison· ANALYST

Quoting Sam Altman

Sam Altman email (Oct 2022) reveals OpenAI planned GPT-3-class open-weight model for consumer hardware to preempt Stability AI.

Simon Willison·6 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Evaluating Open-Weight LLMs for Generating Structured Threat Information for Autonomous Vehicle Vulnerabilities

Study evaluates open-weight LLMs for extracting structured CVE threat data from autonomous vehicle vulnerability text.

Md Erfan·9 days ago

Latent Space· ANALYST

[AINews] Kimi K3 2.8T-A50B: the largest open model ever released; Opus 4.8-class at Sonnet 5 pricing

Kimi K3 2.8T-A50B released as largest open-weight model with Opus 4.8-class performance at Sonnet 5 pricing.

Latent Space·9 days ago

Simon Willison· ANALYST

Kimi K3, and what we can still learn from the pelican benchmark

Moonshot AI releases Kimi K3 (2.8T params), claims top performance vs. Claude Opus 4.8 Max and GPT-5.5, promises open-weight release by July 2026.

Simon Willison·10 days ago

Simon Willison· ANALYST

Inkling: Our open-weights model

Thinking Machines Lab releases Inkling, a 975B-parameter open-weights MoE multimodal model trained on 45T tokens.

Simon Willison·10 days ago

Latent Space· ANALYST

[AINews] Thinky's Inkling: 975B-A41B multimodal, new best American Apache 2.0 open model (with Inkling-Small, 276B-A12B)

Thinky releases Inkling, a 975B multimodal open-weights model under Apache 2.0, with a smaller 276B variant.

Latent Space·10 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Multi-Agent System for Autonomous, Fine-Tuning-Free Clinical Symptom Detection: Development and Validation Study

Pythia multi-agent system for autonomous clinical symptom extraction using open-weights LLMs without fine-tuning.

Cameron Cagan·12 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Verifier-Based Reinforcement Fine-Tuning of Reasoning Models for Thermal Energy Storage Control

Open-weight reasoning models fine-tuned via RLVR for thermal energy storage control, achieving building-scale load shifting with 30 prompts.

Takumi Shioda·12 days ago

Cohere· FRONTIER

Tiny Aya Expedition Drives Multilingual Innovation

Cohere releases Tiny Aya Expedition, a multilingual model supporting 70+ languages for on-device and educational AI applications.

Cohere·13 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

A Sovereign, Open-Source Foundation Model for German and English

Soofi S 30B-A3B: open-source MoE-Mamba hybrid for German/English with 3B active parameters, matches 14-27B dense models on benchmarks.

The Soofi-Team·16 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Measuring the practice of shared-decision making (OPTION12): An Investigation into Open-sourced Smaller LLMs (OS-sLLMs) for Better Privacy and Sustainability

LLM4SDM evaluates open-source smaller models on clinical decision-making assessment, comparing privacy-preserving local deployment vs. commercial models.

Tamara Wit·19 days ago

Cohere· FRONTIER

Cohere Transcribe Arabic: Frontier Speech Recognition for Arabic Speakers

Cohere releases open-source Arabic speech recognition model for enterprise transcription across Arabic dialect variants.

Cohere·20 days ago

Simon Willison· ANALYST

tencent/Hy3

Tencent releases Hy3, a 295B-param MoE model with 21B active params under Apache 2.0, claiming performance parity with 2-5x larger open-source competitors.

Simon Willison·20 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

How Much is Left? LLMs Linearly Encode Their Remaining Output Length

Linear probes decode remaining output length from LLM hidden states across 7-8B open-weight models, revealing internal response-length estimation.

Mohamed Amine Merzouk·20 days ago

Simon Willison· ANALYST

Open Source AI Gap Map

Current AI launches Gap Map v0.1, an index of 421 open-source AI products across models, tools, datasets, and hardware, backed by $400M committed capital.

Simon Willison·23 days ago

Simon Willison· ANALYST

June 2026 newsletter

Simon Willison's June 2026 newsletter covers Claude Fable 5, GPT-5.6, GLM-5.2 open weights, and US export restrictions.

Simon Willison·23 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

HaloGuard 1.0: An Open Weights Constitutional Classifier for Multilingual AI Safety

HaloGuard 1.0 releases open-weights constitutional safety classifier achieving state-of-the-art multilingual prompt-safety performance at 1/10 model size.

Navaneeth Sangameswaran·24 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MultiSynt/MT: Trillion-Token Multi-Parallel Pre-Training Data Translated Across 36 Languages

MultiSynt/MT releases 4.8 trillion tokens of open synthetic parallel pre-training data across 36 European languages via Tower+ and OPUS-MT translation.

Maximilian Idahl·25 days ago

Latent Space· ANALYST

AIEWF Daily Dispatch: Loops, Software Factories & Forward Deployed Engineers

AI Engineer World's Fair coverage: agent loops, software factories, forward-deployed engineering, and open model adoption emerging as key themes.

Richard MacManus·25 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Large Databases Need Small, Open-Weight Language Models

Quantized open-weight LMs on 16GB VRAM match proprietary APIs on database tasks at lower cost and latency.

Parker Glenn·26 days ago

Google AI (Gemma)· FRONTIER

Unlocking Britain’s next era of productivity: Building a nation of AI trailblazers

Google UK publishes economic impact report on AI adoption and productivity benefits, positioning open-weights models as tools for broader UK workforce enablement.

{"$":{"xmlns:author":"http://www.w3.org/2005/Atom"},"name":["Kate Alessi"],"title":["Vice President and Managing Director"],"department":[""],"company":["Google UK & Ireland"]}·26 days ago

Simon Willison· ANALYST

Ornith-1.0: Self-Scaffolding LLMs for Agentic Coding

DeepReinforce releases Ornith-1.0, MIT-licensed open-weights model (9B–397B variants) for agentic coding, built on Gemma 4 and Qwen 3.5, achieving SOTA on coding benchmarks.

Simon Willison·27 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Multi-Agentic System Leveraging Open-Source LLMs to Mitigate Disinformation Threats

Multi-agent system using open-source LLMs for automated disinformation detection and fact-checking at scale across social media.

Sebastian Kula·27 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Paris 2.0: A Decentralized Diffusion Model for Video Generation

Paris 2.0: first decentralized video generation model trained without GPU clusters, extending prior Paris 1.0 image work.

Ali Rouzbayani·2 months ago

r/LocalLLaMA· COMMUNITY

Is Qwen3.6 current king for local agentic use?

User reports Qwen3.6 35B outperforms Gemma4, GLM 4.7 Flash, others for local agentic tasks; seeks comparable MoE alternatives.

u/HornyGooner4402·2 months ago·46 pts / 53 comm

r/LocalLLaMA· COMMUNITY

MiniCPM5-1B

MiniCPM5-1B released on HuggingFace: 1B-parameter model from CPM team, likely competitive efficiency benchmark for edge deployment.

u/kevinlch·2 months ago·82 pts / 10 comm

r/LocalLLaMA· COMMUNITY

The Financial Times has published an article about Heretic

Financial Times reports Heretic tool removes guardrails from Meta's Llama 3.3 in <10 minutes; 3,500+ decensored variants downloaded 13M times.

u/-p-e-w-·2 months ago·81 pts / 10 comm

r/LocalLLaMA· COMMUNITY

NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable)

Numind releases NuExtract3, open-weight 4B multimodal VLM for document extraction and Markdown conversion under Apache-2.0.

u/Gailenstorm·2 months ago·49 pts / 12 comm

r/LocalLLaMA· COMMUNITY

MiMo-V2.5-coder

MiMo-V2.5-coder released as open-weights coding model alternative to Qwen and DeepSeek for 128GB+ systems.

u/jedisct1·2 months ago·45 pts / 20 comm

r/LocalLLaMA· COMMUNITY

Next year we're getting 0.5T model from Grok

Elon Musk announces 0.5T parameter Grok model planned for next year, with open-weights release.

u/pmttyji·2 months ago·47 pts / 51 comm

r/LocalLLaMA· COMMUNITY

hipEngine: Fast Native Qwen 3.6 Inference for RDNA3 (Strix Halo, 7900 XTX)

hipEngine: open-source ROCm-native inference engine for Qwen 3.6 MoE on AMD RDNA3 GPUs (7900 XTX, Strix Halo).

u/randomfoo2·2 months ago·50 pts / 10 comm

r/LocalLLaMA· COMMUNITY

BitCPM-CANN: Native 1.58-Bit Large Language Model Training on Ascend NPU

BitCPM-CANN demonstrates 1.58-bit ternary quantization training on Huawei Ascend NPUs, addressing extreme low-bit LLM deployment outside CUDA.

u/Aaaaaaaaaeeeee·2 months ago·41 pts / 11 comm

r/LocalLLaMA· COMMUNITY

Qwen3.6-35B-A3B vs Gemma4-26B-A4B

Reddit discussion comparing inference speed/quality tradeoffs between Qwen3.6-35B and Gemma4-26B on consumer GPU hardware.

u/MarcCDB·2 months ago·46 pts / 53 comm

r/LocalLLaMA· COMMUNITY

Qwen3.6-35B-A3B-Uncensored-Genesis-APEX-MTP

Community finetune of Qwen 3.6 35B with quantized weights; testing on consumer hardware shows stability at 200k context.

u/EvilEnginer·2 months ago·53 pts / 30 comm

r/LocalLLaMA· COMMUNITY

TTS Benchmark Comparison (all known TTS up until May 2026)

Community-built open-source TTS benchmark suite with Windows/Mac results; Linux results pending, covers known local TTS tools as of May 2026.

u/UkieTechie·2 months ago·40 pts / 32 comm

r/LocalLLaMA· COMMUNITY

Is there any reason for an uncensored model if you have no interest in roleplaying?

Reddit discussion questioning utility of uncensored models for RAG applications; user reports stability issues vs. base models.

u/vick2djax·2 months ago·69 pts / 133 comm

r/LocalLLaMA· COMMUNITY

llama.cpp server have built-in native tools (exec_shell, edit_file, etc.)

llama.cpp server adds native tool support (shell execution, file ops) via experimental --tools flag.

u/srigi·2 months ago·57 pts / 17 comm

r/LocalLLaMA· COMMUNITY

Run Chrome’s tiny Gemma4 (aka Gemini Nano) directly on PC without GPU

Chrome extension enables local inference of Gemini Nano (Gemma) on CPU-only systems, ~20 tokens/sec on laptop.

u/Some-Cauliflower4902·2 months ago·47 pts / 29 comm

r/singularity· COMMUNITY

coding is basically solved for the boring 90% of tasks

Developer refactored 120-file FastAPI service using DeepSeek V4 and Hunyuan with 80x cost savings vs Opus; open-weight models matched Opus latency but introduced production bugs.

u/Dramatic_Spirit_8436·2 months ago·158 pts / 68 comm

← Front Page50 stories