The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

AI companies including Nvidia and Mistral urge policymakers to avoid broad restrictions on open-weight AI models as Washington debates responses to Chinese AI and alleged model distillation.

Rebecca Bellan·2 days ago

TechCrunch AI· PRESS

What is Mistral AI? Everything to know about the OpenAI competitor

Mistral AI, which offers some open source AI models, has raised significant funding since its creation in 2023, with the ambition to “put frontier AI in the hands of everyone.”

Anna Heim·22 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Research Entity Extraction and Topic Detection from UKRI Grant Proposals

UKRI metascience project compares GPT-4o, Mistral, and DSIT-Taxonomies for research entity extraction from 42 funding proposal abstracts.

Xingran Ruan·27 days ago

Latent Space· ANALYST

The Professor of Outputmaxxing — Anjney Midha, AMP

We talk about how this legendary investor went from humble beginnings in Singapore to leading rounds in Anthropic, Mistral, Black Forest Labs, and Periodic Labs... and the AMP secret master plan!

Latent Space·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Truth Stays in the Family: Enhancing Contextual Grounding via Inherited Truthful Heads in Model Lineages

Recent advances in large language models (LLMs) have produced many specialized multimodal LLMs (MLLMs) that share common foundational LLMs, forming distinct model lineages. It remains unclear whether a fundamental behavioral link exists between the foundational LLMs and downstream variants. We investigate this question by quantifying head-level context-truthfulness scores. Across diverse LLM and MLLM lineages, including Vicuna-, Qwen2.5-, LLaMA2-, and Mistral-based models, we find that Truth Scores are strongly preserved within model families, even after instruction tuning or multimodal adapt...

Miso Choi·1 month ago

TechCrunch AI· PRESS

Mistral is rumored to be raising €3B at €20B valuation

The funding round would value the company at around €20 billion (about $23.15 billion), nearly double its Series C valuation of €11.7 billion.

Ram Iyer·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models

This study investigates cross-lingual distributional skew (the Shibboleth Effect) in frontier large language models (LLMs) subjected to sustained adversarial conditions. We develop a multi-agent geopolitical wargame, the Cerulean Sea Crisis, a synthetic maritime territorial dispute designed to mirror the structural dynamics of Eastern Mediterranean conflicts. Six frontier models (GPT-4o, Llama-4, Mistral-Large, Gemini-3.1-Pro, Qwen3.6-Plus, and DeepSeek-R1) participate in a between-groups experiment (N = 10 games per arm, K = 5 rounds per game) in which the sole manipulation is the language o...

Hakan Mehmetcik·2 months ago

r/ClaudeAI· COMMUNITY

I used Claude Code to build while delegating coding to Mistral/DeepSeek - 10 days, 57M tokens saved, over 90% costs savings, Claude quality result

User reports delegating Claude Code tasks to Mistral/DeepSeek via vibe-skill tool, achieving 90% cost savings over 10 days while maintaining output quality.

u/pcx_wave·2 months ago·24 pts / 10 comm

r/singularity· COMMUNITY

Mistral AI founder to French Parliament: "Engineers at Mistral no longer write a single line of code

Mistral AI founder tells French Parliament that engineers now manage AI agents writing code instead of writing it themselves, marking a shift in developer workflows.

u/Many_Consequence_337·2 months ago·106 pts / 60 comm

r/LocalLLaMA· COMMUNITY

I catalogued every way local models break JSON output and built a repair library, here's what I found across 288 model calls

Empirical study across 288 model calls identifying JSON output failures in Llama 3, Mistral, Command R, DeepSeek, Qwen; failure modes consistent across open and closed models but vary by rate.

u/kexxty·3 months ago·61 pts / 12 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

EQUITRIAGE: A Fairness Audit of Gender Bias in LLM-Based Emergency Department Triage

Fairness audit of five LLMs (Gemini, GPT-4, DeepSeek, Mistral, Nemotron) on emergency triage reveals gender bias persistence in clinical decision support.

Richard J. Young·3 months ago

r/LocalLLaMA· COMMUNITY

Unsloth solved bug in Mistral Medium 3.5 implementation

Unsloth and Mistral fixed YaRN parsing bug in Mistral Medium 3.5 inference; updated GGUFs released with mscale_all_dim correction.

u/Snail_Inference·3 months ago·49 pts / 12 comm

r/LocalLLaMA· COMMUNITY

Mistral Medium 3.5 128b ggufs are fixed

Unsloth fixes broken GGUF quantizations of Mistral Medium 3.5 128B, resolving long-context degradation issues.

u/Sunija_Dev·3 months ago·55 pts / 13 comm

r/singularity· COMMUNITY

Mistral Medium 3.5: A reliability first open source model from Europe

Mistral releases Medium 3.5, an open-weights model emphasizing reliability and robustness for production deployment.

u/Much_Ask3471·3 months ago·112 pts / 29 comm

r/LocalLLaMA· COMMUNITY

Mistral THICC DENSE BOI. He chonky! More dense models pls.

Reddit discussion praising dense model architectures, expresses preference for continued dense model releases.

u/Porespellar·3 months ago·103 pts / 15 comm

r/LocalLLaMA· COMMUNITY

Mistral Medium 3.5 Launched

Mistral Medium 3.5 launched with modified MIT license restricting commercial use without paid license.

u/DerpSenpai·3 months ago·47 pts / 17 comm

r/LocalLLaMA· COMMUNITY

Mistral Médium 3.5 is here

Mistral Medium 3.5 128B model released on Hugging Face.

u/Kathane37·3 months ago·44 pts / 30 comm

r/LocalLLaMA· COMMUNITY

mistralai/Mistral-Medium-3.5-128B · Hugging Face

Mistral releases Mistral Medium 3.5, a 128B dense model with 256k context window replacing Medium 3.1 and Magistral for instruction, reasoning, and coding tasks.

u/jacek2023·3 months ago·110 pts / 59 comm

Mistral AI· FRONTIER

Remote agents in Vibe. Powered by Mistral Medium 3.5.

Mistral AI launches Mistral Medium 3.5 with remote coding agents in Vibe and Work mode in Le Chat for complex tasks.

Mistral AI·3 months ago

r/LocalLLaMA· COMMUNITY

Mistral-Medium 3.5 (128B) spotted ?

Mistral-Medium 3.5 (128B) model reference discovered in vLLM repository commit, suggesting potential unreleased weight release.

u/tkon3·3 months ago·40 pts / 12 comm

r/LocalLLaMA· COMMUNITY

Mistral Medium Is On The Way

Mistral Medium incoming with 128B parameters; speculation on dense vs. MoE architecture based on Small model naming.

u/Few_Painter_5588·3 months ago·46 pts / 12 comm

r/LocalLLaMA· COMMUNITY

Something from Mistral (Vibe) tomorrow

Mistral teases unspecified announcement (model or tool) for tomorrow; source is social media rumor.

u/pmttyji·3 months ago·100 pts / 22 comm

Mistral AI· FRONTIER

Workflows for work that runs the business

Mistral AI launches Workflows in public preview, enabling automated business process orchestration.

Mistral AI·3 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Variance Is Not Importance: Structural Analysis of Transformer Compressibility Across Model Scales

Empirical study of 40+ transformer compression experiments on GPT-2 and Mistral 7B reveals variance-importance decoupling.

Samuel Salfati·3 months ago

Mistral AI· FRONTIER

Connect the dots: Build with built-in and custom MCPs in Studio

Mistral Studio adds Model Context Protocol support with custom connectors and approval workflows for enterprise data integration.

Mistral AI·3 months ago

Mistral AI· FRONTIER

Spaces: A CLI Built for Humans and Agents

Mistral releases Spaces, a CLI tool designed for both human developers and autonomous agents.

Mistral AI·4 months ago

Mistral AI· FRONTIER

Two users, one CLI: people and agents

Mistral AI shares design philosophy for CLI tools supporting both human users and AI agents, emphasizing unified tooling that improves developer experience.

Mistral AI·4 months ago

Latent Space· ANALYST

Mistral: Voxtral TTS, Forge, Leanstral, & what's next for Mistral 4 — w/ Pavan Kumar Reddy & Guillaume Lample

Mistral is one of the world's leading frontier model labs, and has just launched Voxtral TTS, their latest step in their strategy to offer open frontier intelligence for every modality.

Latent Space·4 months ago

Mistral AI· FRONTIER

Speaking of Voxtral

Mistral open-sources Voxtral, a fast, adaptable TTS model for voice agents with real-time synthesis.

Mistral AI·4 months ago

Mistral AI· FRONTIER

Introducing Forge

Mistral introduces Forge, enabling enterprises to build custom frontier models fine-tuned on proprietary data.

Mistral AI·4 months ago

← Front Page30 matches

Older →

The Archive

As US weighs response to Chinese AI, industry urges against broad open-weight restrictions

What is Mistral AI? Everything to know about the OpenAI competitor

Research Entity Extraction and Topic Detection from UKRI Grant Proposals

The Professor of Outputmaxxing — Anjney Midha, AMP

The Truth Stays in the Family: Enhancing Contextual Grounding via Inherited Truthful Heads in Model Lineages

Mistral is rumored to be raising €3B at €20B valuation

The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models

I used Claude Code to build while delegating coding to Mistral/DeepSeek - 10 days, 57M tokens saved, over 90% costs savings, Claude quality result

Mistral AI founder to French Parliament: "Engineers at Mistral no longer write a single line of code

I catalogued every way local models break JSON output and built a repair library, here's what I found across 288 model calls

EQUITRIAGE: A Fairness Audit of Gender Bias in LLM-Based Emergency Department Triage

Unsloth solved bug in Mistral Medium 3.5 implementation

Mistral Medium 3.5 128b ggufs are fixed

Mistral Medium 3.5: A reliability first open source model from Europe

Mistral THICC DENSE BOI. He chonky! More dense models pls.

Mistral Medium 3.5 Launched

Mistral Médium 3.5 is here

mistralai/Mistral-Medium-3.5-128B · Hugging Face

Remote agents in Vibe. Powered by Mistral Medium 3.5.

Mistral-Medium 3.5 (128B) spotted ?

Mistral Medium Is On The Way

Something from Mistral (Vibe) tomorrow

Workflows for work that runs the business

Variance Is Not Importance: Structural Analysis of Transformer Compressibility Across Model Scales

Connect the dots: Build with built-in and custom MCPs in Studio

Spaces: A CLI Built for Humans and Agents

Two users, one CLI: people and agents

Mistral: Voxtral TTS, Forge, Leanstral, & what's next for Mistral 4 — w/ Pavan Kumar Reddy & Guillaume Lample

Speaking of Voxtral

Introducing Forge