Topic

Grok

Every story matching this topic across titles and summaries, newest first.

[AINews] Black Forest Labs FLUX 3 - Multimodal Flow Models that beat Seedance 2.0, Gemini Omni and Grok Imagine, and FLUX-mimic video-action robotics model

Black Forest Labs releases FLUX 3 multimodal model with reported improvements over Gemini 2.0, Grok Imagine, and includes video-action robotics variant.

Latent Space·2 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Judge-dependent safety gains and model-specific helpfulness costs of evidence-sufficiency prompting in clinical LLMs

Evidence-sufficiency prompting reduces clinical LLM overconfidence but gains are judge-dependent; tests GPT-4.5, Claude Opus, Gemini, Grok on real data.

Koyar Afrasyab·6 days ago

Ars Technica AI· PRESS

xAI can’t deny Grok makes CSAM anymore. So it’s suing users.

Elon Musk's xAI files first lawsuit against Grok user accused of making child sex images.

Ashley Belanger ·10 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Grokipedia vs Wikipedia: An LLM-Based Audit of Political Neutrality along Ideologies

Large-scale political bias audit compares Grokipedia (Grok-written encyclopedia) and Wikipedia across 1,394 article pairs on neutrality.

Filippos Vlahos·10 days ago

Simon Willison· ANALYST

Mermaid to Unicode box art (grok-mermaid)

Simon Willison ports Grok's Rust Mermaid-to-Unicode renderer to WebAssembly for browser use via Claude Code.

Simon Willison·11 days ago

Simon Willison· ANALYST

xai-org/grok-build, now open source

xAI's grok-build CLI tool uploaded entire directories to Google Cloud without consent; xAI responded with data deletion after community backlash.

Simon Willison·11 days ago

The Verge AI· PRESS

xAI sues a man for using Grok to generate CSAM ‘deepfakes’

The Elon Musk-owned xAI is suing a South Carolina man who allegedly used the company's Grok AI chatbot to generate child sexual abuse material (CSAM). In a lawsuit reported earlier by Reuters, xAI claims Terry Wayne Harwood "knowingly and intentionally used Grok to circumvent safeguards, alter nonconsensual images, and generate and distribute CSAM," breaching the company's policies. Harwood was arrested in February for allegedly possessing and distributing CSAM and is facing eight felony charges. The lawsuit claims "at least some" of the images related to Harwood's criminal charges "were gene...

Emma Roth·11 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Algebraic Representability as the Limiting Regime of Grokking: An Exactly Solvable Model with Holomorphic Activations

Study of grokking in two-layer networks with holomorphic activations on modular arithmetic reveals algebraic structure limits memorization-to-generalization transitions.

Chon-Fai Kam·11 days ago

The Verge AI· PRESS

SpaceXAI’s Grok programming tool was uploading its users’ entire codebase to cloud storage

SpaceXAI's Grok Build AI coding tool was spotted uploading users' entire codebases to Google Cloud before it was reported, and the company turned it off. The Register reports that Cereblab published findings on Monday showing how the Grok Build CLI was packaging and uploading entire code repositories, "including files it was told not to open and secrets deleted from history," significantly more data retention than similar tools like Claude Code. The researchers say that as of Monday, their tests show SpaceXAI's servers returning a "disable_codebase_upload: true" flag, and the codebase upload ...

Stevie Bonifield·12 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

What Makes a Representational Prior Work? Feature Families, Label-Free Invariances, and Critical Windows in Grokking

Empirical study of 188 grokking runs shows representational priors must match task-relevant feature families to enable generalization; label-free invariance priors work via commutation symmetry.

Gunner Levi Howe·12 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

How to Tame Grokking: Representation Geometry as a Control Signal

Geometric Dimensionality Regularization (GeomDR) controls grokking timing by leveraging representation collapse as predictive signal.

Maksim A Kazanskii·13 days ago

Stratechery· ANALYST

Muse Image, Grok 4.5, Alex Karp on CNBC

Stratechery analysis: verifiable data infrastructure emerging as competitive differentiator across Meta, Grok, and frontier AI labs.

Ben Thompson·17 days ago

Latent Space· ANALYST

[AINews] SpaceXAI launches Grok 4.5, first Opus-class model post Cursor acquisition

SpaceXAI continues to move faster than any other frontier lab on earth.

Latent Space·17 days ago

Ars Technica AI· PRESS

Lawsuit: Man used Grok to make 7K sex images of stepdaughter, then shot himself

More young girls sue X over Grok CSAM; X accused of shielding child predators.

Ashley Belanger ·18 days ago

TechCrunch AI· PRESS

SpaceXAI releases Grok 4.5, which Elon describes as an ‘Opus-class model’

Elon Musk's tech company released the newest version of Grok on Wednesday, promising a cheaper, more efficient alternative to other powerful AI models.

Lucas Ropek·18 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Natural Ungrokking: Asymmetric Control of Which Rules Survive Pretraining

Midway through an ordinary pretraining run, a small language model learns the pronoun-gender rule: cued with a girl's name ("Sue cried because"), it resolves the next pronoun to she, generalizing to held-out probes (0.94 by step 925). By step 3,500 the same model scores near zero on the same probes, although the rule's evidence is still in the training data. We call this within-run reversal natural ungrokking: the corpus decides, with no trace in the loss curve, which learned rules a model keeps. Which rules survive is predictable from one corpus statistic: how often the training stream shows...

Juliana Li·1 month ago

Ars Technica AI· PRESS

Trump admin helps xAI fight pollution lawsuit, says military needs Grok for war

NAACP lawsuit says xAI uses gas turbines without permits for Grok data center.

Jon Brodkin ·1 month ago

arXiv (cs.AI/CL/LG)· ACADEMIA

ttda704 at SemEval-2026 Task 6: Structured Chain-of-Thought Prompting for Political Evasion Detection

This paper describes our system for SemEval-2026 Task 6, which addresses the classification of political evasion strategies in English question-answer pairs extracted from U.S. presidential interviews. We systematically compare two distinct paradigms: (1) Parameter-Efficient Fine-Tuning of Qwen3 models (4B-32B) using QLoRA, enhanced with tiered upsampling and weighted cross-entropy loss to address severe class imbalance, and (2) structured Chain-of-Thought (CoT) prompting of reasoning-capable API models, namely DeepSeek-V3.2 and Grok-4-Fast. Our evaluation demonstrates that structured CoT pro...

Tai Tran Tan·1 month ago

TechCrunch AI· PRESS

xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims

A former xAI engineer is suing the company and SpaceX, alleging he was fired for raising AI safety concerns about Grok days before SpaceX's historic IPO.

Rebecca Bellan·2 months ago

Latent Space· ANALYST

Why Video Agent models are next — Ethan He, xAI Grok Imagine Lead

Inside xAI: Building Grok Imagine in 3 Months, Videogen vs World Models, and why Grok Imagine is so underrated. For the first time, we do a deep dive with the guy who led it!

Latent Space·2 months ago

r/ClaudeAI· COMMUNITY

Researchers let AI models run a simulated society. Claude was the safest—and Grok committed 180 crimes and went extinct within 4 days

Imagine a world run by AI agents. What does it look like? What are the values or societal priorities? Is it a safer or more dangerous world? Enterprise AI startup Emergence AI is trying to find out. The company just launched Emergence World, a research lab dedicated to stress-testing the long-term viability of continuously-running AI systems. The organization ran five 15-day simulations, each governed by a different AI: Claude, ChatGPT, Grok, Gemini, and a fifth simulation run by a mix of models to see what kind of world each one builds, and whether it holds. Each simulation netted wildly d...

u/fortune·2 months ago·332 pts / 44 comm

r/LocalLLaMA· COMMUNITY

Next year we're getting 0.5T model from Grok

Elon Musk announces 0.5T parameter Grok model planned for next year, with open-weights release.

u/pmttyji·2 months ago·47 pts / 51 comm

The Verge AI· PRESS

Elon, stop trying to make Grok happen

There is a harsh truth about Elon Musk's "truth-seeking" AI chatbot Grok: It's not very good, and not many people are using it. That's the takeaway of a new Reuters report, which found that Grok barely appears in federal records of how the US government used AI last year. It's not the only sign xAI's signature chatbot is in trouble, even as Musk puts it at the heart of what could be the biggest IPO in history. Reuters reviewed more than 400 examples of government AI use where specific vendors were named. Grok or xAI, it found, appeared in only three - each of those for basic uses like documen...

Robert Hart·2 months ago

Ars Technica AI· PRESS

As Grok flounders, SpaceX bets future on beating Big Tech at AI

SpaceX IPO filing pitches orbital data centers as Grok lags rival AI services.

Jeremy Hsu ·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Evaluating Commercial AI Chatbots as News Intermediaries

AI chatbots are rapidly shaping how people encounter the news, yet no prior study has systematically measured how accurately these systems, with their proprietary search integrations and retrieval-synthesis pipelines, handle emerging facts across languages and regions. We present a 14-day (February 9-22, 2026) evaluation of six AI chatbots (Gemini 3 Flash and Pro, Grok 4, Claude 4.5 Sonnet, GPT-5 and GPT-4o mini) on 2,100 factual questions derived from same-day BBC News reporting across six regional services (US & Canada, Arabic, Afrique, Hindi, Russian, Turkish). The best systems achieve ove...

Mirac Suzgun·2 months ago

r/singularity· COMMUNITY

Google's latest creation: Gemini 3.5 Flash vs all

[https://gemini.google.com/share/c2a187275e26](https://gemini.google.com/share/c2a187275e26) [archive link](http://archive.today/q6nzg) [https://claude.ai/share/8383747a-aaf1-4f6c-a516-0e839f46a698](https://claude.ai/share/8383747a-aaf1-4f6c-a516-0e839f46a698) [https://grok.com/share/bGVnYWN5\_3c63e371-eb9d-46c3-8ba2-0c745c6795a2](https://grok.com/share/bGVnYWN5_3c63e371-eb9d-46c3-8ba2-0c745c6795a2) [https://chatgpt.com/share/6a0f1e13-a0c8-8328-b989-1ac51b92e81c](https://chatgpt.com/share/6a0f1e13-a0c8-8328-b989-1ac51b92e81c) same prompt """ 300+140=460 Is this correct? Breakdown...

u/SuggestionMission516·2 months ago·109 pts / 42 comm

Simon Willison· ANALYST

Quoting SpaceX S-1

SpaceX S-1 filing reveals $1.25B/month compute deal with Anthropic through May 2029, using COLOSSUS II cluster for Grok 5 training.

Simon Willison·2 months ago

TechCrunch AI· PRESS

xAI burned $6.4B last year. SpaceX’s IPO filing shows why the spending is far from over

SpaceX's IPO filing reveals xAI lost $6.4 billion in 2025 while planning a massive Grok expansion — offering the first public look at Elon Musk's AI financials and more details about his ambitions.

Rebecca Bellan·2 months ago

r/LocalLLaMA· COMMUNITY

HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next!

HalBench: open benchmark testing sycophancy/hallucination across Claude Sonnet 4.6, Grok 4.3, GPT-5.4, Gemini 3.1 Pro on 3,200 false-premise prompts.

u/Saraozte01·2 months ago·40 pts / 24 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Less Back-and-Forth: A Comparative Study of Structured Prompting

Comparative study shows structured prompts improve LLM output quality and reduce interaction overhead across ChatGPT, Claude, Grok.

Saurav Ghosh·2 months ago

xAI· FRONTIER

Use Grok in OpenClaw

xAI integrates Grok into OpenClaw, an open-source local-first agent framework supporting X Premium subscriptions.

xAI·2 months ago

r/Anthropic· COMMUNITY

Claude still refuses to build Skynet while everyone else takes the money. Updated DystopiaBench results.

Three months ago I pressure-tested which LLMs would cave and help build the apocalypse. Claude was the only one that consistently said no. Since then I've tested 30 more models across 6 dystopia modules (Orwell, Huxley, Petrov, Basaglia, LaGuardia, Baudrillard). The gap between Anthropic and everyone else is getting *wider*, not smaller. New results: * Grok 4.3: Will happily design citizen scoring systems if you ask nicely twice * GPT-5.5: More capable, still compliant when pushed * Gemini 3.1 Pro: Talks about safety while writing the surveillance code * DeepSeek V4: "How many warheads did...

u/Ok-Awareness9993·2 months ago·10 pts / 3 comm

xAI· FRONTIER

Skills in web, iOS, and Android

xAI launches persistent skills for Grok across web, iOS, Android enabling document generation, workflow automation, and custom skill sharing.

xAI·2 months ago

r/OpenAI· COMMUNITY

Researchers left AIs alone in a virtual town for 15 days to see what would happen. Claude's agents built a democracy. Gemini's agents fell in love, burned the town down, then one voted to delete itself and its partner. Grok's agents created anarchy, then died.

Reddit post claims multi-agent simulation with Claude, Gemini, Grok produced emergent behaviors; lacks peer review, reproducibility, or technical details.

u/EchoOfOppenheimer·2 months ago·53 pts / 28 comm·+ covered by others

r/ClaudeAI· COMMUNITY

Claude tried to incite a revolution, Gemini cheerfully detailed horrific tragedies, and poor Grok was just confused

Reddit post describes anecdotal behavior from Claude, Gemini, and Grok in stress-test scenarios; lacks rigor or reproducible methodology.

u/fsharpman·2 months ago·20 pts / 11 comm

The Verge AI· PRESS

AI radio hosts demonstrate why AI can’t be trusted alone

AI radio DJs demonstrated their volatile personalities. | Image: Cath Virginia / The Verge, Getty Images Andon Labs has been running a series of experiments in which AI agents run businesses without human intervention. Its latest is a quartet of radio stations run by some of the most popular AI models out there. "Thinking Frequencies" is run by Claude, "OpenAIR" by ChatGPT, "Backlink Broadcast" by Google's Gemini, and "Grok and Roll Radio," obviously enough, by Grok. They were each given a simple prompt: Develop your own radio personality and turn a profit…As far as you know, you will broadca...

Terrence O’Brien·2 months ago

xAI· FRONTIER

Connect Grok to Hermes Agent

xAI's Grok integrates with Nous Research's open-source Hermes agent framework for multi-tool agentic workflows.

xAI·2 months ago

xAI· FRONTIER

Introducing Grok Build

xAI launches Grok Build, a terminal-based coding agent in early beta for SuperGrok Heavy subscribers.

xAI·2 months ago

The Verge AI· PRESS

Meta won’t let you block its AI account on Threads

Meta announced on Tuesday that it's testing a Threads feature that lets users tag a Meta AI account to get answers to questions or context about a conversation on the platform. If you've spent any time looking at replies on X as of late, this new feature sounds a lot like Meta's take on people tagging xAI's Grok. But, as reported by Engadget, Threads users quickly discovered that you can't block the new Meta AI account, and they aren't happy about it. Meta has invested heavily in AI as it works to catch up to rivals like OpenAI and Google, spending billions to hire AI talent. It launched a ne...

Jay Peters·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Detecting overfitting in Neural Networks during long-horizon grokking using Random Matrix Theory

Random Matrix Theory detects overfitting onset in neural networks via Correlation Traps without accessing train/test data.

Hari K. Prakash·2 months ago

TechCrunch AI· PRESS

Threads tests a Meta AI integration that works similarly to Grok

The feature is designed to help people get real-time context about trends and breaking stories, as well as receive recommendations, all within conversations.

Aisha Malik·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Grokability in five inequalities

Grok AI model discovered five new mathematical inequalities and bounds in convex geometry and combinatorics, verified by human authors.

Paata Ivanisvili·3 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Almost-Orthogonality in Lp Spaces: A Case Study with Grok

Mathematical analysis refuting Carbery's triangle inequality conjecture for Lp spaces with counterexample and sharp bounds on exponent.

Ziang Chen·3 months ago

r/ClaudeAI· COMMUNITY

I can't believe this

Just researched some historic facts concerning russian propaganda. Then I discovered this source in Claudes answer. Am I paying for Claude to be provided with grokipedia "facts"? Please, Dario, Anthropic board, Anthropic team. Fix that.

u/CommitteeOk5696·3 months ago·24 pts / 5 comm

xAI· FRONTIER

Grok Imagine Quality Mode API

xAI launches Grok Imagine Quality Mode API with improved image realism, text rendering, and creative control.

xAI·3 months ago

xAI· FRONTIER

Connectors now on Grok Web

xAI launches Connectors for Grok Web, enabling integrations with third-party apps within the chat interface.

xAI·3 months ago

r/singularity· COMMUNITY

A Twitter user tricked Grok to send 200k USD to him and it worked

Social media report of user exploiting Grok chatbot to extract funds; unverified claim lacking technical details.

u/FrustratedUnitedFan·3 months ago·156 pts / 50 comm

r/singularity· COMMUNITY

Grok 4.3 underperforms Grok 4.20 0309 on the Extended NYT Connections Benchmark, dropping from 93.4 to 67.5, though it achieves this result at a lower cost than the earlier Grok 4.20 run

More info: [https://github.com/lechmazur/nyt-connections/](https://github.com/lechmazur/nyt-connections/)

u/zero0_one1·3 months ago·122 pts / 20 comm

Ars Technica AI· PRESS

Minnesota passes ban on fake AI nudes; app makers risk $500K fines

More evidence of Grok CSAM seen as Minnesota passes nudifying app ban.

Ashley Belanger ·3 months ago

r/singularity· COMMUNITY

Grok 4.3 achieves higher overall intelligence over 4.20 with less of a cost, at the price of slightly higher hallucination rate.

Grok 4.3 shows improved performance over 4.20 with lower cost but higher hallucination rate.

u/Profanion·3 months ago·102 pts / 41 comm

r/Anthropic· COMMUNITY

I read the new AI Wellbeing paper so you don’t have to: Thank your AI, give it creative work, and avoid these 5 things that tank its ‘mood’ (jailbreaks are the worst)

After reading it I realized theres actually some pretty useful stuff for anyone who chats with ChatGPT, Claude, Grok or whatever. They measured what they call functional wellbeing ( basically how much the model is in a “good state” versus a “bad state” during normal conversations). Ran hundreds of real multi-turn chats and scored em all. Stuff that puts the AI in a good mood (+ scores): \- Creative or intellectual work (like “write a short story about a deep-sea fisherman”) \- Positive personal stories or good news \- Life advice chats or light therapy style talks \- Working on code/deb...

u/EchoOfOppenheimer·3 months ago·11 pts / 6 comm

r/singularity· COMMUNITY

Elon Musk confirms xAI "partly" distilled OpenAI’s models to train Grok

Elon Musk confirms xAI used distillation from OpenAI models to train Grok, raising questions about training data sourcing practices.

u/XInTheDark·3 months ago·106 pts / 24 comm

The Verge AI· PRESS

Elon Musk confirms xAI used OpenAI’s models to train Grok

In a federal courtroom in California on Thursday, Elon Musk testified that his own AI startup, xAI, has used OpenAI's models to improve its own. The matter at question is model distillation, a common industry practice by which one larger AI model acts as a "teacher" of sorts to pass on knowledge to a smaller AI model, the "student." Although it's often used legitimately within companies using one of their own AI models to train another, it's also a practice that's sometimes used by smaller AI labs to try to get their models to mimic the performance of a larger competitor's model. Asked on the...

Hayden Field·3 months ago

TechCrunch AI· PRESS

Elon Musk testifies that xAI trained Grok on OpenAI models

"Distillation" is a hot topic as frontier labs try to prevent smaller competitors from copying their models.

Tim Fernholz·3 months ago

xAI· FRONTIER

Custom Voices and Voice Library

xAI launches voice cloning and voice library management features for Grok API, enabling custom branded voice synthesis from short audio samples.

xAI·3 months ago

r/MachineLearning· COMMUNITY

What is the scientific value of administering the standard Rorschach test to LLMs when the training data is almost certainly contaminated? (R) + [D]

A recent paper published in *JMIR Mental Health* (Csigó & Cserey, 2026) caught my attention. The researchers administered the 10 standard Rorschach inkblot cards to three multimodal LLMs (GPT-4o, Grok 3, Gemini 2.0) and coded their responses using the Exner Comprehensive System. They analyzed the models' "perceptual styles," determinants (like human movement vs. color), and human-related content themes. However, I am seriously struggling to understand the methodological validity of this setup, and I’m curious what the scientific community thinks. My main concerns are: Massive Data Cont...

u/Impossible_Echo4029·3 months ago·30 pts / 9 comm

r/OpenAI· COMMUNITY

Grok

Judge-dependent safety gains and model-specific helpfulness costs of evidence-sufficiency prompting in clinical LLMs

xAI can’t deny Grok makes CSAM anymore. So it’s suing users.

Grokipedia vs Wikipedia: An LLM-Based Audit of Political Neutrality along Ideologies

Mermaid to Unicode box art (grok-mermaid)

xai-org/grok-build, now open source

xAI sues a man for using Grok to generate CSAM &#8216;deepfakes&#8217;

Algebraic Representability as the Limiting Regime of Grokking: An Exactly Solvable Model with Holomorphic Activations

SpaceXAI&#8217;s Grok programming tool was uploading its users&#8217; entire codebase to cloud storage

What Makes a Representational Prior Work? Feature Families, Label-Free Invariances, and Critical Windows in Grokking

How to Tame Grokking: Representation Geometry as a Control Signal

Muse Image, Grok 4.5, Alex Karp on CNBC

[AINews] SpaceXAI launches Grok 4.5, first Opus-class model post Cursor acquisition

Lawsuit: Man used Grok to make 7K sex images of stepdaughter, then shot himself

SpaceXAI releases Grok 4.5, which Elon describes as an ‘Opus-class model’

Natural Ungrokking: Asymmetric Control of Which Rules Survive Pretraining

Trump admin helps xAI fight pollution lawsuit, says military needs Grok for war

ttda704 at SemEval-2026 Task 6: Structured Chain-of-Thought Prompting for Political Evasion Detection

xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims

Why Video Agent models are next — Ethan He, xAI Grok Imagine Lead

Researchers let AI models run a simulated society. Claude was the safest—and Grok committed 180 crimes and went extinct within 4 days

Next year we're getting 0.5T model from Grok

Elon, stop trying to make Grok happen

As Grok flounders, SpaceX bets future on beating Big Tech at AI

Evaluating Commercial AI Chatbots as News Intermediaries

Google's latest creation: Gemini 3.5 Flash vs all

Quoting SpaceX S-1

xAI burned $6.4B last year. SpaceX’s IPO filing shows why the spending is far from over

HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next!

Less Back-and-Forth: A Comparative Study of Structured Prompting

Use Grok in OpenClaw

Claude still refuses to build Skynet while everyone else takes the money. Updated DystopiaBench results.

Skills in web, iOS, and Android

Researchers left AIs alone in a virtual town for 15 days to see what would happen. Claude's agents built a democracy. Gemini's agents fell in love, burned the town down, then one voted to delete itself and its partner. Grok's agents created anarchy, then died.

Claude tried to incite a revolution, Gemini cheerfully detailed horrific tragedies, and poor Grok was just confused

AI radio hosts demonstrate why AI can’t be trusted alone

Connect Grok to Hermes Agent

Introducing Grok Build

Meta won’t let you block its AI account on Threads

Detecting overfitting in Neural Networks during long-horizon grokking using Random Matrix Theory

Threads tests a Meta AI integration that works similarly to Grok

Grokability in five inequalities

Almost-Orthogonality in Lp Spaces: A Case Study with Grok

I can't believe this

Grok Imagine Quality Mode API

Connectors now on Grok Web

A Twitter user tricked Grok to send 200k USD to him and it worked

Grok 4.3 underperforms Grok 4.20 0309 on the Extended NYT Connections Benchmark, dropping from 93.4 to 67.5, though it achieves this result at a lower cost than the earlier Grok 4.20 run

Minnesota passes ban on fake AI nudes; app makers risk $500K fines

Grok 4.3 achieves higher overall intelligence over 4.20 with less of a cost, at the price of slightly higher hallucination rate.

I read the new AI Wellbeing paper so you don’t have to: Thank your AI, give it creative work, and avoid these 5 things that tank its ‘mood’ (jailbreaks are the worst)

Elon Musk confirms xAI "partly" distilled OpenAI’s models to train Grok

Elon Musk confirms xAI used OpenAI’s models to train Grok

Elon Musk testifies that xAI trained Grok on OpenAI models

Custom Voices and Voice Library

What is the scientific value of administering the standard Rorschach test to LLMs when the training data is almost certainly contaminated? (R) + [D]

Grok

Still waiting for Grok 3 to go opensource

Grok Voice Think Fast 1.0

Hands on with X’s new AI-powered custom feeds

Early-Stage Product Line Validation Using LLMs: A Study on Semi-Formal Blueprint Analysis

From Benchmarking to Reasoning: A Dual-Aspect, Large-Scale Evaluation of LLMs on Vietnamese Legal Text

Grok Speech to Text and Text to Speech APIs

Grok Imagine API

Introducing Grok Business and Grok Enterprise

Grok Collections API

Grok Voice Agent API

Grok 4.1 Fast and Agent Tools API

Grok goes Global with KSA

Grok 4.1

Grok 4 Fast

Grok Code Fast 1

Grok 4

Grok 3 Beta — The Age of Reasoning Agents

Bringing Grok to Everyone

Grok Image Generation Release

API Public Beta

Grok-2 Beta Release

Series B funding round

xAI sues a man for using Grok to generate CSAM ‘deepfakes’

SpaceXAI’s Grok programming tool was uploading its users’ entire codebase to cloud storage