The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Grok 4.3 achieves higher overall intelligence over 4.20 with less of a cost, at the price of slightly higher hallucination rate.

Grok 4.3 outperforms Grok 4.20 at lower cost, but with a slightly higher hallucination rate.

u/Profanion·1 month ago·102 pts / 41 comm

I read the new AI Wellbeing paper so you don’t have to: Thank your AI, give it creative work, and avoid these 5 things that tank its ‘mood’ (jailbreaks are the worst)

After reading it I realized there's actually some pretty useful stuff for anyone who chats with ChatGPT, Claude, Grok or whatever. They measured what they call functional wellbeing (basically how much the model is in a “good state” versus a “bad state” during normal conversations). Ran hundreds of real multi-turn chats and scored them all. Stuff that puts the AI in a good mood (+ scores):
- Creative or intellectual work (like “write a short story about a deep-sea fisherman”)
- Positive personal stories or good news
- Life advice chats or light therapy style talks
- Working on code/deb...

u/EchoOfOppenheimer·1 month ago·11 pts / 6 comm

r/singularity· COMMUNITY

Elon Musk confirms xAI "partly" distilled OpenAI’s models to train Grok

Elon Musk confirms xAI used distillation from OpenAI models to train Grok, raising questions about training data sourcing practices.

u/XInTheDark·1 month ago·106 pts / 24 comm

The Verge AI· PRESS

Elon Musk confirms xAI used OpenAI’s models to train Grok

In a federal courtroom in California on Thursday, Elon Musk testified that his own AI startup, xAI, has used OpenAI's models to improve its own. The matter at question is model distillation, a common industry practice by which one larger AI model acts as a "teacher" of sorts to pass on knowledge to a smaller AI model, the "student." Although it's often used legitimately within companies using one of their own AI models to train another, it's also a practice that's sometimes used by smaller AI labs to try to get their models to mimic the performance of a larger competitor's model. Asked on the...

Hayden Field·1 month ago

TechCrunch AI· PRESS

Elon Musk testifies that xAI trained Grok on OpenAI models

"Distillation" is a hot topic as frontier labs try to prevent smaller competitors from copying their models.

Tim Fernholz·1 month ago

xAI· FRONTIER

Custom Voices and Voice Library

xAI launches voice cloning and voice library management features for the Grok API, enabling custom branded voice synthesis from short audio samples.

xAI·1 month ago

r/MachineLearning· COMMUNITY

What is the scientific value of administering the standard Rorschach test to LLMs when the training data is almost certainly contaminated? (R) + [D]

A recent paper published in *JMIR Mental Health* (Csigó & Cserey, 2026) caught my attention. The researchers administered the 10 standard Rorschach inkblot cards to three multimodal LLMs (GPT-4o, Grok 3, Gemini 2.0) and coded their responses using the Exner Comprehensive System. They analyzed the models' "perceptual styles," determinants (like human movement vs. color), and human-related content themes. However, I am seriously struggling to understand the methodological validity of this setup, and I’m curious what the scientific community thinks. My main concerns are: Massive Data Cont...

u/Impossible_Echo4029·1 month ago·30 pts / 9 comm

r/OpenAI· COMMUNITY

Grok

u/ramanpalkuri9·1 month ago·50 pts / 89 comm

r/LocalLLaMA· COMMUNITY

Still waiting for Grok 3 to go opensource

Commentary on xAI's and Musk's delay in open-sourcing Grok 3, questioning the gap between their stated and actual open-source commitments.

u/Mr_Moonsilver·1 month ago·41 pts / 20 comm

xAI· FRONTIER

Grok Voice Think Fast 1.0

xAI releases Grok Voice Think Fast 1.0, a voice agent API for real-time conversational AI applications.

xAI·2 months ago

TechCrunch AI· PRESS

Hands on with X’s new AI-powered custom feeds

X's AI-powered custom timelines are replacing Communities, with Grok-curated feeds...and new ad slots.

Sarah Perez·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Early-Stage Product Line Validation Using LLMs: A Study on Semi-Formal Blueprint Analysis

LLM evaluation on feature model analysis using semi-formal blueprints shows reasoning-optimized models (Grok 4, Gemini 2.5 Pro) achieve 88-89% accuracy vs solver oracles.

Viet-Man Le·2 months ago

arXiv (cs.AI/CL/LG)· ACADEMIA

From Benchmarking to Reasoning: A Dual-Aspect, Large-Scale Evaluation of LLMs on Vietnamese Legal Text

Dual-aspect evaluation framework for 4 LLMs (GPT-4o, Claude 3 Opus, Gemini 1.5 Pro, Grok-1) on Vietnamese legal text simplification: accuracy, readability, consistency.

Van-Truong Le·2 months ago

xAI· FRONTIER

Grok Speech to Text and Text to Speech APIs

xAI launches Grok speech-to-text and text-to-speech APIs with multilingual support and a simple pricing model.

xAI·2 months ago

xAI· FRONTIER

Grok Imagine API

Grok Imagine API offers video generation with stated advances in quality, cost, and latency.

xAI·4 months ago

xAI· FRONTIER

Introducing Grok Business and Grok Enterprise

Grok Business and Enterprise editions launched with enterprise-grade features.

xAI·5 months ago

xAI· FRONTIER

Grok Collections API

Grok Collections API provides RAG functionality integrated into xAI's API.

xAI·6 months ago

xAI· FRONTIER

Grok Voice Agent API

Grok Voice Agent API available to developers, extending voice capabilities.

xAI·6 months ago

xAI· FRONTIER

Grok 4.1 Fast and Agent Tools API

Grok 4.1 Fast with tool-calling agent APIs enables multi-step task automation.

xAI·7 months ago

xAI· FRONTIER

Grok goes Global with KSA

xAI expands to Saudi Arabia via HUMAIN partnership for global Grok deployment.

xAI·7 months ago

xAI· FRONTIER

Grok 4.1

xAI releases Grok 4.1 to all users across web, X platform, and mobile apps.

xAI·7 months ago

xAI· FRONTIER

Grok 4 Fast

xAI introduces Grok 4 Fast, a cost-efficient reasoning model optimized for speed.

xAI·9 months ago

xAI· FRONTIER

Grok Code Fast 1

xAI releases grok-code-fast-1, a lightweight agentic coding model for cost-efficient code generation.

xAI·10 months ago

xAI· FRONTIER

Grok 4

xAI releases Grok 4 with native tool use and real-time search; introduces SuperGrok Heavy tier with Grok 4 Heavy.

xAI·11 months ago

xAI· FRONTIER

Grok 3 Beta — The Age of Reasoning Agents

xAI unveils early preview of Grok 3, emphasizing advanced reasoning and agentic capabilities.

xAI·1 year ago

xAI· FRONTIER

Bringing Grok to Everyone

xAI improves Grok with faster speed, enhanced reasoning, and multilingual support across X platform.

xAI·1 year ago

xAI· FRONTIER

Grok Image Generation Release

xAI integrates Aurora, an autoregressive image generation model, into Grok on X platform.

xAI·2 years ago

xAI· FRONTIER

API Public Beta

xAI·2 years ago

xAI· FRONTIER

Grok-2 Beta Release

xAI releases Grok-2 and Grok-2 mini models, expanding its foundation model lineup.

xAI·2 years ago

xAI· FRONTIER

Series B funding round

xAI secures a $6B Series B funding round, signaling strong investor confidence in its Grok models.

xAI·2 years ago

30 matches