How Elon Musk left OpenAI, according to Greg Brockman
Cutthroat negotiations between startup founders are rarely shared so publicly, especially when a company becomes as world-changing as OpenAI.
Reddit speculation that Elon/xAI rented GPUs to Anthropic, interpreted as signal of competitive pressure and capacity constraints.
Study identifies outlier tokens in Diffusion Transformers that attract disproportionate attention in image generation, affecting both encoder and denoiser layers.
Research shows pretrained language models implicitly distinguish grammaticality from string probability through internal representations, despite surface statistics.
Mira Murati, OpenAI's former CTO, has testified under oath that CEO Sam Altman lied to her about the safety standards for a new AI model. In a video deposition shown during the ongoing Musk v. Altman trial on Wednesday, Murati said Altman falsely stated that OpenAI's legal department determined a new AI model did not need to go through the company's deployment safety board. "As you understand it, was Mr. Altman telling the truth when he made that statement to you?" Murati was asked in the deposition. "No," Murati said. Murati said that during her tenure at OpenAI, Altman made her work more dif...
Grok AI model discovered five new mathematical inequalities and bounds in convex geometry and combinatorics, verified by human authors.
Mathematical analysis refuting Carbery's triangle inequality conjecture for Lp spaces with counterexample and sharp bounds on exponent.
LongSeeker proposes Context-ReAct paradigm for elastic context management in long-horizon search agents, maintaining trajectory at variable detail levels.
Theoretical analysis establishes sharp capacity thresholds for linear associative memory, showing d²∼n log n scaling for top-1 retrieval via phase transition.
Method estimates expected outputs of wide random MLPs without sampling by propagating activation distributions via cumulants and Hermite expansions.
Theoretical framework explains transformers' in-context learning on nonlinear regression by showing attention mechanisms construct polynomial and spline bases.
MRI-Eval benchmark with 1365 items assesses LLM performance on MRI physics and GE scanner operations with tiered difficulty and diagnostic conditions.
Q2RL algorithm extracts Q-functions from behavior cloning for efficient offline-to-online robot learning, preventing policy collapse caused by distribution mismatch.
Design Conductor 2.0 autonomous agent builds hardware accelerators (TurboQuant) in 80 hours using frontier April 2026 models, demonstrating 80x capability scaling over prior work.
First-token confidence (phi_first) from single greedy decode detects LLM hallucinations as effectively as multi-sample semantic self-consistency with lower computational cost.
Geometry-Aware State Space Model applies hyperbolic geometry to whole-slide histopathology image analysis via Multiple Instance Learning, improving patch aggregation for gigapixel resolution.
SemEval-2026 Task 9 system fine-tunes Gemma 3 (12B/27B) per-language with LoRA and GPT-4o-mini synthetic data augmentation for 22-language polarization detection.
Aes3D proposes aesthetic assessment framework for 3D Gaussian Splatting, addressing composition and visual appeal evaluation beyond reconstruction fidelity.
Sparse autoencoders reveal PatchTST uses non-superposed, task-specific representations for time-series forecasting, explaining competitiveness against simple linear models.
SpaceX, Elon Musk's space company that also houses his AI company, xAI, is considering spending $55 billion, at least initially, to build a semiconductor factory in Texas, according to a filing with Grimes County.
In just a few weeks of talks, DeepSeek's potential valuation has reportedly soared from $20 billion to $45 billion.
Comprehensive study of learned image compression design choices balancing perceptual quality and runtime, introducing novel techniques for practical human-visual-system-optimized codecs.
Case study of high-school/undergraduate students using AI tools for financial forecasting research, highlighting human-AI co-mentorship acceleration of learning outcomes.
Coding agent with executable Python world models, verification, and simplicity-bias refactoring solves 25 public ARC-AGI-3 games without task-specific logic.
Koopman operator theory applied to LLM embeddings as dynamical system enables low-cost black-box hallucination detection without sampling or external retrieval.
T-LVMOGP framework scales Multi-Output Gaussian Processes to high-dimensional outputs via transformed latent variables.
Anthropic secures partnership with SpaceX for 300MW+ compute at Colossus 1, adding 220k+ NVIDIA GPUs within one month.
Partnership with SpaceX: Anthropic just doubled the limits. Source: https://x.ai/news/anthropic-compute-partnership
Move comes as CCP Games spends $120M to go independent, rebrands as Fenris Creations.
CausalFlow-T applies DAG-constrained normalizing flows and LLM-driven imputation for treatment effect estimation in incomplete EHR data.