…

Vol. I · No. 57MON, JUN 15, 2026

Archive

The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Anthropic· FRONTIER

Our framework for developing safe and trustworthy agents

Anthropic publishes framework for developing safe and trustworthy autonomous agents with specified governance principles.

Anthropic·10 months ago

OpenAI· FRONTIER

Resolving digital threats 100x faster with OpenAI

Outtake uses GPT-4.1 and OpenAI o3 agents to detect security threats 100x faster.

OpenAI·11 months ago

OpenAI· FRONTIER

Model ML is helping financial firms rebuild with AI from the ground up

Model ML CEO discusses AI-native infrastructure and autonomous agents for financial services transformation.

OpenAI·11 months ago

Hugging Face· INFRA

Back to The Future: Evaluating AI Agents on Predicting Future Events

Hugging Face·11 months ago

OpenAI· FRONTIER

No-code personal agents, powered by GPT-4.1 and Realtime API

Genspark built $36M ARR no-code agent product in 45 days using GPT-4.1 and OpenAI Realtime API.

OpenAI·1 year ago

Hugging Face· INFRA

ScreenSuite - The most comprehensive evaluation suite for GUI Agents!

Hugging Face·1 year ago

Hugging Face· INFRA

CodeAgents + Structure: A Better Way to Execute Actions

Hugging Face·1 year ago

Hugging Face· INFRA

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

Hugging Face·1 year ago

Mistral AI· FRONTIER

Devstral

Devstral: Mistral AI open-source model optimized for autonomous coding agents and software development.

Mistral AI ·1 year ago

Hugging Face· INFRA

Tiny Agents: an MCP-powered agent in 50 lines of code

Hugging Face·1 year ago

OpenAI· FRONTIER

BrowseComp: a benchmark for browsing agents

OpenAI introduces BrowseComp benchmark for evaluating web browsing agent capabilities.

OpenAI·1 year ago

OpenAI· FRONTIER

PaperBench: Evaluating AI’s Ability to Replicate AI Research

PaperBench: new benchmark measuring AI agents' ability to replicate state-of-the-art research papers.

OpenAI·1 year ago

OpenAI· FRONTIER

Moving from intent-based bots to proactive AI agents

OpenAI shifts from intent-based bots to proactive AI agents architecture.

OpenAI·1 year ago

OpenAI· FRONTIER

Automating 90% of finance and legal work with agents

Hebbia's AI platform claims to automate 90% of finance and legal work tasks using OpenAI models.

OpenAI·1 year ago

OpenAI· FRONTIER

Introducing next-generation audio models in the API

OpenAI released advanced text-to-speech and speech-to-text APIs with customizable voice instructions for voice agents.

OpenAI·1 year ago

OpenAI· FRONTIER

New tools for building agents

OpenAI releases new tools for building and deploying AI agents.

OpenAI·1 year ago

xAI· FRONTIER

Grok 3 Beta — The Age of Reasoning Agents

xAI unveils early preview of Grok 3, emphasizing advanced reasoning and agentic capabilities.

xAI·1 year ago

Hugging Face· INFRA

Open-source DeepResearch – Freeing our search agents

Hugging Face·1 year ago

Hugging Face· INFRA

We now support VLMs in smolagents!

Hugging Face·1 year ago

Hugging Face· INFRA

AI Agents Are Here. What Now?

Hugging Face·1 year ago

Hugging Face· INFRA

Introducing smolagents: simple agents that write actions in code.

Hugging Face·1 year ago

Google DeepMind· FRONTIER

Google DeepMind at NeurIPS 2024

Google DeepMind presents NeurIPS 2024 research spanning adaptive agents, 3D scene generation, and LLM training safety.

Google DeepMind·2 years ago

OpenAI· FRONTIER

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

MLE-bench introduces benchmark for evaluating AI agents on machine learning engineering tasks.

OpenAI·2 years ago

OpenAI· FRONTIER

Automating customer support agents

MavenAGI launches GPT-4-powered customer service agent; Tripadvisor, Clickup, Rho deploy for support automation.

OpenAI·2 years ago

Hugging Face· INFRA

License to Call: Introducing Transformers Agents 2.0

Hugging Face·2 years ago

OpenAI· FRONTIER

Klarna's AI assistant does the work of 700 full-time agents

Klarna is using AI to revolutionize personal shopping, customer service, and employee productivity.

OpenAI·2 years ago

Hugging Face· INFRA

Open-source LLMs as LangChain Agents

Hugging Face·2 years ago

Hugging Face· INFRA

Introducing Agents.js: Give tools to your LLMs using JavaScript

Hugging Face·3 years ago

Hugging Face· INFRA

Introducing ⚔️ AI vs. AI ⚔️ a deep reinforcement learning multi-agents competition system

Hugging Face·3 years ago

OpenAI· FRONTIER

Learning to play Minecraft with Video PreTraining

We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small amount of labeled contractor data. With fine-tuning, our model can learn to craft diamond tools, a task that usually takes proficient humans over 20 minutes (24,000 actions). Our model uses the native human interface of keypresses and mouse movements, making it quite general, and represents a step towards general computer-using agents.

OpenAI·4 years ago

← Front Page30 matches

← Newer Older →