The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


Microsoft offers devs a better way to control AI agent behavior

The specification lets developer, compliance and security teams define their own policies for agents to follow in portable policy files.

Ram Iyer · 8 days ago

The Verge AI · PRESS

Microsoft’s Project Solara is an OS for AI agent gadgets

At Build 2026, Microsoft announced "Project Solara," a new OS designed for gadgets that run AI agents. The company is calling it "a new platform built from the ground up to power agent-driven experiences." It's built on Android, not Windows. Microsoft demonstrated two concept Project Solara devices at Build today: a desk concept and a badge concept. The desk concept is an Amazon Echo Show-like device that unlocks with facial recognition and provides access to AI agents. The badge concept is a wearable, the type of badge you'd typically use to access a work building. It has a camera and a fin...

Jay Peters · 8 days ago


Microsoft offers devs a better way to control AI agent behavior

Microsoft’s Project Solara is an OS for AI agent gadgets

Hedge-Bench: Benchmarking Agents on Hard, Realistic Tasks Pertaining to Financial Reasoning

Agent libOS: A Library-OS-Inspired Runtime for Long-Running, Capability-Controlled LLM Agents

RealClawBench: Live OpenClaw Benchmarks from Real Developer-Agent Sessions

GitHub's plan for Agents — Kyle Daigle, GitHub

A Training-Free Mixture-of-Agents Framework for Multi-Document Summarization using LLMs and Knowledge Graphs

EvoDS: Self-Evolving Autonomous Data Science Agent with Skill Learning and Context Management

BigFinanceBench: A Workflow-Grounded Benchmark for Financial-Research Agents

Deploy Self-Evolving Agents for Faster, More Secure Research with a Hermes Agent and NVIDIA NemoClaw

Holo3.1: Fast & Local Computer Use Agents

Cross-Lingual Token Arbitrage: Optimizing Code Agent Context Windows via Local LLM Preprocessing

A 3D Isovist World Model -- Revealing a City's Unseen Geometry and Its Emergent Cross-City Signature

SAGE: A Quantitative Evaluation of Socialized Evolution in Agent Ecosystems

Post-Hoc Robustness for Model-Based Reinforcement Learning

Overlaying Governance: A Compositional Authorization Framework for Delegation and Scope in Agentic AI

DMF: A Deterministic Memory Framework for Conversational AI Agents

What Makes Interaction Trajectories Effective for Training Terminal Agents?

FORGE: Multi-Agent Graduated Exploitation and Detection Engineering

Deploy Agentic-Ready AI at the Edge with Memory Efficiency in NVIDIA JetPack 7.2

Run Local AI Agents with Faster Models and Multi-Node Clustering on NVIDIA DGX Spark

Nvidia chases $200B CPU market with AI agent PCs from Microsoft, Dell, and HP

ClinEnv: An Interactive Multi-Stage Long Horizon EHR Environment for Agents

HERO'S JOURNEY: Testing Complex Rule Induction with Text Games

SkillHarm: Lifecycle-Aware Skill-Based Attacks via Automated Construction

Tracking the Behavioral Trajectories of Adapting Agents

Auditing Asset-Specific Preferences in Financial Large Language Models: Evidence from Bitcoin Representations and Portfolio Allocation

Bridging the Last Mile of Time Series Forecasting with LLM Agents

Ghost Tool Calls: Issue-Time Privacy for Speculative Agent Tools

MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation
