Topic feed
Agents
AI agents, computer-use systems, and automation workflows.
OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments
The Trust Stack: Identity, Reputation, and Accountability for AI Agents - Mitchell Bryson
As AI agents become economic actors - negotiating, transacting, and making commitments - they'll need infrastructure we haven't built yet: verifiable identity, earned reputation, and enforceable accountability.
Introducing the Codex app
Introducing the Codex app for macOS, a command center for AI coding and software development with multiple agents, parallel workflows, and long-running tasks.
AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality
A blog post by IBM Research on Hugging Face