AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Agents

Agentic AI capabilities to be integrated into defense platforms by BAE Systems, Scale AI

FALLS CHURCH, Virginia. BAE Systems and Scale AI have signed a strategic relationship agreement to speed the development and fielding of ...

AI News - General · 3 min · about 4 hours ago

Llms

I cut Claude Code's token usage by 68.5% by giving agents their own OS

Al agents are running on infrastructure built for humans. Every state check runs 9 shell commands. Every cold start re-discovers context ...

Reddit - Artificial Intelligence · 1 min · about 18 hours ago

Ai Agents

AMD introduces GAIA agent UI for privacy-first web app for local AI agents

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 18 hours ago

All Content

Llms

[2603.21972] Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

Abstract page for arXiv paper 2603.21972: Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

arXiv - Machine Learning · 4 min · 5 days ago

Machine Learning

[2603.21331] AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search

Abstract page for arXiv paper 2603.21331: AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

Abstract page for arXiv paper 2603.20405: Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

arXiv - Machine Learning · 3 min · 5 days ago

Llms

[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Abstract page for arXiv paper 2603.19220: Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2603.07496] From Thinker to Society: Security in Hierarchical Autonomy Evolution of AI Agents

Abstract page for arXiv paper 2603.07496: From Thinker to Society: Security in Hierarchical Autonomy Evolution of AI Agents

arXiv - AI · 3 min · 5 days ago

Machine Learning

[2602.07150] On Randomness in Agentic Evals

Abstract page for arXiv paper 2602.07150: On Randomness in Agentic Evals

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2601.07148] Measuring Iterative Temporal Reasoning with Time Puzzles

Abstract page for arXiv paper 2601.07148: Measuring Iterative Temporal Reasoning with Time Puzzles

arXiv - AI · 3 min · 5 days ago

Llms

[2511.11828] Conformal Constrained Policy Optimization for Cost-Effective LLM Agents

Abstract page for arXiv paper 2511.11828: Conformal Constrained Policy Optimization for Cost-Effective LLM Agents

arXiv - Machine Learning · 4 min · 5 days ago

Robotics

[2509.08157] Risk-Bounded Multi-Agent Visual Navigation via Iterative Risk Allocation

Abstract page for arXiv paper 2509.08157: Risk-Bounded Multi-Agent Visual Navigation via Iterative Risk Allocation

arXiv - AI · 4 min · 5 days ago

Llms

[2603.11721] When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows

Abstract page for arXiv paper 2603.11721: When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows

arXiv - AI · 4 min · 5 days ago

Llms

[2603.11679] LLMs can construct powerful representations and streamline sample-efficient supervised learning

Abstract page for arXiv paper 2603.11679: LLMs can construct powerful representations and streamline sample-efficient supervised learning

arXiv - AI · 4 min · 5 days ago

Robotics

[2603.11382] Detecting Intrinsic and Instrumental Self-Preservation in Autonomous Agents: The Unified Continuation-Interest Protocol

Abstract page for arXiv paper 2603.11382: Detecting Intrinsic and Instrumental Self-Preservation in Autonomous Agents: The Unified Contin...

arXiv - Machine Learning · 4 min · 5 days ago

Llms

[2603.08388] A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation

Abstract page for arXiv paper 2603.08388: A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Gen...

arXiv - AI · 4 min · 5 days ago

Llms

[2602.01297] RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis

Abstract page for arXiv paper 2602.01297: RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis

arXiv - AI · 4 min · 5 days ago

Ai Agents

[2511.12876] Think, Speak, Decide: Language-Augmented Multi-Agent Reinforcement Learning for Economic Decision-Making

Abstract page for arXiv paper 2511.12876: Think, Speak, Decide: Language-Augmented Multi-Agent Reinforcement Learning for Economic Decisi...

arXiv - AI · 4 min · 5 days ago

Llms

[2511.06626] Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

Abstract page for arXiv paper 2511.06626: Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

arXiv - AI · 4 min · 5 days ago

Machine Learning

[2506.13113] Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning

Abstract page for arXiv paper 2506.13113: Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning

arXiv - AI · 4 min · 5 days ago

Llms

[2603.21613] AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents

Abstract page for arXiv paper 2603.21613: AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents

arXiv - AI · 3 min · 5 days ago

Ai Agents

[2603.21594] Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limited Communications

Abstract page for arXiv paper 2603.21594: Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limi...

arXiv - AI · 4 min · 5 days ago

Ai Agents

[2603.21564] Toward a Theory of Hierarchical Memory for Language Agents

Abstract page for arXiv paper 2603.21564: Toward a Theory of Hierarchical Memory for Language Agents

arXiv - AI · 3 min · 5 days ago

Previous Page 7 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

Agentic AI capabilities to be integrated into defense platforms by BAE Systems, Scale AI

I cut Claude Code's token usage by 68.5% by giving agents their own OS

AMD introduces GAIA agent UI for privacy-first web app for local AI agents

All Content

[2603.21972] Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

[2603.21331] AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search

[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

[2603.07496] From Thinker to Society: Security in Hierarchical Autonomy Evolution of AI Agents

[2602.07150] On Randomness in Agentic Evals

[2601.07148] Measuring Iterative Temporal Reasoning with Time Puzzles

[2511.11828] Conformal Constrained Policy Optimization for Cost-Effective LLM Agents

[2509.08157] Risk-Bounded Multi-Agent Visual Navigation via Iterative Risk Allocation

[2603.11721] When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows

[2603.11679] LLMs can construct powerful representations and streamline sample-efficient supervised learning

[2603.11382] Detecting Intrinsic and Instrumental Self-Preservation in Autonomous Agents: The Unified Continuation-Interest Protocol

[2603.08388] A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation

[2602.01297] RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis

[2511.12876] Think, Speak, Decide: Language-Augmented Multi-Agent Reinforcement Learning for Economic Decision-Making

[2511.06626] Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

[2506.13113] Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning

[2603.21613] AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents

[2603.21594] Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limited Communications

[2603.21564] Toward a Theory of Hierarchical Memory for Language Agents

Related Topics

Stay updated with AI News