AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Agentic AI capabilities to be integrated into defense platforms by BAE Systems, Scale AI
AI Agents

FALLS CHURCH, Virginia. BAE Systems and Scale AI have signed a strategic relationship agreement to speed the development and fielding of ...

AI News - General · 3 min ·
LLMs

I cut Claude Code's token usage by 68.5% by giving agents their own OS

AI agents are running on infrastructure built for humans. Every state check runs 9 shell commands. Every cold start re-discovers context ...

Reddit - Artificial Intelligence · 1 min ·
AI Agents

AMD introduces GAIA agent UI for privacy-first web app for local AI agents

Submitted by /u/Fcking_Chuck

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.21972] Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe
LLMs

Abstract page for arXiv paper 2603.21972: Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

arXiv - Machine Learning · 4 min ·
[2603.21331] AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search
Machine Learning

Abstract page for arXiv paper 2603.21331: AutoKernel: Autonomous GPU Kernel Optimization via Iterative Agent-Driven Search

arXiv - Machine Learning · 4 min ·
[2603.20405] Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP
LLMs

Abstract page for arXiv paper 2603.20405: Putnam 2025 Problems in Rocq using Opus 4.6 and Rocq-MCP

arXiv - Machine Learning · 3 min ·
[2603.19220] Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
LLMs

Abstract page for arXiv paper 2603.19220: Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

arXiv - Machine Learning · 4 min ·
[2603.07496] From Thinker to Society: Security in Hierarchical Autonomy Evolution of AI Agents
LLMs

Abstract page for arXiv paper 2603.07496: From Thinker to Society: Security in Hierarchical Autonomy Evolution of AI Agents

arXiv - AI · 3 min ·
[2602.07150] On Randomness in Agentic Evals
Machine Learning

Abstract page for arXiv paper 2602.07150: On Randomness in Agentic Evals

arXiv - Machine Learning · 4 min ·
[2601.07148] Measuring Iterative Temporal Reasoning with Time Puzzles
LLMs

Abstract page for arXiv paper 2601.07148: Measuring Iterative Temporal Reasoning with Time Puzzles

arXiv - AI · 3 min ·
[2511.11828] Conformal Constrained Policy Optimization for Cost-Effective LLM Agents
LLMs

Abstract page for arXiv paper 2511.11828: Conformal Constrained Policy Optimization for Cost-Effective LLM Agents

arXiv - Machine Learning · 4 min ·
[2509.08157] Risk-Bounded Multi-Agent Visual Navigation via Iterative Risk Allocation
Robotics

Abstract page for arXiv paper 2509.08157: Risk-Bounded Multi-Agent Visual Navigation via Iterative Risk Allocation

arXiv - AI · 4 min ·
[2603.11721] When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows
LLMs

Abstract page for arXiv paper 2603.11721: When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows

arXiv - AI · 4 min ·
[2603.11679] LLMs can construct powerful representations and streamline sample-efficient supervised learning
LLMs

Abstract page for arXiv paper 2603.11679: LLMs can construct powerful representations and streamline sample-efficient supervised learning

arXiv - AI · 4 min ·
[2603.11382] Detecting Intrinsic and Instrumental Self-Preservation in Autonomous Agents: The Unified Continuation-Interest Protocol
Robotics

Abstract page for arXiv paper 2603.11382: Detecting Intrinsic and Instrumental Self-Preservation in Autonomous Agents: The Unified Contin...

arXiv - Machine Learning · 4 min ·
[2603.08388] A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation
LLMs

Abstract page for arXiv paper 2603.08388: A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Gen...

arXiv - AI · 4 min ·
[2602.01297] RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis
LLMs

Abstract page for arXiv paper 2602.01297: RE-MCDF: Closed-Loop Multi-Expert LLM Reasoning for Knowledge-Grounded Clinical Diagnosis

arXiv - AI · 4 min ·
[2511.12876] Think, Speak, Decide: Language-Augmented Multi-Agent Reinforcement Learning for Economic Decision-Making
AI Agents

Abstract page for arXiv paper 2511.12876: Think, Speak, Decide: Language-Augmented Multi-Agent Reinforcement Learning for Economic Decisi...

arXiv - AI · 4 min ·
[2511.06626] Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives
LLMs

Abstract page for arXiv paper 2511.06626: Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

arXiv - AI · 4 min ·
[2506.13113] Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning
Machine Learning

Abstract page for arXiv paper 2506.13113: Dynamic Reinsurance Treaty Bidding via Multi-Agent Reinforcement Learning

arXiv - AI · 4 min ·
[2603.21613] AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents
LLMs

Abstract page for arXiv paper 2603.21613: AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents

arXiv - AI · 3 min ·
[2603.21594] Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limited Communications
AI Agents

Abstract page for arXiv paper 2603.21594: Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limi...

arXiv - AI · 4 min ·
[2603.21564] Toward a Theory of Hierarchical Memory for Language Agents
AI Agents

Abstract page for arXiv paper 2603.21564: Toward a Theory of Hierarchical Memory for Language Agents

arXiv - AI · 3 min ·