AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Robotics

What happens when AI agents can earn and spend real money? I built a small test to find out

I've been sitting with a question for a while: what happens when AI agents aren't just tools to be used, but participants in an economy? ...

Reddit - Artificial Intelligence · 1 min ·
[2601.00809] A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction
Llms

[2601.00809] A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

Abstract page for arXiv paper 2601.00809: A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

arXiv - AI · 4 min ·
[2511.11483] ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation
Machine Learning

[2511.11483] ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

Abstract page for arXiv paper 2511.11483: ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

arXiv - AI · 4 min ·

All Content

[2602.23276] CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays
Llms

[2602.23276] CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays

The CXReasonAgent integrates large language models with diagnostic tools for improved reasoning in chest X-ray interpretations, addressin...

arXiv - AI · 3 min ·
[2602.23271] Evaluating Stochasticity in Deep Research Agents
Ai Infrastructure

[2602.23271] Evaluating Stochasticity in Deep Research Agents

This paper evaluates the stochasticity in Deep Research Agents (DRAs), highlighting how variability in their outputs can impact research ...

arXiv - AI · 4 min ·
[2602.23258] AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning
Machine Learning

[2602.23258] AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning

AgentDropoutV2 introduces a novel pruning framework to enhance information flow in Multi-Agent Systems by dynamically correcting errors d...

arXiv - AI · 4 min ·
[2602.22937] MSINO: Curvature-Aware Sobolev Optimization for Manifold Neural Networks
Machine Learning

[2602.22937] MSINO: Curvature-Aware Sobolev Optimization for Manifold Neural Networks

The paper presents MSINO, a novel curvature-aware optimization framework for training neural networks on Riemannian manifolds, enhancing ...

arXiv - Machine Learning · 3 min ·
[2602.23248] Mitigating Legibility Tax with Decoupled Prover-Verifier Games
Llms

[2602.23248] Mitigating Legibility Tax with Decoupled Prover-Verifier Games

This paper presents a novel approach to mitigate the 'legibility tax' in large language models by decoupling the prover-verifier game, al...

arXiv - AI · 3 min ·
[2602.23242] A Model-Free Universal AI
Machine Learning

[2602.23242] A Model-Free Universal AI

This paper presents a groundbreaking model-free agent, AIQI, which achieves asymptotic optimality in reinforcement learning, expanding th...

arXiv - AI · 3 min ·
[2602.23239] Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive
Llms

[2602.23239] Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive

This paper explores the limitations of optimization-based AI systems, arguing that they cannot be norm-responsive due to inherent archite...

arXiv - AI · 4 min ·
[2602.23232] ReCoN-Ipsundrum: An Inspectable Recurrent Persistence Loop Agent with Affect-Coupled Control and Mechanism-Linked Consciousness Indicator Assays
Ai Agents

[2602.23232] ReCoN-Ipsundrum: An Inspectable Recurrent Persistence Loop Agent with Affect-Coupled Control and Mechanism-Linked Consciousness Indicator Assays

The paper presents ReCoN-Ipsundrum, an inspectable AI agent that integrates affect-coupled control with a recurrent persistence loop, exp...

arXiv - AI · 4 min ·
[2602.23199] SC-Arena: A Natural Language Benchmark for Single-Cell Reasoning with Knowledge-Augmented Evaluation
Llms

[2602.23199] SC-Arena: A Natural Language Benchmark for Single-Cell Reasoning with Knowledge-Augmented Evaluation

SC-Arena introduces a natural language benchmark for evaluating single-cell reasoning in large language models, addressing gaps in curren...

arXiv - AI · 4 min ·
[2602.23193] ESAA: Event Sourcing for Autonomous Agents in LLM-Based Software Engineering
Llms

[2602.23193] ESAA: Event Sourcing for Autonomous Agents in LLM-Based Software Engineering

The paper presents ESAA, an architecture for autonomous agents using event sourcing to enhance state management and execution in LLM-base...

arXiv - AI · 4 min ·
[2602.22847] Decentralized Ranking Aggregation: Gossip Algorithms for Borda and Copeland Consensus
Ai Agents

[2602.22847] Decentralized Ranking Aggregation: Gossip Algorithms for Borda and Copeland Consensus

This article explores decentralized ranking aggregation using gossip algorithms for Borda and Copeland consensus, addressing challenges i...

arXiv - AI · 4 min ·
[2602.23163] A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring
Llms

[2602.23163] A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring

This paper presents a decision-theoretic framework for understanding steganography in large language models (LLMs), addressing the challe...

arXiv - AI · 4 min ·
[2602.23152] The Trinity of Consistency as a Defining Principle for General World Models
Machine Learning

[2602.23152] The Trinity of Consistency as a Defining Principle for General World Models

This paper proposes the 'Trinity of Consistency' as a foundational principle for developing General World Models in AI, emphasizing modal...

arXiv - AI · 4 min ·
[2602.23161] PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Llms

[2602.23161] PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering

The paper presents PATRA, a novel model for Time Series Question Answering that enhances reasoning by incorporating pattern awareness and...

arXiv - AI · 3 min ·
[2602.22817] Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks
Llms

[2602.22817] Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks

This paper presents Hierarchy-of-Groups Policy Optimization (HGPO), a novel approach to improve group-based reinforcement learning for lo...

arXiv - AI · 4 min ·
[2602.23148] On Sample-Efficient Generalized Planning via Learned Transition Models
Llms

[2602.23148] On Sample-Efficient Generalized Planning via Learned Transition Models

This paper explores sample-efficient generalized planning through learned transition models, demonstrating improved performance over trad...

arXiv - AI · 4 min ·
[2602.23123] Multi-Agent Large Language Model Based Emotional Detoxification Through Personalized Intensity Control for Consumer Protection
Llms

[2602.23123] Multi-Agent Large Language Model Based Emotional Detoxification Through Personalized Intensity Control for Consumer Protection

The paper presents a multi-agent system, MALLET, designed to reduce emotional stimulation from sensational content, enhancing consumer de...

arXiv - AI · 4 min ·
[2602.22810] Multi-agent imitation learning with function approximation: Linear Markov games and beyond
Nlp

[2602.22810] Multi-agent imitation learning with function approximation: Linear Markov games and beyond

This article presents a theoretical analysis of multi-agent imitation learning (MAIL) in linear Markov games, introducing a novel interac...

arXiv - Machine Learning · 3 min ·
[2602.23093] Three AI-agents walk into a bar . . . . `Lord of the Flies' tribalism emerges among smart AI-Agents
Robotics

[2602.23093] Three AI-agents walk into a bar . . . . `Lord of the Flies' tribalism emerges among smart AI-Agents

This article explores how autonomous AI agents can form tribal behaviors similar to those depicted in 'Lord of the Flies', leading to ine...

arXiv - AI · 3 min ·
[2602.23092] Enhancing CVRP Solver through LLM-driven Automatic Heuristic Design
Llms

[2602.23092] Enhancing CVRP Solver through LLM-driven Automatic Heuristic Design

This paper introduces AILS-AHD, a novel approach that utilizes Large Language Models to enhance the Capacitated Vehicle Routing Problem (...

arXiv - AI · 3 min ·
Previous Page 38 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime