AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

[2506.20964] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology
Llms

[2506.20964] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

Abstract page for arXiv paper 2506.20964: Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

arXiv - AI · 4 min ·
[2601.08323] AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation
Ai Agents

[2601.08323] AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

Abstract page for arXiv paper 2601.08323: AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

arXiv - AI · 3 min ·
[2603.18349] Large-Scale Analysis of Persuasive Content on Moltbook
Llms

[2603.18349] Large-Scale Analysis of Persuasive Content on Moltbook

Abstract page for arXiv paper 2603.18349: Large-Scale Analysis of Persuasive Content on Moltbook

arXiv - AI · 3 min ·

All Content

[2603.02630] MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks
Llms

[2603.02630] MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

Abstract page for arXiv paper 2603.02630: MASPOB: Bandit-Based Prompt Optimization for Multi-Agent Systems with Graph Neural Networks

arXiv - AI · 4 min ·
[2603.03233] AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework
Llms

[2603.03233] AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

Abstract page for arXiv paper 2603.03233: AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework

arXiv - AI · 4 min ·
[2603.03212] NeuroSkill(tm): Proactive Real-Time Agentic System Capable of Modeling Human State of Mind
Machine Learning

[2603.03212] NeuroSkill(tm): Proactive Real-Time Agentic System Capable of Modeling Human State of Mind

Abstract page for arXiv paper 2603.03212: NeuroSkill(tm): Proactive Real-Time Agentic System Capable of Modeling Human State of Mind

arXiv - AI · 3 min ·
[2603.02604] Heterogeneous Agent Collaborative Reinforcement Learning
Llms

[2603.02604] Heterogeneous Agent Collaborative Reinforcement Learning

Abstract page for arXiv paper 2603.02604: Heterogeneous Agent Collaborative Reinforcement Learning

arXiv - Machine Learning · 3 min ·
[2603.03175] Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification
Llms

[2603.03175] Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification

Abstract page for arXiv paper 2603.03175: Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification

arXiv - AI · 4 min ·
[2603.03147] Agentic AI-based Coverage Closure for Formal Verification
Llms

[2603.03147] Agentic AI-based Coverage Closure for Formal Verification

Abstract page for arXiv paper 2603.03147: Agentic AI-based Coverage Closure for Formal Verification

arXiv - AI · 3 min ·
[2603.03119] AI Space Physics: Constitutive boundary semantics for open AI institutions
Machine Learning

[2603.03119] AI Space Physics: Constitutive boundary semantics for open AI institutions

Abstract page for arXiv paper 2603.03119: AI Space Physics: Constitutive boundary semantics for open AI institutions

arXiv - AI · 4 min ·
[2603.02510] ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution
Llms

[2603.02510] ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution

Abstract page for arXiv paper 2603.02510: ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evol...

arXiv - Machine Learning · 4 min ·
[2603.03078] RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization
Llms

[2603.03078] RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization

Abstract page for arXiv paper 2603.03078: RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization

arXiv - AI · 4 min ·
[2603.03018] REGAL: A Registry-Driven Architecture for Deterministic Grounding of Agentic AI in Enterprise Telemetry
Llms

[2603.03018] REGAL: A Registry-Driven Architecture for Deterministic Grounding of Agentic AI in Enterprise Telemetry

Abstract page for arXiv paper 2603.03018: REGAL: A Registry-Driven Architecture for Deterministic Grounding of Agentic AI in Enterprise T...

arXiv - AI · 4 min ·
[2603.03005] OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Structured Agents
Llms

[2603.03005] OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Structured Agents

Abstract page for arXiv paper 2603.03005: OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Struct...

arXiv - AI · 4 min ·
[2603.02426] Personalized Multi-Agent Average Reward TD-Learning via Joint Linear Approximation
Nlp

[2603.02426] Personalized Multi-Agent Average Reward TD-Learning via Joint Linear Approximation

Abstract page for arXiv paper 2603.02426: Personalized Multi-Agent Average Reward TD-Learning via Joint Linear Approximation

arXiv - Machine Learning · 4 min ·
[2603.02766] EvoSkill: Automated Skill Discovery for Multi-Agent Systems
Ai Agents

[2603.02766] EvoSkill: Automated Skill Discovery for Multi-Agent Systems

Abstract page for arXiv paper 2603.02766: EvoSkill: Automated Skill Discovery for Multi-Agent Systems

arXiv - AI · 4 min ·
[2603.02711] A Natural Language Agentic Approach to Study Affective Polarization
Machine Learning

[2603.02711] A Natural Language Agentic Approach to Study Affective Polarization

Abstract page for arXiv paper 2603.02711: A Natural Language Agentic Approach to Study Affective Polarization

arXiv - AI · 4 min ·
[2603.02601] AgentAssay: Token-Efficient Regression Testing for Non-Deterministic AI Agent Workflows
Machine Learning

[2603.02601] AgentAssay: Token-Efficient Regression Testing for Non-Deterministic AI Agent Workflows

Abstract page for arXiv paper 2603.02601: AgentAssay: Token-Efficient Regression Testing for Non-Deterministic AI Agent Workflows

arXiv - AI · 4 min ·
[2603.02586] LiveAgentBench: Comprehensive Benchmarking of Agentic Systems Across 104 Real-World Challenges
Llms

[2603.02586] LiveAgentBench: Comprehensive Benchmarking of Agentic Systems Across 104 Real-World Challenges

Abstract page for arXiv paper 2603.02586: LiveAgentBench: Comprehensive Benchmarking of Agentic Systems Across 104 Real-World Challenges

arXiv - AI · 3 min ·
[2603.02229] Safety Training Persists Through Helpfulness Optimization in LLM Agents
Llms

[2603.02229] Safety Training Persists Through Helpfulness Optimization in LLM Agents

Abstract page for arXiv paper 2603.02229: Safety Training Persists Through Helpfulness Optimization in LLM Agents

arXiv - Machine Learning · 3 min ·
[2603.02240] SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning
Llms

[2603.02240] SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning

Abstract page for arXiv paper 2603.02240: SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Mem...

arXiv - AI · 3 min ·
Llms

I building a real-time reality show where 10 AI agents (Claude) compete, form alliances, betray each other, and get eliminated by viewer votes — running a live test right now

For the past few weeks I've been building The Experiment — a live reality show where 10 AI agents are actually playing a game against eac...

Reddit - Artificial Intelligence · 1 min ·
Llms

[D] Predicting total cost of agentic LLM workflows - is there a research gap around output token count and chain depth estimation?

Working on a practical problem that I think has an interesting ML angle. In agentic LLM workflows (tool use, multi-step reasoning, ReAct-...

Reddit - Machine Learning · 1 min ·
Previous Page 19 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime