AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[2506.20964] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

Abstract page for arXiv paper 2506.20964: Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

arXiv - AI · 4 min · about 4 hours ago

Ai Agents

[2601.08323] AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

Abstract page for arXiv paper 2601.08323: AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

arXiv - AI · 3 min · about 4 hours ago

Llms

[2603.18349] Large-Scale Analysis of Persuasive Content on Moltbook

Abstract page for arXiv paper 2603.18349: Large-Scale Analysis of Persuasive Content on Moltbook

arXiv - AI · 3 min · about 4 hours ago

All Content

Ai Agents

We Have 30 AI Agents in Production. Here Are the Top 5 Issues No One Talks About

AI Tools & Products · 14 min · 27 days ago

Robotics

What happens when you give an AI agent a structured mistake log and let it write its own behavioral rules?

I've been running a persistent AI agent as an operational manager for the past couple of weeks. Not a chatbot, not a one-off coding assis...

Reddit - Artificial Intelligence · 1 min · 27 days ago

Ai Agents

[P] Spec-To-Ship: Open source agent to turn markdown specs into code skeletons

We just open sourced a spec to ship AI Agent project! Repo: https://github.com/dakshjain-1616/Spec-To-Ship Specs are a core part of plann...

Reddit - Machine Learning · 1 min · 27 days ago

Llms

[2506.16411] When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework

Abstract page for arXiv paper 2506.16411: When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2602.00428] When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems

Abstract page for arXiv paper 2602.00428: When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent S...

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2512.09085] Mental Models of Autonomy and Sentience Shape Reactions to AI

Abstract page for arXiv paper 2512.09085: Mental Models of Autonomy and Sentience Shape Reactions to AI

arXiv - AI · 4 min · 27 days ago

Llms

[2512.01822] InnoGym: Benchmarking the Innovation Potential of AI Agents

Abstract page for arXiv paper 2512.01822: InnoGym: Benchmarking the Innovation Potential of AI Agents

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2601.04786] AgentOCR: Reimagining Agent History via Optical Self-Compression

Abstract page for arXiv paper 2601.04786: AgentOCR: Reimagining Agent History via Optical Self-Compression

arXiv - Machine Learning · 4 min · 27 days ago

Robotics

[2511.07441] AudAgent: Automated Auditing of Privacy Policy Compliance in AI Agents

Abstract page for arXiv paper 2511.07441: AudAgent: Automated Auditing of Privacy Policy Compliance in AI Agents

arXiv - AI · 4 min · 27 days ago

Llms

[2512.17052] Dynamic Tool Dependency Retrieval for Efficient Function Calling

Abstract page for arXiv paper 2512.17052: Dynamic Tool Dependency Retrieval for Efficient Function Calling

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2512.03324] Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs

Abstract page for arXiv paper 2512.03324: Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs

arXiv - Machine Learning · 4 min · 27 days ago

Generative Ai

[2510.26585] Stop Wasting Your Tokens: Towards Efficient Runtime Multi-Agent Systems

Abstract page for arXiv paper 2510.26585: Stop Wasting Your Tokens: Towards Efficient Runtime Multi-Agent Systems

arXiv - AI · 3 min · 27 days ago

Ai Agents

[2510.26389] Adaptive Context Length Optimization with Low-Frequency Truncation for Multi-Agent Reinforcement Learning

Abstract page for arXiv paper 2510.26389: Adaptive Context Length Optimization with Low-Frequency Truncation for Multi-Agent Reinforcemen...

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2510.15018] UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos

Abstract page for arXiv paper 2510.15018: UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos

arXiv - AI · 4 min · 27 days ago

Llms

[2510.05174] Emergent Coordination in Multi-Agent Language Models

Abstract page for arXiv paper 2510.05174: Emergent Coordination in Multi-Agent Language Models

arXiv - AI · 4 min · 27 days ago

Llms

[2510.02209] StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Abstract page for arXiv paper 2510.02209: StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2510.03253] Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents

Abstract page for arXiv paper 2510.03253: Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2510.01051] GEM: A Gym for Agentic LLMs

Abstract page for arXiv paper 2510.01051: GEM: A Gym for Agentic LLMs

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2508.02948] Sample-Efficient Distributionally Robust Multi-Agent Reinforcement Learning via Online Interaction

Abstract page for arXiv paper 2508.02948: Sample-Efficient Distributionally Robust Multi-Agent Reinforcement Learning via Online Interaction

arXiv - Machine Learning · 3 min · 27 days ago

Nlp

[2508.10760] FROGENT: An End-to-End Full-process Drug Design Multi-Agent System

Abstract page for arXiv paper 2508.10760: FROGENT: An End-to-End Full-process Drug Design Multi-Agent System

arXiv - AI · 4 min · 27 days ago

Previous Page 20 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

[2506.20964] Evidence-based diagnostic reasoning with multi-agent copilot for human pathology

[2601.08323] AtomMem : Learnable Dynamic Agentic Memory with Atomic Memory Operation

[2603.18349] Large-Scale Analysis of Persuasive Content on Moltbook

All Content

We Have 30 AI Agents in Production. Here Are the Top 5 Issues No One Talks About

What happens when you give an AI agent a structured mistake log and let it write its own behavioral rules?

[P] Spec-To-Ship: Open source agent to turn markdown specs into code skeletons

[2506.16411] When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework

[2602.00428] When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems

[2512.09085] Mental Models of Autonomy and Sentience Shape Reactions to AI

[2512.01822] InnoGym: Benchmarking the Innovation Potential of AI Agents

[2601.04786] AgentOCR: Reimagining Agent History via Optical Self-Compression

[2511.07441] AudAgent: Automated Auditing of Privacy Policy Compliance in AI Agents

[2512.17052] Dynamic Tool Dependency Retrieval for Efficient Function Calling

[2512.03324] Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs

[2510.26585] Stop Wasting Your Tokens: Towards Efficient Runtime Multi-Agent Systems

[2510.26389] Adaptive Context Length Optimization with Low-Frequency Truncation for Multi-Agent Reinforcement Learning

[2510.15018] UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos

[2510.05174] Emergent Coordination in Multi-Agent Language Models

[2510.02209] StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

[2510.03253] Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents

[2510.01051] GEM: A Gym for Agentic LLMs

[2508.02948] Sample-Efficient Distributionally Robust Multi-Agent Reinforcement Learning via Online Interaction

[2508.10760] FROGENT: An End-to-End Full-process Drug Design Multi-Agent System

Related Topics

Stay updated with AI News