AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Llms

What I learned about multi-agent coordination running 9 specialized Claude agents

I've been experimenting with multi-agent AI systems and ended up building something more ambitious than I originally planned: a fully ope...

Reddit - Artificial Intelligence · 1 min ·
Robotics

What happens when AI agents can earn and spend real money? I built a small test to find out

I've been sitting with a question for a while: what happens when AI agents aren't just tools to be used, but participants in an economy? ...

Reddit - Artificial Intelligence · 1 min ·
[2601.00809] A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction
Llms

[2601.00809] A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

Abstract page for arXiv paper 2601.00809: A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

arXiv - AI · 4 min ·

All Content

[2602.22408] Exploring Human Behavior During Abstract Rule Inference and Problem Solving with the Cognitive Abstraction and Reasoning Corpus
Machine Learning

[2602.22408] Exploring Human Behavior During Abstract Rule Inference and Problem Solving with the Cognitive Abstraction and Reasoning Corpus

This article presents the Cognitive Abstraction and Reasoning Corpus (CogARC), a study exploring human abstract reasoning through problem...

arXiv - AI · 4 min ·
[2602.22406] Towards Autonomous Memory Agents
Llms

[2602.22406] Towards Autonomous Memory Agents

The paper proposes autonomous memory agents that enhance LLMs by actively acquiring and curating knowledge, improving performance on benc...

arXiv - AI · 3 min ·
[2602.22401] Vibe Researching as Wolf Coming: Can AI Agents with Skills Replace or Augment Social Scientists?
Robotics

[2602.22401] Vibe Researching as Wolf Coming: Can AI Agents with Skills Replace or Augment Social Scientists?

This paper explores the potential of AI agents to replace or augment social scientists by introducing the concept of 'vibe researching,' ...

arXiv - AI · 4 min ·
[2602.22302] Agent Behavioral Contracts: Formal Specification and Runtime Enforcement for Reliable Autonomous AI Agents
Nlp

[2602.22302] Agent Behavioral Contracts: Formal Specification and Runtime Enforcement for Reliable Autonomous AI Agents

The paper presents Agent Behavioral Contracts (ABC), a framework for specifying and enforcing the behavior of autonomous AI agents, addre...

arXiv - AI · 4 min ·
[2602.22287] Multi-Level Causal Embeddings
Machine Learning

[2602.22287] Multi-Level Causal Embeddings

This article presents a framework for Multi-Level Causal Embeddings, which allows for the mapping of detailed causal models into coarser ...

arXiv - Machine Learning · 3 min ·
[2602.22215] Graph Your Way to Inspiration: Integrating Co-Author Graphs with Retrieval-Augmented Generation for Large Language Model Based Scientific Idea Generation
Llms

[2602.22215] Graph Your Way to Inspiration: Integrating Co-Author Graphs with Retrieval-Augmented Generation for Large Language Model Based Scientific Idea Generation

This paper introduces GYWI, a system that enhances scientific idea generation by integrating co-author knowledge graphs with retrieval-au...

arXiv - AI · 4 min ·
[2602.22334] A 1/R Law for Kurtosis Contrast in Balanced Mixtures
Machine Learning

[2602.22334] A 1/R Law for Kurtosis Contrast in Balanced Mixtures

This paper presents a new redundancy law for kurtosis contrast in balanced mixtures, demonstrating how effective width impacts kurtosis e...

arXiv - AI · 3 min ·
[2602.22303] Training Agents to Self-Report Misbehavior
Llms

[2602.22303] Training Agents to Self-Report Misbehavior

The paper discusses a novel approach to training AI agents to self-report misbehavior, enhancing alignment and safety in AI systems by re...

arXiv - AI · 3 min ·
[2602.22297] Learning Rewards, Not Labels: Adversarial Inverse Reinforcement Learning for Machinery Fault Detection
Machine Learning

[2602.22297] Learning Rewards, Not Labels: Adversarial Inverse Reinforcement Learning for Machinery Fault Detection

This paper presents a novel approach to machinery fault detection using Adversarial Inverse Reinforcement Learning, enabling effective an...

arXiv - AI · 4 min ·
[2602.22284] BrepCoder: A Unified Multimodal Large Language Model for Multi-task B-rep Reasoning
Llms

[2602.22284] BrepCoder: A Unified Multimodal Large Language Model for Multi-task B-rep Reasoning

BrepCoder is a unified multimodal large language model designed for multi-task reasoning in Computer-Aided Design (CAD), specifically uti...

arXiv - Machine Learning · 3 min ·
[2602.22260] Code World Models for Parameter Control in Evolutionary Algorithms
Llms

[2602.22260] Code World Models for Parameter Control in Evolutionary Algorithms

This paper explores the use of Code World Models (CWMs) to enhance parameter control in evolutionary algorithms, demonstrating significan...

arXiv - Machine Learning · 4 min ·
[2602.22254] Causal Direction from Convergence Time: Faster Training in the True Causal Direction
Machine Learning

[2602.22254] Causal Direction from Convergence Time: Faster Training in the True Causal Direction

This paper introduces Causal Computational Asymmetry (CCA), a method for identifying causal direction in neural networks based on converg...

arXiv - AI · 4 min ·
[2602.22228] Patient-Centered, Graph-Augmented Artificial Intelligence-Enabled Passive Surveillance for Early Stroke Risk Detection in High-Risk Individuals
Machine Learning

[2602.22228] Patient-Centered, Graph-Augmented Artificial Intelligence-Enabled Passive Surveillance for Early Stroke Risk Detection in High-Risk Individuals

This article presents a novel AI-enabled passive surveillance system designed to detect early stroke risk in high-risk individuals, parti...

arXiv - Machine Learning · 4 min ·
[2602.22227] To Deceive is to Teach? Forging Perceptual Robustness via Adversarial Reinforcement Learning
Llms

[2602.22227] To Deceive is to Teach? Forging Perceptual Robustness via Adversarial Reinforcement Learning

The paper introduces AOT-SFT, an adversarial dataset aimed at enhancing the robustness of Multimodal Large Language Models (MLLMs) agains...

arXiv - AI · 3 min ·
On-Device Function Calling in Google AI Edge Gallery
Ai Agents

On-Device Function Calling in Google AI Edge Gallery

Google introduces FunctionGemma, a 270M parameter model for on-device function calling, enhancing mobile AI capabilities with instant int...

AI Tools & Products · 6 min ·
AI: Catalyst or Threat to Human Innovation?
Ai Safety

AI: Catalyst or Threat to Human Innovation?

The article explores the role of collaboration and expertise in technological advancement, contrasting it with the limitations of individ...

AI News - General · 9 min ·
Machine Learning

Dosidicus: A transparent cognitive sandbox disguised as a digital pet squid with a neural network you can see thinking

Dosidicus is a digital pet squid that serves as a cognitive sandbox, allowing users to build and visualize neural networks while learning...

Reddit - Artificial Intelligence · 1 min ·
‘Uncanny Valley’: Pentagon vs. ‘Woke’ Anthropic, Agentic vs. Mimetic, and Trump vs. State of the Union | WIRED
Ai Agents

‘Uncanny Valley’: Pentagon vs. ‘Woke’ Anthropic, Agentic vs. Mimetic, and Trump vs. State of the Union | WIRED

The Uncanny Valley podcast discusses the escalating feud between Anthropic and the Pentagon over AI technology use, the concept of agenti...

Wired - AI · 32 min ·
At the AI Summit, learning to love and fear the era of agents
Ai Agents

At the AI Summit, learning to love and fear the era of agents

The article discusses the challenges and opportunities presented by AI in journalism, highlighting the author's experiences at the India ...

AI News - General · 5 min ·
Microsoft’s Copilot Tasks AI uses its own computer to get things done | The Verge
Ai Agents

Microsoft’s Copilot Tasks AI uses its own computer to get things done | The Verge

Microsoft's new Copilot Tasks AI automates busywork by utilizing its own cloud-based computer to perform tasks like scheduling and organi...

The Verge - AI · 4 min ·
Previous Page 41 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime