AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

What I learned about multi-agent coordination running 9 specialized Claude agents

I've been experimenting with multi-agent AI systems and ended up building something more ambitious than I originally planned: a fully ope...

Reddit - Artificial Intelligence · 1 min · about 7 hours ago

Robotics

What happens when AI agents can earn and spend real money? I built a small test to find out

I've been sitting with a question for a while: what happens when AI agents aren't just tools to be used, but participants in an economy? ...

Reddit - Artificial Intelligence · 1 min · about 9 hours ago

Llms

[2601.00809] A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

Abstract page for arXiv paper 2601.00809: A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

arXiv - AI · 4 min · about 15 hours ago

All Content

Llms

AI Tips and Prompts That Can Take Your Job Search to the Next Level

This article by Bryan Blair discusses how job seekers can effectively use AI tools, particularly large language models, to enhance their ...

AI Tools & Products · 9 min · about 1 month ago

Llms

Google's Gemini AI now handles multi-step tasks on Android

Google's Gemini AI now automates multi-step tasks on Android, enhancing its capabilities to handle rideshare, food, and grocery delivery ...

AI Tools & Products · 6 min · about 1 month ago

Ai Safety

AI chatbots operating in Colorado would have to take steps to protect kids, prevent suicides under bipartisan bill

Colorado's bipartisan bill mandates AI chatbots to protect children by preventing harmful interactions and providing suicide prevention r...

AI Tools & Products · 5 min · about 1 month ago

Generative Ai

Samsung's S26 gives an advance look at what the Google-powered Apple Siri could do

Samsung's Galaxy S26 integrates Google's Gemini AI, enabling advanced autonomous app interactions, while Apple’s Siri upgrade faces delay...

AI Tools & Products · 5 min · about 1 month ago

Llms

[D] Mobile-MCP: Letting LLMs autonomously discover Android app capabilities (no pre-coordination required)

The article discusses Mobile-MCP, a framework allowing LLMs to autonomously discover Android app capabilities without pre-coordination, e...

Reddit - Machine Learning · 1 min · about 1 month ago

Data Science

[2602.08786] Empirically Understanding the Value of Prediction in Allocation

This paper explores the empirical value of prediction in resource allocation, comparing it to other investments like capacity expansion a...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2512.16902] In-Context Algebra

The paper 'In-Context Algebra' explores how transformers can solve arithmetic problems using variable tokens whose meanings are context-d...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2510.11789] Minimax Rates for Learning Pairwise Interactions in Attention-Style Models

This paper examines the convergence rates for learning pairwise interactions in attention-style models, demonstrating a minimax rate that...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2509.14659] Aligning Audio Captions with Human Preferences

The paper presents a novel framework for audio captioning that aligns captions with human preferences using Reinforcement Learning from H...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2502.18615] A Distributional Treatment of Real2Sim2Real for Object-Centric Agent Adaptation in Vision-Driven Deformable Linear Object Manipulation

This article presents a novel framework for adapting object-centric agents in manipulating deformable linear objects using visual percept...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2411.19253] Quantum feedback control with a transformer neural network architecture

This article presents a novel approach to quantum feedback control using transformer neural networks, demonstrating their effectiveness i...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2404.12097] MPC of Uncertain Nonlinear Systems with Meta-Learning for Fast Adaptation of Neural Predictive Models

This paper presents a novel approach to model predictive control (MPC) for uncertain nonlinear systems using a neural state-space model a...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2601.07524] Stagewise Reinforcement Learning and the Geometry of the Regret Landscape

This paper explores Stagewise Reinforcement Learning (SRL) and its relation to the geometry of the regret landscape, demonstrating how le...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2601.02439] WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

WebGym is an innovative open-source environment designed for training visual web agents, featuring nearly 300,000 tasks and a high-throug...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.21104] BRIDGE: Building Representations In Domain Guided Program Synthesis

The paper presents BRIDGE, a framework for improving program synthesis through structured prompting, enhancing correctness and efficiency...

arXiv - Machine Learning · 4 min · about 1 month ago

Nlp

[2512.02435] Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering

This paper presents a novel framework for cross-domain offline reinforcement learning, introducing a method that filters data based on bo...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.07922] SERL: Self-Examining Reinforcement Learning on Open-Domain

The paper introduces Self-Examining Reinforcement Learning (SERL), a novel framework that enhances the performance of large language mode...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2506.21427] Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning

The paper presents the Single-Step Completion Policy (SSCP), a novel approach in reinforcement learning that enhances efficiency and expr...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2503.03178] Active operator learning with predictive uncertainty quantification for partial differential equations

The paper presents a lightweight predictive uncertainty quantification method for neural operators in solving partial differential equati...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2501.16443] Object-Centric World Models from Few-Shot Annotations for Sample-Efficient Reinforcement Learning

The paper presents OC-STORM, an object-centric model-based reinforcement learning framework that enhances sample efficiency by leveraging...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 43 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

What I learned about multi-agent coordination running 9 specialized Claude agents

What happens when AI agents can earn and spend real money? I built a small test to find out

[2601.00809] A Modular Reference Architecture for MCP-Servers Enabling Agentic BIM Interaction

All Content

AI Tips and Prompts That Can Take Your Job Search to the Next Level

Google's Gemini AI now handles multi-step tasks on Android

AI chatbots operating in Colorado would have to take steps to protect kids, prevent suicides under bipartisan bill

Samsung's S26 gives an advance look at what the Google-powered Apple Siri could do

[D] Mobile-MCP: Letting LLMs autonomously discover Android app capabilities (no pre-coordination required)

[2602.08786] Empirically Understanding the Value of Prediction in Allocation

[2512.16902] In-Context Algebra

[2510.11789] Minimax Rates for Learning Pairwise Interactions in Attention-Style Models

[2509.14659] Aligning Audio Captions with Human Preferences

[2502.18615] A Distributional Treatment of Real2Sim2Real for Object-Centric Agent Adaptation in Vision-Driven Deformable Linear Object Manipulation

[2411.19253] Quantum feedback control with a transformer neural network architecture

[2404.12097] MPC of Uncertain Nonlinear Systems with Meta-Learning for Fast Adaptation of Neural Predictive Models

[2601.07524] Stagewise Reinforcement Learning and the Geometry of the Regret Landscape

[2601.02439] WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

[2511.21104] BRIDGE: Building Representations In Domain Guided Program Synthesis

[2512.02435] Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering

[2511.07922] SERL: Self-Examining Reinforcement Learning on Open-Domain

[2506.21427] Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning

[2503.03178] Active operator learning with predictive uncertainty quantification for partial differential equations

[2501.16443] Object-Centric World Models from Few-Shot Annotations for Sample-Efficient Reinforcement Learning

Related Topics

Stay updated with AI News