AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Walmart CEO reportedly brags that company's in-app AI agent is making people spend 35% more money
Nlp

Walmart CEO reportedly brags that company's in-app AI agent is making people spend 35% more money

AI Tools & Products · 4 min ·
Open Source Ai

we just hit 555 stars on our open source AI agent config tool and i'm honestly still in shock

so a while back me and a few folks started working on Caliber, an open source tool for managing AI agent configs and syncing them with yo...

Reddit - Artificial Intelligence · 1 min ·
Robotics

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

Wandb CLI and MCP is atrocious to use with agents for full autonomous research loops. They are slow, clunky, and result in context rot. S...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.19578] Goal-Oriented Influence-Maximizing Data Acquisition for Learning and Optimization
Machine Learning

[2602.19578] Goal-Oriented Influence-Maximizing Data Acquisition for Learning and Optimization

The paper presents Goal-Oriented Influence-Maximizing Data Acquisition (GOIMDA), a novel algorithm for active data acquisition in machine...

arXiv - Machine Learning · 3 min ·
[2602.19651] Denoising Particle Filters: Learning State Estimation with Single-Step Objectives
Machine Learning

[2602.19651] Denoising Particle Filters: Learning State Estimation with Single-Step Objectives

This paper presents a novel particle filtering algorithm for state estimation in robotics, leveraging single-step objectives to improve i...

arXiv - Machine Learning · 3 min ·
[2602.19629] Cooperation After the Algorithm: Designing Human-AI Coexistence Beyond the Illusion of Collaboration
Ai Safety

[2602.19629] Cooperation After the Algorithm: Designing Human-AI Coexistence Beyond the Illusion of Collaboration

The paper discusses the design of human-AI coexistence, emphasizing the need for governance frameworks to ensure responsible collaboratio...

arXiv - AI · 4 min ·
[2602.19623] PedaCo-Gen: Scaffolding Pedagogical Agency in Human-AI Collaborative Video Authoring
Machine Learning

[2602.19623] PedaCo-Gen: Scaffolding Pedagogical Agency in Human-AI Collaborative Video Authoring

PedaCo-Gen is a novel AI system designed to enhance the quality of instructional video creation by integrating pedagogical principles and...

arXiv - AI · 3 min ·
[2602.19605] CLCR: Cross-Level Semantic Collaborative Representation for Multimodal Learning
Ai Safety

[2602.19605] CLCR: Cross-Level Semantic Collaborative Representation for Multimodal Learning

The paper presents CLCR, a novel approach for multimodal learning that organizes features into a three-level semantic hierarchy to enhanc...

arXiv - AI · 4 min ·
[2602.19569] Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering
Ai Safety

[2602.19569] Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering

This paper presents a novel framework for Temporal Question Answering over Temporal Knowledge Graphs, addressing limitations in temporal ...

arXiv - AI · 3 min ·
[2602.19565] DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces
Generative Ai

[2602.19565] DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces

DICArt introduces a novel framework for category-level articulated object pose estimation, utilizing a discrete diffusion process to enha...

arXiv - AI · 4 min ·
[2602.19555] Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains
Llms

[2602.19555] Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains

This article discusses the cybersecurity implications of agentic AI systems, focusing on threats and defenses in runtime supply chains, h...

arXiv - AI · 3 min ·
[2602.19538] Cost-Aware Diffusion Active Search
Generative Ai

[2602.19538] Cost-Aware Diffusion Active Search

The paper presents a novel approach to active search using cost-aware diffusion models, improving efficiency in decision-making for auton...

arXiv - Machine Learning · 4 min ·
[2602.19534] Large Language Model-Assisted UAV Operations and Communications: A Multifaceted Survey and Tutorial
Llms

[2602.19534] Large Language Model-Assisted UAV Operations and Communications: A Multifaceted Survey and Tutorial

This article surveys the integration of Large Language Models (LLMs) in Uncrewed Aerial Vehicles (UAVs), exploring their potential to enh...

arXiv - AI · 4 min ·
[2602.19536] Fore-Mamba3D: Mamba-based Foreground-Enhanced Encoding for 3D Object Detection
Machine Learning

[2602.19536] Fore-Mamba3D: Mamba-based Foreground-Enhanced Encoding for 3D Object Detection

The paper presents Fore-Mamba3D, a novel approach for 3D object detection that enhances foreground encoding while addressing limitations ...

arXiv - AI · 4 min ·
[2602.19372] Seeing Farther and Smarter: Value-Guided Multi-Path Reflection for VLM Policy Optimization
Llms

[2602.19372] Seeing Farther and Smarter: Value-Guided Multi-Path Reflection for VLM Policy Optimization

The paper presents a novel framework for optimizing Vision-Language Models (VLMs) in robotic manipulation tasks, enhancing decision-makin...

arXiv - Machine Learning · 4 min ·
[2602.19491] Botson: An Accessible and Low-Cost Platform for Social Robotics Research
Llms

[2602.19491] Botson: An Accessible and Low-Cost Platform for Social Robotics Research

The paper presents Botson, a low-cost, accessible platform for social robotics research, designed to enhance trust in AI through anthropo...

arXiv - AI · 3 min ·
[2602.19463] PuppetChat: Fostering Intimate Communication through Bidirectional Actions and Micronarratives
Ai Agents

[2602.19463] PuppetChat: Fostering Intimate Communication through Bidirectional Actions and Micronarratives

PuppetChat is a messaging prototype designed to enhance intimate communication by fostering bidirectional actions and creating personaliz...

arXiv - AI · 3 min ·
[2602.19467] Can Large Language Models Replace Human Coders? Introducing ContentBench
Llms

[2602.19467] Can Large Language Models Replace Human Coders? Introducing ContentBench

This article introduces ContentBench, a benchmark suite assessing the ability of low-cost large language models (LLMs) to perform interpr...

arXiv - AI · 4 min ·
[2602.19441] When AI Teammates Meet Code Review: Collaboration Signals Shaping the Integration of Agent-Authored Pull Requests
Robotics

[2602.19441] When AI Teammates Meet Code Review: Collaboration Signals Shaping the Integration of Agent-Authored Pull Requests

This paper investigates how AI-generated pull requests integrate into human-led code review processes, emphasizing the importance of coll...

arXiv - AI · 3 min ·
[2602.19400] Hilbert-Augmented Reinforcement Learning for Scalable Multi-Robot Coverage and Exploration
Nlp

[2602.19400] Hilbert-Augmented Reinforcement Learning for Scalable Multi-Robot Coverage and Exploration

This paper presents a novel framework integrating Hilbert space-filling priors into decentralized multi-robot learning, enhancing coverag...

arXiv - AI · 3 min ·
[2602.19008] Capable but Unreliable: Canonical Path Deviation as a Causal Mechanism of Agent Failure in Long-Horizon Tasks
Ai Agents

[2602.19008] Capable but Unreliable: Canonical Path Deviation as a Causal Mechanism of Agent Failure in Long-Horizon Tasks

This article explores the reliability failures of language agents in long-horizon tasks, attributing these failures to deviations from ca...

arXiv - Machine Learning · 4 min ·
[2602.19326] City Editing: Hierarchical Agentic Execution for Dependency-Aware Urban Geospatial Modification
Ai Agents

[2602.19326] City Editing: Hierarchical Agentic Execution for Dependency-Aware Urban Geospatial Modification

The paper presents a hierarchical framework for urban geospatial modification, enabling efficient urban renewal through agentic systems a...

arXiv - AI · 4 min ·
[2602.19322] US-JEPA: A Joint Embedding Predictive Architecture for Medical Ultrasound
Nlp

[2602.19322] US-JEPA: A Joint Embedding Predictive Architecture for Medical Ultrasound

The paper presents US-JEPA, a novel self-supervised framework for medical ultrasound imaging that enhances representation learning by pre...

arXiv - Machine Learning · 4 min ·
Previous Page 74 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime