AI Agents
Autonomous agents, tool use, and agentic systems
Top This Week
we just hit 555 stars on our open source AI agent config tool and i'm honestly still in shock
so a while back me and a few folks started working on Caliber, an open source tool for managing AI agent configs and syncing them with yo...
[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.
Wandb CLI and MCP is atrocious to use with agents for full autonomous research loops. They are slow, clunky, and result in context rot. S...
All Content
[2602.19578] Goal-Oriented Influence-Maximizing Data Acquisition for Learning and Optimization
The paper presents Goal-Oriented Influence-Maximizing Data Acquisition (GOIMDA), a novel algorithm for active data acquisition in machine...
[2602.19651] Denoising Particle Filters: Learning State Estimation with Single-Step Objectives
This paper presents a novel particle filtering algorithm for state estimation in robotics, leveraging single-step objectives to improve i...
[2602.19629] Cooperation After the Algorithm: Designing Human-AI Coexistence Beyond the Illusion of Collaboration
The paper discusses the design of human-AI coexistence, emphasizing the need for governance frameworks to ensure responsible collaboratio...
[2602.19623] PedaCo-Gen: Scaffolding Pedagogical Agency in Human-AI Collaborative Video Authoring
PedaCo-Gen is a novel AI system designed to enhance the quality of instructional video creation by integrating pedagogical principles and...
[2602.19605] CLCR: Cross-Level Semantic Collaborative Representation for Multimodal Learning
The paper presents CLCR, a novel approach for multimodal learning that organizes features into a three-level semantic hierarchy to enhanc...
[2602.19569] Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering
This paper presents a novel framework for Temporal Question Answering over Temporal Knowledge Graphs, addressing limitations in temporal ...
[2602.19565] DICArt: Advancing Category-level Articulated Object Pose Estimation in Discrete State-Spaces
DICArt introduces a novel framework for category-level articulated object pose estimation, utilizing a discrete diffusion process to enha...
[2602.19555] Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains
This article discusses the cybersecurity implications of agentic AI systems, focusing on threats and defenses in runtime supply chains, h...
[2602.19538] Cost-Aware Diffusion Active Search
The paper presents a novel approach to active search using cost-aware diffusion models, improving efficiency in decision-making for auton...
[2602.19534] Large Language Model-Assisted UAV Operations and Communications: A Multifaceted Survey and Tutorial
This article surveys the integration of Large Language Models (LLMs) in Uncrewed Aerial Vehicles (UAVs), exploring their potential to enh...
[2602.19536] Fore-Mamba3D: Mamba-based Foreground-Enhanced Encoding for 3D Object Detection
The paper presents Fore-Mamba3D, a novel approach for 3D object detection that enhances foreground encoding while addressing limitations ...
[2602.19372] Seeing Farther and Smarter: Value-Guided Multi-Path Reflection for VLM Policy Optimization
The paper presents a novel framework for optimizing Vision-Language Models (VLMs) in robotic manipulation tasks, enhancing decision-makin...
[2602.19491] Botson: An Accessible and Low-Cost Platform for Social Robotics Research
The paper presents Botson, a low-cost, accessible platform for social robotics research, designed to enhance trust in AI through anthropo...
[2602.19463] PuppetChat: Fostering Intimate Communication through Bidirectional Actions and Micronarratives
PuppetChat is a messaging prototype designed to enhance intimate communication by fostering bidirectional actions and creating personaliz...
[2602.19467] Can Large Language Models Replace Human Coders? Introducing ContentBench
This article introduces ContentBench, a benchmark suite assessing the ability of low-cost large language models (LLMs) to perform interpr...
[2602.19441] When AI Teammates Meet Code Review: Collaboration Signals Shaping the Integration of Agent-Authored Pull Requests
This paper investigates how AI-generated pull requests integrate into human-led code review processes, emphasizing the importance of coll...
[2602.19400] Hilbert-Augmented Reinforcement Learning for Scalable Multi-Robot Coverage and Exploration
This paper presents a novel framework integrating Hilbert space-filling priors into decentralized multi-robot learning, enhancing coverag...
[2602.19008] Capable but Unreliable: Canonical Path Deviation as a Causal Mechanism of Agent Failure in Long-Horizon Tasks
This article explores the reliability failures of language agents in long-horizon tasks, attributing these failures to deviations from ca...
[2602.19326] City Editing: Hierarchical Agentic Execution for Dependency-Aware Urban Geospatial Modification
The paper presents a hierarchical framework for urban geospatial modification, enabling efficient urban renewal through agentic systems a...
[2602.19322] US-JEPA: A Joint Embedding Predictive Architecture for Medical Ultrasound
The paper presents US-JEPA, a novel self-supervised framework for medical ultrasound imaging that enhances representation learning by pre...
Related Topics
Stay updated with AI News
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime