AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Open Source Ai

we just hit 555 stars on our open source AI agent config tool and i'm honestly still in shock

so a while back me and a few folks started working on Caliber, an open source tool for managing AI agent configs and syncing them with yo...

Reddit - Artificial Intelligence · 1 min ·
Robotics

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

Wandb CLI and MCP is atrocious to use with agents for full autonomous research loops. They are slow, clunky, and result in context rot. S...

Reddit - Artificial Intelligence · 1 min ·
Robotics

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

Wandb CLI and MCP is atrocious to use with agents for full autonomous research loops. They are slow, clunky, and result in context rot. S...

Reddit - Machine Learning · 1 min ·

All Content

[2602.20057] AdaWorldPolicy: World-Model-Driven Diffusion Policy with Online Adaptive Learning for Robotic Manipulation
Machine Learning

[2602.20057] AdaWorldPolicy: World-Model-Driven Diffusion Policy with Online Adaptive Learning for Robotic Manipulation

The paper presents AdaWorldPolicy, a novel framework for robotic manipulation that utilizes world models and online adaptive learning to ...

arXiv - AI · 4 min ·
[2310.01770] A simple connection from loss flatness to compressed neural representations
Machine Learning

[2310.01770] A simple connection from loss flatness to compressed neural representations

This article explores the relationship between loss flatness and compressed neural representations, introducing new measures and empirica...

arXiv - AI · 4 min ·
[2602.20055] To Move or Not to Move: Constraint-based Planning Enables Zero-Shot Generalization for Interactive Navigation
Robotics

[2602.20055] To Move or Not to Move: Constraint-based Planning Enables Zero-Shot Generalization for Interactive Navigation

This paper presents a novel constraint-based planning framework for mobile robots, enabling zero-shot generalization in interactive navig...

arXiv - AI · 4 min ·
[2602.20159] A Very Big Video Reasoning Suite
Machine Learning

[2602.20159] A Very Big Video Reasoning Suite

The paper introduces the Very Big Video Reasoning (VBVR) Dataset, a large-scale resource for studying video reasoning capabilities, featu...

arXiv - Machine Learning · 4 min ·
[2602.20040] AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization
Llms

[2602.20040] AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization

AgenticSum presents a novel framework for improving clinical text summarization using large language models, focusing on reducing factual...

arXiv - AI · 3 min ·
[2602.20156] Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks
Llms

[2602.20156] Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

The paper introduces SkillInject, a benchmark for evaluating the vulnerability of LLM agents to skill file attacks, revealing high suscep...

arXiv - Machine Learning · 4 min ·
[2602.19969] ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting
Llms

[2602.19969] ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting

The paper presents ReAttn, a novel strategy to enhance attention-based re-ranking in large language models by reducing lexical bias and i...

arXiv - AI · 3 min ·
[2602.20078] Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning
Ai Agents

[2602.20078] Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning

The paper presents the Descent-Guided Policy Gradient (DG-PG) method, which enhances cooperative multi-agent reinforcement learning by re...

arXiv - Machine Learning · 3 min ·
[2602.19881] Make Some Noise: Unsupervised Remote Sensing Change Detection Using Latent Space Perturbations
Llms

[2602.19881] Make Some Noise: Unsupervised Remote Sensing Change Detection Using Latent Space Perturbations

The paper presents MaSoN, an innovative framework for unsupervised change detection in remote sensing that generates diverse changes in l...

arXiv - AI · 4 min ·
[2602.19872] GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery
Ai Safety

[2602.19872] GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery

The paper presents GOAL, a framework for Continual Generalized Category Discovery (C-GCD) that enhances class discovery while minimizing ...

arXiv - AI · 3 min ·
[2602.19984] Multivariate time-series forecasting of ASTRI-Horn monitoring data: A Normal Behavior Model
Machine Learning

[2602.19984] Multivariate time-series forecasting of ASTRI-Horn monitoring data: A Normal Behavior Model

This article presents a Normal Behavior Model (NBM) for forecasting monitoring data from the ASTRI-Horn telescope, demonstrating effectiv...

arXiv - Machine Learning · 4 min ·
[2602.19843] MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems
Llms

[2602.19843] MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems

The paper presents MAS-FIRE, a framework for evaluating the reliability of LLM-based Multi-Agent Systems through fault injection, address...

arXiv - AI · 4 min ·
[2602.19919] Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling
Machine Learning

[2602.19919] Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling

The paper presents Janus-Q, an innovative framework for event-driven trading that leverages financial news events as primary decision-mak...

arXiv - Machine Learning · 4 min ·
[2602.19816] Depth-Structured Music Recurrence: Budgeted Recurrent Attention for Full-Piece Symbolic Music Modeling
Machine Learning

[2602.19816] Depth-Structured Music Recurrence: Budgeted Recurrent Attention for Full-Piece Symbolic Music Modeling

The paper presents Depth-Structured Music Recurrence (DSMR), a novel approach for symbolic music modeling that optimizes long-context pro...

arXiv - Machine Learning · 4 min ·
[2602.19702] DReX: An Explainable Deep Learning-based Multimodal Recommendation Framework
Machine Learning

[2602.19702] DReX: An Explainable Deep Learning-based Multimodal Recommendation Framework

DReX is a novel multimodal recommendation framework that enhances user and item representation through explainable deep learning, address...

arXiv - AI · 4 min ·
[2602.19775] Exact Discrete Stochastic Simulation with Deep-Learning-Scale Gradient Optimization
Machine Learning

[2602.19775] Exact Discrete Stochastic Simulation with Deep-Learning-Scale Gradient Optimization

The paper presents a novel approach to exact discrete stochastic simulation using deep-learning-scale gradient optimization, enhancing sc...

arXiv - Machine Learning · 3 min ·
[2602.19698] Iconographic Classification and Content-Based Recommendation for Digitized Artworks
Machine Learning

[2602.19698] Iconographic Classification and Content-Based Recommendation for Digitized Artworks

This article presents a proof-of-concept system for automating iconographic classification and content-based recommendations for digitize...

arXiv - AI · 3 min ·
[2602.19679] TeHOR: Text-Guided 3D Human and Object Reconstruction with Textures
Robotics

[2602.19679] TeHOR: Text-Guided 3D Human and Object Reconstruction with Textures

TeHOR introduces a novel framework for 3D human and object reconstruction using text descriptions, addressing limitations in current meth...

arXiv - AI · 3 min ·
[2602.19674] Continuous Telemonitoring of Heart Failure using Personalised Speech Dynamics
Machine Learning

[2602.19674] Continuous Telemonitoring of Heart Failure using Personalised Speech Dynamics

This article presents a novel approach for continuous telemonitoring of heart failure through personalized speech dynamics, showcasing si...

arXiv - AI · 4 min ·
[2602.19691] Smoothness Adaptivity in Constant-Depth Neural Networks: Optimal Rates via Smooth Activations
Machine Learning

[2602.19691] Smoothness Adaptivity in Constant-Depth Neural Networks: Optimal Rates via Smooth Activations

This paper explores the advantages of smooth activation functions in constant-depth neural networks, demonstrating their ability to achie...

arXiv - Machine Learning · 3 min ·
Previous Page 73 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime