AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Open Source Ai

we just hit 555 stars on our open source AI agent config tool and i'm honestly still in shock

so a while back me and a few folks started working on Caliber, an open source tool for managing AI agent configs and syncing them with yo...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Robotics

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

Wandb CLI and MCP is atrocious to use with agents for full autonomous research loops. They are slow, clunky, and result in context rot. S...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Robotics

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

Wandb CLI and MCP is atrocious to use with agents for full autonomous research loops. They are slow, clunky, and result in context rot. S...

Reddit - Machine Learning · 1 min · about 1 hour ago

All Content

Machine Learning

[2602.20057] AdaWorldPolicy: World-Model-Driven Diffusion Policy with Online Adaptive Learning for Robotic Manipulation

The paper presents AdaWorldPolicy, a novel framework for robotic manipulation that utilizes world models and online adaptive learning to ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2310.01770] A simple connection from loss flatness to compressed neural representations

This article explores the relationship between loss flatness and compressed neural representations, introducing new measures and empirica...

arXiv - AI · 4 min · about 1 month ago

Robotics

[2602.20055] To Move or Not to Move: Constraint-based Planning Enables Zero-Shot Generalization for Interactive Navigation

This paper presents a novel constraint-based planning framework for mobile robots, enabling zero-shot generalization in interactive navig...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20159] A Very Big Video Reasoning Suite

The paper introduces the Very Big Video Reasoning (VBVR) Dataset, a large-scale resource for studying video reasoning capabilities, featu...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20040] AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization

AgenticSum presents a novel framework for improving clinical text summarization using large language models, focusing on reducing factual...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20156] Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

The paper introduces SkillInject, a benchmark for evaluating the vulnerability of LLM agents to skill file attacks, revealing high suscep...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.19969] ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting

The paper presents ReAttn, a novel strategy to enhance attention-based re-ranking in large language models by reducing lexical bias and i...

arXiv - AI · 3 min · about 1 month ago

Ai Agents

[2602.20078] Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning

The paper presents the Descent-Guided Policy Gradient (DG-PG) method, which enhances cooperative multi-agent reinforcement learning by re...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19881] Make Some Noise: Unsupervised Remote Sensing Change Detection Using Latent Space Perturbations

The paper presents MaSoN, an innovative framework for unsupervised change detection in remote sensing that generates diverse changes in l...

arXiv - AI · 4 min · about 1 month ago

Ai Safety

[2602.19872] GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery

The paper presents GOAL, a framework for Continual Generalized Category Discovery (C-GCD) that enhances class discovery while minimizing ...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19984] Multivariate time-series forecasting of ASTRI-Horn monitoring data: A Normal Behavior Model

This article presents a Normal Behavior Model (NBM) for forecasting monitoring data from the ASTRI-Horn telescope, demonstrating effectiv...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.19843] MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems

The paper presents MAS-FIRE, a framework for evaluating the reliability of LLM-based Multi-Agent Systems through fault injection, address...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19919] Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling

The paper presents Janus-Q, an innovative framework for event-driven trading that leverages financial news events as primary decision-mak...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.19816] Depth-Structured Music Recurrence: Budgeted Recurrent Attention for Full-Piece Symbolic Music Modeling

The paper presents Depth-Structured Music Recurrence (DSMR), a novel approach for symbolic music modeling that optimizes long-context pro...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.19702] DReX: An Explainable Deep Learning-based Multimodal Recommendation Framework

DReX is a novel multimodal recommendation framework that enhances user and item representation through explainable deep learning, address...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19775] Exact Discrete Stochastic Simulation with Deep-Learning-Scale Gradient Optimization

The paper presents a novel approach to exact discrete stochastic simulation using deep-learning-scale gradient optimization, enhancing sc...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.19698] Iconographic Classification and Content-Based Recommendation for Digitized Artworks

This article presents a proof-of-concept system for automating iconographic classification and content-based recommendations for digitize...

arXiv - AI · 3 min · about 1 month ago

Robotics

[2602.19679] TeHOR: Text-Guided 3D Human and Object Reconstruction with Textures

TeHOR introduces a novel framework for 3D human and object reconstruction using text descriptions, addressing limitations in current meth...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19674] Continuous Telemonitoring of Heart Failure using Personalised Speech Dynamics

This article presents a novel approach for continuous telemonitoring of heart failure through personalized speech dynamics, showcasing si...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19691] Smoothness Adaptivity in Constant-Depth Neural Networks: Optimal Rates via Smooth Activations

This paper explores the advantages of smooth activation functions in constant-depth neural networks, demonstrating their ability to achie...

arXiv - Machine Learning · 3 min · about 1 month ago

Previous Page 73 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

we just hit 555 stars on our open source AI agent config tool and i'm honestly still in shock

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

All Content

[2602.20057] AdaWorldPolicy: World-Model-Driven Diffusion Policy with Online Adaptive Learning for Robotic Manipulation

[2310.01770] A simple connection from loss flatness to compressed neural representations

[2602.20055] To Move or Not to Move: Constraint-based Planning Enables Zero-Shot Generalization for Interactive Navigation

[2602.20159] A Very Big Video Reasoning Suite

[2602.20040] AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization

[2602.20156] Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

[2602.19969] ReAttn: Improving Attention-based Re-ranking via Attention Re-weighting

[2602.20078] Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning

[2602.19881] Make Some Noise: Unsupervised Remote Sensing Change Detection Using Latent Space Perturbations

[2602.19872] GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery

[2602.19984] Multivariate time-series forecasting of ASTRI-Horn monitoring data: A Normal Behavior Model

[2602.19843] MAS-FIRE: Fault Injection and Reliability Evaluation for LLM-Based Multi-Agent Systems

[2602.19919] Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling

[2602.19816] Depth-Structured Music Recurrence: Budgeted Recurrent Attention for Full-Piece Symbolic Music Modeling

[2602.19702] DReX: An Explainable Deep Learning-based Multimodal Recommendation Framework

[2602.19775] Exact Discrete Stochastic Simulation with Deep-Learning-Scale Gradient Optimization

[2602.19698] Iconographic Classification and Content-Based Recommendation for Digitized Artworks

[2602.19679] TeHOR: Text-Guided 3D Human and Object Reconstruction with Textures

[2602.19674] Continuous Telemonitoring of Heart Failure using Personalised Speech Dynamics

[2602.19691] Smoothness Adaptivity in Constant-Depth Neural Networks: Optimal Rates via Smooth Activations

Related Topics

Stay updated with AI News