AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

AWS turns its S3 storage service into a file system for AI agents
Nlp

AWS turns its S3 storage service into a file system for AI agents

AI News - General ·
Moody’s Integrates AI Agents With Anthropic’s Claude
Llms

Moody’s Integrates AI Agents With Anthropic’s Claude

AI Tools & Products · 4 min ·
Llms

Started a video series on building an orchestration layer for LLM post-training [P]

Hi everyone! Context, motivation, a lot of yapping, feel free to skip to TL;DR. A while back I posted here asking [D] What framework do y...

Reddit - Machine Learning · 1 min ·

All Content

[2507.10134] FRSICL: LLM-Enabled In-Context Learning Flight Resource Allocation for Fresh Data Collection in UAV-Assisted Wildfire Monitoring
Llms

[2507.10134] FRSICL: LLM-Enabled In-Context Learning Flight Resource Allocation for Fresh Data Collection in UAV-Assisted Wildfire Monitoring

The paper presents FRSICL, a novel method utilizing LLMs for optimizing UAV data collection in wildfire monitoring, enhancing efficiency ...

arXiv - AI · 4 min ·
[2602.10452] Distributed Online Convex Optimization with Nonseparable Costs and Constraints
Machine Learning

[2602.10452] Distributed Online Convex Optimization with Nonseparable Costs and Constraints

This paper explores distributed online convex optimization with nonseparable costs and constraints, presenting a novel algorithm that imp...

arXiv - Machine Learning · 4 min ·
[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
Ai Infrastructure

[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety

OpenAgentSafety introduces a modular framework for evaluating AI agent safety in real-world tasks, addressing critical vulnerabilities in...

arXiv - AI · 4 min ·
[2602.02236] Online Fine-Tuning of Pretrained Controllers for Autonomous Driving via Real-Time Recurrent RL
Machine Learning

[2602.02236] Online Fine-Tuning of Pretrained Controllers for Autonomous Driving via Real-Time Recurrent RL

The paper discusses the use of Real-Time Recurrent Reinforcement Learning (RTRRL) for fine-tuning pretrained controllers in autonomous dr...

arXiv - Machine Learning · 3 min ·
[2602.01664] FlowSteer: Interactive Agentic Workflow Orchestration via End-to-End Reinforcement Learning
Llms

[2602.01664] FlowSteer: Interactive Agentic Workflow Orchestration via End-to-End Reinforcement Learning

FlowSteer introduces an end-to-end reinforcement learning framework for automating workflow orchestration, addressing challenges like man...

arXiv - Machine Learning · 4 min ·
[2602.15814] Avey-B
Nlp

[2602.15814] Avey-B

The paper 'Avey-B' presents a reformulated architecture for Avey, an autoregressive, attention-free model, demonstrating superior perform...

arXiv - AI · 3 min ·
[2510.17886] Graphical model for factorization and completion of relatively high rank tensors by sparse sampling
Machine Learning

[2510.17886] Graphical model for factorization and completion of relatively high rank tensors by sparse sampling

This paper presents a graphical model for factorization and completion of high-rank tensors using sparse sampling, addressing challenges ...

arXiv - Machine Learning · 4 min ·
[2510.04407] Scale-Invariant Regret Matching and Online Learning with Optimal Convergence: Bridging Theory and Practice in Zero-Sum Games
Machine Learning

[2510.04407] Scale-Invariant Regret Matching and Online Learning with Optimal Convergence: Bridging Theory and Practice in Zero-Sum Games

This paper presents a new scale-invariant variant of predictive regret matching, called IREG-PRM+, which bridges the gap between theoreti...

arXiv - Machine Learning · 4 min ·
[2602.15767] Robot-Assisted Social Dining as a White Glove Service
Robotics

[2602.15767] Robot-Assisted Social Dining as a White Glove Service

The paper explores robot-assisted dining for individuals with disabilities, emphasizing the need for robots that can adapt to social dini...

arXiv - AI · 3 min ·
[2602.15733] MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction
Robotics

[2602.15733] MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction

MeshMimic introduces a novel framework for humanoid motion learning by integrating 3D scene reconstruction with motion control, enhancing...

arXiv - AI · 4 min ·
[2509.08535] Agents of Discovery
Machine Learning

[2509.08535] Agents of Discovery

The paper 'Agents of Discovery' explores the use of large language models (LLMs) as agents to automate data analysis in high energy physi...

arXiv - AI · 4 min ·
[2602.15724] Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation
Llms

[2602.15724] Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation

This paper presents a retrieval-augmented framework to enhance efficiency in Vision-and-Language Navigation (VLN) by leveraging large lan...

arXiv - AI · 4 min ·
[2508.14949] XAI-Driven Spectral Analysis of Cough Sounds for Respiratory Disease Characterization
Machine Learning

[2508.14949] XAI-Driven Spectral Analysis of Cough Sounds for Respiratory Disease Characterization

The paper presents an XAI-driven methodology for analyzing cough sounds to improve respiratory disease diagnosis, highlighting significan...

arXiv - Machine Learning · 3 min ·
[2602.15721] Lifelong Scalable Multi-Agent Realistic Testbed and A Comprehensive Study on Design Choices in Lifelong AGV Fleet Management Systems
Ai Agents

[2602.15721] Lifelong Scalable Multi-Agent Realistic Testbed and A Comprehensive Study on Design Choices in Lifelong AGV Fleet Management Systems

The paper presents LSMART, an open-source simulator for evaluating Multi-Agent Path Finding (MAPF) algorithms in Automated Guided Vehicle...

arXiv - AI · 4 min ·
[2507.16641] Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis
Machine Learning

[2507.16641] Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis

This article presents a novel reinforcement learning framework for synthesizing quantum circuits efficiently, addressing challenges in th...

arXiv - AI · 4 min ·
[2602.15708] Outer Diversity of Structured Domains
Machine Learning

[2602.15708] Outer Diversity of Structured Domains

The paper introduces the concept of outer diversity in ordinal preference domains, analyzing its implications for various structured doma...

arXiv - AI · 3 min ·
[2507.12202] Sparse Autoencoders for Sequential Recommendation Models: Interpretation and Flexible Control
Machine Learning

[2507.12202] Sparse Autoencoders for Sequential Recommendation Models: Interpretation and Flexible Control

This article presents a novel approach using Sparse Autoencoders (SAE) for enhancing the interpretability and control of sequential recom...

arXiv - AI · 4 min ·
[2506.19923] Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs
Llms

[2506.19923] Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs

The paper introduces Prover Agent, an AI framework that combines large language models with formal proof assistants to enhance automated ...

arXiv - Machine Learning · 4 min ·
[2505.22914] cadrille: Multi-modal CAD Reconstruction with Reinforcement Learning
Machine Learning

[2505.22914] cadrille: Multi-modal CAD Reconstruction with Reinforcement Learning

The paper presents 'cadrille', a multi-modal CAD reconstruction model utilizing reinforcement learning to process diverse input data, ach...

arXiv - Machine Learning · 4 min ·
[2602.15660] Bayesian Optimization for Design Parameters of 3D Image Data Analysis
Machine Learning

[2602.15660] Bayesian Optimization for Design Parameters of 3D Image Data Analysis

This paper presents a novel 3D data Analysis Optimization Pipeline that utilizes Bayesian Optimization to enhance segmentation and classi...

arXiv - AI · 4 min ·
Previous Page 121 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime