AI Agents
Autonomous agents, tool use, and agentic systems
Top This Week
Started a video series on building an orchestration layer for LLM post-training [P]
Hi everyone! Context, motivation, a lot of yapping, feel free to skip to TL;DR. A while back I posted here asking [D] What framework do y...
All Content
[2507.10134] FRSICL: LLM-Enabled In-Context Learning Flight Resource Allocation for Fresh Data Collection in UAV-Assisted Wildfire Monitoring
The paper presents FRSICL, a novel method utilizing LLMs for optimizing UAV data collection in wildfire monitoring, enhancing efficiency ...
[2602.10452] Distributed Online Convex Optimization with Nonseparable Costs and Constraints
This paper explores distributed online convex optimization with nonseparable costs and constraints, presenting a novel algorithm that imp...
[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
OpenAgentSafety introduces a modular framework for evaluating AI agent safety in real-world tasks, addressing critical vulnerabilities in...
[2602.02236] Online Fine-Tuning of Pretrained Controllers for Autonomous Driving via Real-Time Recurrent RL
The paper discusses the use of Real-Time Recurrent Reinforcement Learning (RTRRL) for fine-tuning pretrained controllers in autonomous dr...
[2602.01664] FlowSteer: Interactive Agentic Workflow Orchestration via End-to-End Reinforcement Learning
FlowSteer introduces an end-to-end reinforcement learning framework for automating workflow orchestration, addressing challenges like man...
[2602.15814] Avey-B
The paper 'Avey-B' presents a reformulated architecture for Avey, an autoregressive, attention-free model, demonstrating superior perform...
[2510.17886] Graphical model for factorization and completion of relatively high rank tensors by sparse sampling
This paper presents a graphical model for factorization and completion of high-rank tensors using sparse sampling, addressing challenges ...
[2510.04407] Scale-Invariant Regret Matching and Online Learning with Optimal Convergence: Bridging Theory and Practice in Zero-Sum Games
This paper presents a new scale-invariant variant of predictive regret matching, called IREG-PRM+, which bridges the gap between theoreti...
[2602.15767] Robot-Assisted Social Dining as a White Glove Service
The paper explores robot-assisted dining for individuals with disabilities, emphasizing the need for robots that can adapt to social dini...
[2602.15733] MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction
MeshMimic introduces a novel framework for humanoid motion learning by integrating 3D scene reconstruction with motion control, enhancing...
[2509.08535] Agents of Discovery
The paper 'Agents of Discovery' explores the use of large language models (LLMs) as agents to automate data analysis in high energy physi...
[2602.15724] Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation
This paper presents a retrieval-augmented framework to enhance efficiency in Vision-and-Language Navigation (VLN) by leveraging large lan...
[2508.14949] XAI-Driven Spectral Analysis of Cough Sounds for Respiratory Disease Characterization
The paper presents an XAI-driven methodology for analyzing cough sounds to improve respiratory disease diagnosis, highlighting significan...
[2602.15721] Lifelong Scalable Multi-Agent Realistic Testbed and A Comprehensive Study on Design Choices in Lifelong AGV Fleet Management Systems
The paper presents LSMART, an open-source simulator for evaluating Multi-Agent Path Finding (MAPF) algorithms in Automated Guided Vehicle...
[2507.16641] Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis
This article presents a novel reinforcement learning framework for synthesizing quantum circuits efficiently, addressing challenges in th...
[2602.15708] Outer Diversity of Structured Domains
The paper introduces the concept of outer diversity in ordinal preference domains, analyzing its implications for various structured doma...
[2507.12202] Sparse Autoencoders for Sequential Recommendation Models: Interpretation and Flexible Control
This article presents a novel approach using Sparse Autoencoders (SAE) for enhancing the interpretability and control of sequential recom...
[2506.19923] Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs
The paper introduces Prover Agent, an AI framework that combines large language models with formal proof assistants to enhance automated ...
[2505.22914] cadrille: Multi-modal CAD Reconstruction with Reinforcement Learning
The paper presents 'cadrille', a multi-modal CAD reconstruction model utilizing reinforcement learning to process diverse input data, ach...
[2602.15660] Bayesian Optimization for Design Parameters of 3D Image Data Analysis
This paper presents a novel 3D data Analysis Optimization Pipeline that utilizes Bayesian Optimization to enhance segmentation and classi...
Related Topics
Stay updated with AI News
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime