AI Agents
Autonomous agents, tool use, and agentic systems
Top This Week
Started a video series on building an orchestration layer for LLM post-training [P]
Hi everyone! Context, motivation, a lot of yapping, feel free to skip to TL;DR. A while back I posted here asking [D] What framework do y...
All Content
[2510.01510] Flock: A Knowledge Graph Foundation Model via Learning on Random Walks
The paper presents Flock, a knowledge graph foundation model that enhances zero-shot link prediction by employing probabilistic node-rela...
[2509.22626] Learning Admissible Heuristics for A*: Theory and Practice
This paper explores learning admissible heuristics for the A* search algorithm, introducing a new loss function that ensures admissibilit...
[2602.15286] AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service
The paper presents AI-Paging, a framework for optimizing AI-as-a-Service by enabling network providers to manage model selection and exec...
[2602.15281] High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration
This paper presents a framework for high-fidelity network management in Federated AI-as-a-Service, focusing on cross-domain orchestration...
[2504.20823] Hybrid quantum recurrent neural network for remaining useful life prediction
This article presents a Hybrid Quantum Recurrent Neural Network framework for predicting the remaining useful life of jet engines, showca...
[2503.00509] Functional multi-armed bandit and the best function identification problems
This article introduces the functional multi-armed bandit (FMAB) problem and the best function identification problem, proposing a new al...
[2602.15265] From Diagnosis to Inoculation: Building Cognitive Resistance to AI Disempowerment
This article discusses the need for cognitive resistance to AI disempowerment, proposing an AI literacy framework based on pedagogical in...
[2502.03576] Clone-Robust Weights in Metric Spaces: Handling Redundancy Bias for Benchmark Aggregation
This article presents a theoretical framework for clone-robust weighting functions in metric spaces, addressing redundancy bias in benchm...
[2502.00225] Should You Use Your Large Language Model to Explore or Exploit?
This article evaluates the effectiveness of large language models (LLMs) in addressing exploration-exploitation tradeoffs in decision-mak...
[2602.15245] MyoInteract: A Framework for Fast Prototyping of Biomechanical HCI Tasks using Reinforcement Learning
MyoInteract is a novel framework that simplifies the prototyping of biomechanical HCI tasks using reinforcement learning, significantly r...
[2411.18954] NeuroLifting: Neural Inference on Markov Random Fields at Scale
NeuroLifting introduces a novel approach for inference in large-scale Markov Random Fields (MRFs) using Graph Neural Networks, achieving ...
[2410.05225] ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control
The paper introduces ETGL-DDPG, a novel deep deterministic policy gradient algorithm designed to enhance exploration in reinforcement lea...
[2410.02605] Policy Gradients for Cumulative Prospect Theory in Reinforcement Learning
This paper presents a policy gradient theorem for Cumulative Prospect Theory (CPT) in reinforcement learning, introducing a new algorithm...
[2602.15198] Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems
The paper introduces Colosseum, a framework designed to audit collusion in cooperative multi-agent systems, highlighting the risks of age...
[2602.15197] OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction
The paper introduces OpaqueToolsBench, a benchmark for evaluating Large Language Model (LLM) agents' performance with opaque tools, propo...
[2602.15189] ScrapeGraphAI-100k: A Large-Scale Dataset for LLM-Based Web Information Extraction
ScrapeGraphAI-100k introduces a large-scale dataset for LLM-based web information extraction, addressing limitations of existing datasets...
[2406.07990] Topological quantification of ambiguity in semantic search
This article explores the topological quantification of ambiguity in semantic search, linking sentence-embedding neighborhoods to semanti...
[2406.03862] Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation
This paper explores behavior-targeted attacks on reinforcement learning systems and proposes a novel defense strategy using time-discount...
[2602.15139] CGRA-DeBERTa Concept Guided Residual Augmentation Transformer for Theologically Islamic Understanding
The paper presents CGRA-DeBERTa, a novel transformer model designed to enhance question-answering over classical Islamic texts by integra...
[2602.15827] Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching
The paper presents a framework for humanoid robots to perform dynamic parkour using motion matching and reinforcement learning, enabling ...