AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Nlp

AWS turns its S3 storage service into a file system for AI agents

AI News - General · about 5 hours ago

Llms

Moody’s Integrates AI Agents With Anthropic’s Claude

AI Tools & Products · 4 min · about 6 hours ago

Llms

Started a video series on building an orchestration layer for LLM post-training [P]

Hi everyone! Context, motivation, a lot of yapping, feel free to skip to TL;DR. A while back I posted here asking [D] What framework do y...

Reddit - Machine Learning · 1 min · about 11 hours ago

All Content

Llms

[2510.01510] Flock: A Knowledge Graph Foundation Model via Learning on Random Walks

The paper presents Flock, a knowledge graph foundation model that enhances zero-shot link prediction by employing probabilistic node-rela...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2509.22626] Learning Admissible Heuristics for A*: Theory and Practice

This paper explores learning admissible heuristics for the A* search algorithm, introducing a new loss function that ensures admissibilit...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15286] AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service

The paper presents AI-Paging, a framework for optimizing AI-as-a-Service by enabling network providers to manage model selection and exec...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15281] High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration

This paper presents a framework for high-fidelity network management in Federated AI-as-a-Service, focusing on cross-domain orchestration...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2504.20823] Hybrid quantum recurrent neural network for remaining useful life prediction

This article presents a Hybrid Quantum Recurrent Neural Network framework for predicting the remaining useful life of jet engines, showca...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2503.00509] Functional multi-armed bandit and the best function identification problems

This article introduces the functional multi-armed bandit (FMAB) problem and the best function identification problem, proposing a new al...

arXiv - AI · 4 min · about 2 months ago

Ai Safety

[2602.15265] From Diagnosis to Inoculation: Building Cognitive Resistance to AI Disempowerment

This article discusses the need for cognitive resistance to AI disempowerment, proposing an AI literacy framework based on pedagogical in...

arXiv - AI · 4 min · about 2 months ago

Robotics

[2502.03576] Clone-Robust Weights in Metric Spaces: Handling Redundancy Bias for Benchmark Aggregation

This article presents a theoretical framework for clone-robust weighting functions in metric spaces, addressing redundancy bias in benchm...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2502.00225] Should You Use Your Large Language Model to Explore or Exploit?

This article evaluates the effectiveness of large language models (LLMs) in addressing exploration-exploitation tradeoffs in decision-mak...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15245] MyoInteract: A Framework for Fast Prototyping of Biomechanical HCI Tasks using Reinforcement Learning

MyoInteract is a novel framework that simplifies the prototyping of biomechanical HCI tasks using reinforcement learning, significantly r...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2411.18954] NeuroLifting: Neural Inference on Markov Random Fields at Scale

NeuroLifting introduces a novel approach for inference in large-scale Markov Random Fields (MRFs) using Graph Neural Networks, achieving ...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2410.05225] ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control

The paper introduces ETGL-DDPG, a novel deep deterministic policy gradient algorithm designed to enhance exploration in reinforcement lea...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2410.02605] Policy Gradients for Cumulative Prospect Theory in Reinforcement Learning

This paper presents a policy gradient theorem for Cumulative Prospect Theory (CPT) in reinforcement learning, introducing a new algorithm...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.15198] Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems

The paper introduces Colosseum, a framework designed to audit collusion in cooperative multi-agent systems, highlighting the risks of age...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.15197] OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction

The paper introduces OpaqueToolsBench, a benchmark for evaluating Large Language Model (LLM) agents' performance with opaque tools, propo...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.15189] ScrapeGraphAI-100k: A Large-Scale Dataset for LLM-Based Web Information Extraction

ScrapeGraphAI-100k introduces a large-scale dataset for LLM-based web information extraction, addressing limitations of existing datasets...

arXiv - AI · 3 min · about 2 months ago

Nlp

[2406.07990] Topological quantification of ambiguity in semantic search

This article explores the topological quantification of ambiguity in semantic search, linking sentence-embedding neighborhoods to semanti...

arXiv - AI · 4 min · about 2 months ago

Robotics

[2406.03862] Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation

This paper explores behavior-targeted attacks on reinforcement learning systems and proposes a novel defense strategy using time-discount...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.15139] CGRA-DeBERTa Concept Guided Residual Augmentation Transformer for Theologically Islamic Understanding

The paper presents CGRA-DeBERTa, a novel transformer model designed to enhance question-answering over classical Islamic texts by integra...

arXiv - AI · 4 min · about 2 months ago

Robotics

[2602.15827] Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

The paper presents a framework for humanoid robots to perform dynamic parkour using motion matching and reinforcement learning, enabling ...

arXiv - AI · 4 min · about 2 months ago

Previous Page 123 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

AWS turns its S3 storage service into a file system for AI agents

Moody’s Integrates AI Agents With Anthropic’s Claude

Started a video series on building an orchestration layer for LLM post-training [P]

All Content

[2510.01510] Flock: A Knowledge Graph Foundation Model via Learning on Random Walks

[2509.22626] Learning Admissible Heuristics for A*: Theory and Practice

[2602.15286] AI-Paging: Lease-Based Execution Anchoring for Network-Exposed AI-as-a-Service

[2602.15281] High-Fidelity Network Management for Federated AI-as-a-Service: Cross-Domain Orchestration

[2504.20823] Hybrid quantum recurrent neural network for remaining useful life prediction

[2503.00509] Functional multi-armed bandit and the best function identification problems

[2602.15265] From Diagnosis to Inoculation: Building Cognitive Resistance to AI Disempowerment

[2502.03576] Clone-Robust Weights in Metric Spaces: Handling Redundancy Bias for Benchmark Aggregation

[2502.00225] Should You Use Your Large Language Model to Explore or Exploit?

[2602.15245] MyoInteract: A Framework for Fast Prototyping of Biomechanical HCI Tasks using Reinforcement Learning

[2411.18954] NeuroLifting: Neural Inference on Markov Random Fields at Scale

[2410.05225] ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control

[2410.02605] Policy Gradients for Cumulative Prospect Theory in Reinforcement Learning

[2602.15198] Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems

[2602.15197] OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction

[2602.15189] ScrapeGraphAI-100k: A Large-Scale Dataset for LLM-Based Web Information Extraction

[2406.07990] Topological quantification of ambiguity in semantic search

[2406.03862] Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation

[2602.15139] CGRA-DeBERTa Concept Guided Residual Augmentation Transformer for Theologically Islamic Understanding

[2602.15827] Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching

Related Topics

Stay updated with AI News