AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Nlp

Persistent memory MCP server for AI agents (MCP + REST)

Pluribus is a memory service for agents (MCP + HTTP, Postgres-backed) that stores structured memory: constraints, decisions, patterns, an...

Reddit - Artificial Intelligence · 1 min ·
Robotics

[D] Awesome AI Agent Incidents - A curated list of incidents, attack vectors, failure modes, and defensive tools for autonomous AI agents.

https://github.com/h5i-dev/awesome-ai-agent-incidents submitted by /u/Living_Impression_37 [link] [comments]

Reddit - Machine Learning · 1 min ·
Llms

we open sourced a tool that auto generates your AI agent context from your actual codebase, just hit 250 stars

hey everyone. been lurking here for a while and wanted to share something we been building. the problem: ai coding agents are only as goo...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.12125] Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation
Machine Learning

[2602.12125] Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

The paper presents Generalized On-Policy Distillation (G-OPD), an advancement in machine learning that enhances student model performance...

arXiv - Machine Learning · 4 min ·
[2508.01780] LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?
Llms

[2508.01780] LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

The paper presents LiveMCPBench, a benchmark designed to evaluate the capabilities of agents using Model Context Protocol (MCP) tools in ...

arXiv - AI · 4 min ·
[2505.19792] Types of Relations: Defining Analogies with Category Theory
Ai Agents

[2505.19792] Types of Relations: Defining Analogies with Category Theory

This paper explores the representation of knowledge through analogies using category theory, highlighting how features of domains can fac...

arXiv - AI · 3 min ·
[2602.01434] Phase Transitions for Feature Learning in Neural Networks
Machine Learning

[2602.01434] Phase Transitions for Feature Learning in Neural Networks

This paper explores phase transitions in neural networks, specifically focusing on feature learning dynamics in two-layer networks and es...

arXiv - Machine Learning · 4 min ·
[2505.04317] Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning
Robotics

[2505.04317] Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning

This paper presents a novel approach to multi-drone volleyball using a hierarchical reinforcement learning framework, achieving high perf...

arXiv - AI · 4 min ·
[2602.00299] Agentic Framework for Epidemiological Modeling
Machine Learning

[2602.00299] Agentic Framework for Epidemiological Modeling

The paper introduces EPIAGENT, an innovative agentic framework for epidemiological modeling that automates the synthesis, calibration, an...

arXiv - Machine Learning · 3 min ·
[2504.01445] Compositional-ARC: Assessing Systematic Generalization in Abstract Spatial Reasoning
Llms

[2504.01445] Compositional-ARC: Assessing Systematic Generalization in Abstract Spatial Reasoning

The paper introduces Compositional-ARC, a dataset for evaluating systematic generalization in abstract spatial reasoning, demonstrating t...

arXiv - AI · 4 min ·
[2601.18231] Rethinking Cross-Modal Fine-Tuning: Optimizing the Interaction between Feature Alignment and Target Fitting
Machine Learning

[2601.18231] Rethinking Cross-Modal Fine-Tuning: Optimizing the Interaction between Feature Alignment and Target Fitting

This paper presents a framework for optimizing cross-modal fine-tuning by addressing the interaction between feature alignment and target...

arXiv - Machine Learning · 4 min ·
[2412.17287] LLM4AD: A Platform for Algorithm Design with Large Language Model
Llms

[2412.17287] LLM4AD: A Platform for Algorithm Design with Large Language Model

LLM4AD introduces a unified Python platform for algorithm design using large language models, featuring modular components for various ta...

arXiv - AI · 3 min ·
[2602.23359] SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation
Machine Learning

[2602.23359] SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation

The paper introduces SeeThrough3D, a model for occlusion-aware 3D control in text-to-image generation, enhancing the realism of synthesiz...

arXiv - AI · 4 min ·
[2602.23335] Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset
Llms

[2602.23335] Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset

This paper presents the Asta Interaction Dataset, analyzing over 200,000 user queries from AI-powered research tools to understand user e...

arXiv - AI · 4 min ·
[2510.15464] Learning to Answer from Correct Demonstrations
Machine Learning

[2510.15464] Learning to Answer from Correct Demonstrations

The paper explores a novel approach to learning answer generation from correct demonstrations, formalizing it as imitation learning withi...

arXiv - Machine Learning · 4 min ·
[2602.23331] Utilizing LLMs for Industrial Process Automation
Llms

[2602.23331] Utilizing LLMs for Industrial Process Automation

This article explores the application of Large Language Models (LLMs) in industrial process automation, focusing on their potential to en...

arXiv - AI · 3 min ·
[2510.05725] Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies
Llms

[2510.05725] Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies

This article presents a novel approach to improving masked diffusion models (MDMs) for language modeling by introducing a learned schedul...

arXiv - Machine Learning · 4 min ·
[2509.21936] Statistical Advantage of Softmax Attention: Insights from Single-Location Regression
Llms

[2509.21936] Statistical Advantage of Softmax Attention: Insights from Single-Location Regression

This article explores the statistical advantages of softmax attention mechanisms in large language models, particularly in single-locatio...

arXiv - Machine Learning · 4 min ·
[2602.23286] SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables
Machine Learning

[2602.23286] SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables

The paper presents SPARTA, a novel framework for generating scalable benchmarks for tree-structured multi-hop question answering (QA) ove...

arXiv - AI · 4 min ·
[2602.23235] Spatio-Temporal Token Pruning for Efficient High-Resolution GUI Agents
Ai Safety

[2602.23235] Spatio-Temporal Token Pruning for Efficient High-Resolution GUI Agents

The paper presents GUIPruner, a framework for enhancing the efficiency of high-resolution GUI agents by addressing spatiotemporal redunda...

arXiv - AI · 4 min ·
[2506.15190] Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior
Computer Vision

[2506.15190] Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior

The paper presents a novel framework, Motif-based Continuous Dynamics (MCD), to model animal behavior by identifying continuous motor mot...

arXiv - Machine Learning · 4 min ·
[2602.23172] Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking
Robotics

[2602.23172] Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking

The paper presents Latent Gaussian Splatting (LaGS) for 4D panoptic occupancy tracking, enhancing robot perception in dynamic environment...

arXiv - AI · 3 min ·
[2505.24403] On the Lipschitz Continuity of Set Aggregation Functions and Neural Networks for Sets
Machine Learning

[2505.24403] On the Lipschitz Continuity of Set Aggregation Functions and Neural Networks for Sets

This paper explores the Lipschitz continuity of set aggregation functions and neural networks designed for set data, providing insights i...

arXiv - Machine Learning · 4 min ·
Previous Page 30 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime