AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Nlp

Persistent memory MCP server for AI agents (MCP + REST)

Pluribus is a memory service for agents (MCP + HTTP, Postgres-backed) that stores structured memory: constraints, decisions, patterns, an...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Robotics

[D] Awesome AI Agent Incidents - A curated list of incidents, attack vectors, failure modes, and defensive tools for autonomous AI agents.

https://github.com/h5i-dev/awesome-ai-agent-incidents submitted by /u/Living_Impression_37 [link] [comments]

Reddit - Machine Learning · 1 min · about 3 hours ago

Llms

we open sourced a tool that auto generates your AI agent context from your actual codebase, just hit 250 stars

hey everyone. been lurking here for a while and wanted to share something we been building. the problem: ai coding agents are only as goo...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

All Content

Machine Learning

[2602.12125] Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

The paper presents Generalized On-Policy Distillation (G-OPD), an advancement in machine learning that enhances student model performance...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2508.01780] LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

The paper presents LiveMCPBench, a benchmark designed to evaluate the capabilities of agents using Model Context Protocol (MCP) tools in ...

arXiv - AI · 4 min · about 1 month ago

Ai Agents

[2505.19792] Types of Relations: Defining Analogies with Category Theory

This paper explores the representation of knowledge through analogies using category theory, highlighting how features of domains can fac...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.01434] Phase Transitions for Feature Learning in Neural Networks

This paper explores phase transitions in neural networks, specifically focusing on feature learning dynamics in two-layer networks and es...

arXiv - Machine Learning · 4 min · about 1 month ago

Robotics

[2505.04317] Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning

This paper presents a novel approach to multi-drone volleyball using a hierarchical reinforcement learning framework, achieving high perf...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.00299] Agentic Framework for Epidemiological Modeling

The paper introduces EPIAGENT, an innovative agentic framework for epidemiological modeling that automates the synthesis, calibration, an...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2504.01445] Compositional-ARC: Assessing Systematic Generalization in Abstract Spatial Reasoning

The paper introduces Compositional-ARC, a dataset for evaluating systematic generalization in abstract spatial reasoning, demonstrating t...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2601.18231] Rethinking Cross-Modal Fine-Tuning: Optimizing the Interaction between Feature Alignment and Target Fitting

This paper presents a framework for optimizing cross-modal fine-tuning by addressing the interaction between feature alignment and target...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2412.17287] LLM4AD: A Platform for Algorithm Design with Large Language Model

LLM4AD introduces a unified Python platform for algorithm design using large language models, featuring modular components for various ta...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.23359] SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation

The paper introduces SeeThrough3D, a model for occlusion-aware 3D control in text-to-image generation, enhancing the realism of synthesiz...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.23335] Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset

This paper presents the Asta Interaction Dataset, analyzing over 200,000 user queries from AI-powered research tools to understand user e...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2510.15464] Learning to Answer from Correct Demonstrations

The paper explores a novel approach to learning answer generation from correct demonstrations, formalizing it as imitation learning withi...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.23331] Utilizing LLMs for Industrial Process Automation

This article explores the application of Large Language Models (LLMs) in industrial process automation, focusing on their potential to en...

arXiv - AI · 3 min · about 1 month ago

Llms

[2510.05725] Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies

This article presents a novel approach to improving masked diffusion models (MDMs) for language modeling by introducing a learned schedul...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.21936] Statistical Advantage of Softmax Attention: Insights from Single-Location Regression

This article explores the statistical advantages of softmax attention mechanisms in large language models, particularly in single-locatio...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23286] SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables

The paper presents SPARTA, a novel framework for generating scalable benchmarks for tree-structured multi-hop question answering (QA) ove...

arXiv - AI · 4 min · about 1 month ago

Ai Safety

[2602.23235] Spatio-Temporal Token Pruning for Efficient High-Resolution GUI Agents

The paper presents GUIPruner, a framework for enhancing the efficiency of high-resolution GUI agents by addressing spatiotemporal redunda...

arXiv - AI · 4 min · about 1 month ago

Computer Vision

[2506.15190] Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior

The paper presents a novel framework, Motif-based Continuous Dynamics (MCD), to model animal behavior by identifying continuous motor mot...

arXiv - Machine Learning · 4 min · about 1 month ago

Robotics

[2602.23172] Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking

The paper presents Latent Gaussian Splatting (LaGS) for 4D panoptic occupancy tracking, enhancing robot perception in dynamic environment...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2505.24403] On the Lipschitz Continuity of Set Aggregation Functions and Neural Networks for Sets

This paper explores the Lipschitz continuity of set aggregation functions and neural networks designed for set data, providing insights i...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 30 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

Persistent memory MCP server for AI agents (MCP + REST)

[D] Awesome AI Agent Incidents - A curated list of incidents, attack vectors, failure modes, and defensive tools for autonomous AI agents.

we open sourced a tool that auto generates your AI agent context from your actual codebase, just hit 250 stars

All Content

[2602.12125] Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

[2508.01780] LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?

[2505.19792] Types of Relations: Defining Analogies with Category Theory

[2602.01434] Phase Transitions for Feature Learning in Neural Networks

[2505.04317] Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning

[2602.00299] Agentic Framework for Epidemiological Modeling

[2504.01445] Compositional-ARC: Assessing Systematic Generalization in Abstract Spatial Reasoning

[2601.18231] Rethinking Cross-Modal Fine-Tuning: Optimizing the Interaction between Feature Alignment and Target Fitting

[2412.17287] LLM4AD: A Platform for Algorithm Design with Large Language Model

[2602.23359] SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation

[2602.23335] Understanding Usage and Engagement in AI-Powered Scientific Research Tools: The Asta Interaction Dataset

[2510.15464] Learning to Answer from Correct Demonstrations

[2602.23331] Utilizing LLMs for Industrial Process Automation

[2510.05725] Improving Discrete Diffusion Unmasking Policies Beyond Explicit Reference Policies

[2509.21936] Statistical Advantage of Softmax Attention: Insights from Single-Location Regression

[2602.23286] SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables

[2602.23235] Spatio-Temporal Token Pruning for Efficient High-Resolution GUI Agents

[2506.15190] Learning Task-Agnostic Motifs to Capture the Continuous Nature of Animal Behavior

[2602.23172] Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking

[2505.24403] On the Lipschitz Continuity of Set Aggregation Functions and Neural Networks for Sets

Related Topics

Stay updated with AI News