AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

Microsoft's newest open-source project: Runtime security for AI agents

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
[2510.16609] Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods
Llms

[2510.16609] Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods

Abstract page for arXiv paper 2510.16609: Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods

arXiv - Machine Learning · 4 min ·
[2604.02131] Intelligent Cloud Orchestration: A Hybrid Predictive and Heuristic Framework for Cost Optimization
Machine Learning

[2604.02131] Intelligent Cloud Orchestration: A Hybrid Predictive and Heuristic Framework for Cost Optimization

Abstract page for arXiv paper 2604.02131: Intelligent Cloud Orchestration: A Hybrid Predictive and Heuristic Framework for Cost Optimization

arXiv - Machine Learning · 3 min ·

All Content

[2602.21706] SurGo-R1: Benchmarking and Modeling Contextual Reasoning for Operative Zone in Surgical Video
Machine Learning

[2602.21706] SurGo-R1: Benchmarking and Modeling Contextual Reasoning for Operative Zone in Surgical Video

The paper presents SurGo-R1, a model designed to enhance contextual reasoning in surgical video analysis, addressing challenges in identi...

arXiv - AI · 4 min ·
[2602.21670] Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning
Llms

[2602.21670] Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning

This article presents a novel hierarchical framework for multi-robot task planning using large language models (LLMs) with prompt optimiz...

arXiv - AI · 4 min ·
[2602.21657] Following the Diagnostic Trace: Visual Cognition-guided Cooperative Network for Chest X-Ray Diagnosis
Machine Learning

[2602.21657] Following the Diagnostic Trace: Visual Cognition-guided Cooperative Network for Chest X-Ray Diagnosis

The paper presents VCC-Net, a visual cognition-guided cooperative network aimed at enhancing chest X-ray diagnosis through improved human...

arXiv - AI · 4 min ·
[2602.21655] CCCaption: Dual-Reward Reinforcement Learning for Complete and Correct Image Captioning
Machine Learning

[2602.21655] CCCaption: Dual-Reward Reinforcement Learning for Complete and Correct Image Captioning

The paper introduces CCCaption, a dual-reward reinforcement learning framework designed to enhance image captioning by optimizing for com...

arXiv - AI · 4 min ·
[2602.21650] PPCR-IM: A System for Multi-layer DAG-based Public Policy Consequence Reasoning and Social Indicator Mapping
Llms

[2602.21650] PPCR-IM: A System for Multi-layer DAG-based Public Policy Consequence Reasoning and Social Indicator Mapping

The article presents PPCR-IM, a system for multi-layer DAG-based reasoning in public policy, enhancing the mapping of social indicators a...

arXiv - AI · 3 min ·
[2602.21633] Self-Correcting VLA: Online Action Refinement via Sparse World Imagination
Machine Learning

[2602.21633] Self-Correcting VLA: Online Action Refinement via Sparse World Imagination

The paper presents Self-Correcting VLA, a novel approach in robotics that enhances vision-language-action models by integrating sparse wo...

arXiv - AI · 4 min ·
[2602.21611] Structurally Aligned Subtask-Level Memory for Software Engineering Agents
Llms

[2602.21611] Structurally Aligned Subtask-Level Memory for Software Engineering Agents

The paper presents Structurally Aligned Subtask-Level Memory, a novel approach for enhancing software engineering agents by improving mem...

arXiv - AI · 3 min ·
[2602.21598] Retrieval Challenges in Low-Resource Public Service Information: A Case Study on Food Pantry Access
Nlp

[2602.21598] Retrieval Challenges in Low-Resource Public Service Information: A Case Study on Food Pantry Access

This article explores the challenges of retrieving public service information in low-resource environments, focusing on food pantry acces...

arXiv - AI · 3 min ·
[2602.21613] Virtual Biopsy for Intracranial Tumors Diagnosis on MRI
Ai Safety

[2602.21613] Virtual Biopsy for Intracranial Tumors Diagnosis on MRI

This article presents a novel Virtual Biopsy framework for diagnosing intracranial tumors using MRI, addressing the challenges of traditi...

arXiv - AI · 4 min ·
[2602.21553] Revisiting RAG Retrievers: An Information Theoretic Benchmark
Llms

[2602.21553] Revisiting RAG Retrievers: An Information Theoretic Benchmark

This paper presents MIGRASCOPE, a new framework for evaluating RAG retrievers, emphasizing the need for systematic benchmarks and metrics...

arXiv - Machine Learning · 4 min ·
[2602.21584] Exploring Human-Machine Coexistence in Symmetrical Reality
Ai Safety

[2602.21584] Exploring Human-Machine Coexistence in Symmetrical Reality

This paper explores the evolving relationship between humans and AI, proposing a framework for harmonious coexistence termed 'symmetrical...

arXiv - AI · 3 min ·
[2602.21551] From Basis to Basis: Gaussian Particle Representation for Interpretable PDE Operators
Machine Learning

[2602.21551] From Basis to Basis: Gaussian Particle Representation for Interpretable PDE Operators

This article presents a novel Gaussian Particle Representation for interpreting PDE operators, enhancing interpretability and efficiency ...

arXiv - Machine Learning · 3 min ·
[2602.21531] LiLo-VLA: Compositional Long-Horizon Manipulation via Linked Object-Centric Policies
Machine Learning

[2602.21531] LiLo-VLA: Compositional Long-Horizon Manipulation via Linked Object-Centric Policies

The paper introduces LiLo-VLA, a modular framework for long-horizon manipulation in robotics, enhancing performance through object-centri...

arXiv - Machine Learning · 4 min ·
[2602.21492] GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning
Llms

[2602.21492] GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning

The paper presents GradAlign, a novel method for selecting training data in reinforcement learning for large language models, enhancing p...

arXiv - Machine Learning · 3 min ·
[2602.21515] Training Generalizable Collaborative Agents via Strategic Risk Aversion
Machine Learning

[2602.21515] Training Generalizable Collaborative Agents via Strategic Risk Aversion

This paper explores training strategies for collaborative agents, emphasizing strategic risk aversion to enhance generalizability and rob...

arXiv - Machine Learning · 4 min ·
[2602.21456] Revisiting Text Ranking in Deep Research
Llms

[2602.21456] Revisiting Text Ranking in Deep Research

The paper 'Revisiting Text Ranking in Deep Research' explores the effectiveness of text ranking methods in deep research settings, focusi...

arXiv - AI · 4 min ·
[2602.21447] Adversarial Intent is a Latent Variable: Stateful Trust Inference for Securing Multimodal Agentic RAG
Machine Learning

[2602.21447] Adversarial Intent is a Latent Variable: Stateful Trust Inference for Securing Multimodal Agentic RAG

The paper presents a novel framework, MMA-RAG^T, for enhancing the security of multimodal agentic retrieval-augmented generation systems ...

arXiv - Machine Learning · 4 min ·
[2602.21442] MINAR: Mechanistic Interpretability for Neural Algorithmic Reasoning
Llms

[2602.21442] MINAR: Mechanistic Interpretability for Neural Algorithmic Reasoning

The paper introduces MINAR, a toolbox for mechanistic interpretability in neural algorithmic reasoning, enhancing understanding of GNNs' ...

arXiv - Machine Learning · 3 min ·
[2602.21421] ECHOSAT: Estimating Canopy Height Over Space And Time
Machine Learning

[2602.21421] ECHOSAT: Estimating Canopy Height Over Space And Time

ECHOSAT introduces a global tree height map that captures temporal forest dynamics, enhancing carbon monitoring and disturbance assessmen...

arXiv - Machine Learning · 4 min ·
[2602.21424] On the Structural Non-Preservation of Epistemic Behaviour under Policy Transformation
Machine Learning

[2602.21424] On the Structural Non-Preservation of Epistemic Behaviour under Policy Transformation

The paper explores how reinforcement learning agents' actions depend on internal information, revealing structural conditions affecting b...

arXiv - AI · 3 min ·
Previous Page 53 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime