AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

[2603.10062] Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead
LLMs

arXiv - AI · 3 min ·
[2601.19066] Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair
AI Agents

arXiv - AI · 4 min ·
[2510.16187] Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards
Machine Learning

arXiv - AI · 4 min ·

All Content

[2503.14499] Measuring AI Ability to Complete Long Software Tasks
Machine Learning

The paper introduces a new metric to evaluate AI's ability to complete long software tasks, revealing significant advancements in AI capa...

arXiv - Machine Learning · 4 min ·
[2408.05861] Temporal Knowledge-Graph Memory in a Partially Observable Environment
AI Agents

This paper introduces a novel temporal knowledge-graph memory system for agents operating in partially observable environments, enhancing...

arXiv - Machine Learning · 4 min ·
[2407.20058] Shapley Value Computation in Ontology-Mediated Query Answering
AI Agents

This paper explores the application of the drastic Shapley value in ontology-mediated query answering, presenting a complexity analysis t...

arXiv - AI · 4 min ·
[2602.22190] GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL
Machine Learning

The paper presents GUI-Libra, a novel training approach for native GUI agents that enhances reasoning and action capabilities through act...

arXiv - Machine Learning · 4 min ·
[2602.22124] SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents
LLMs

The paper presents SWE-Protégé, a framework that enhances small language models (SLMs) for software engineering tasks by enabling selecti...

arXiv - Machine Learning · 4 min ·
[2602.22072] Understanding Artificial Theory of Mind: Perturbed Tasks and Reasoning in Large Language Models
LLMs

This article explores the robustness of Theory of Mind (ToM) in large language models (LLMs) through perturbation tasks, revealing signif...

arXiv - AI · 3 min ·
[2602.22039] TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition
Machine Learning

The paper presents TG-ASR, a translation-guided framework for improving automatic speech recognition in low-resource languages, specifica...

arXiv - AI · 4 min ·
[2602.22026] RGB-Event HyperGraph Prompt for Kilometer Marker Recognition based on Pre-trained Foundation Models
LLMs

This article presents a novel approach to Kilometer Marker Recognition (KMR) using RGB-event cameras, enhancing visual perception for aut...

arXiv - AI · 3 min ·
[2602.21997] Enhancing LLM-Based Test Generation by Eliminating Covered Code
LLMs

This paper presents a novel method for enhancing LLM-based unit test generation by eliminating covered code, addressing challenges in tes...

arXiv - Machine Learning · 4 min ·
[2602.21987] PatchDenoiser: Parameter-efficient multi-scale patch learning and fusion denoiser for medical images
Machine Learning

PatchDenoiser introduces a lightweight, multi-scale denoising framework for medical images, effectively reducing noise while preserving a...

arXiv - AI · 4 min ·
[2602.21864] DynamicGTR: Leveraging Graph Topology Representation Preferences to Boost VLM Capabilities on Graph QAs
LLMs

The paper presents DynamicGTR, a framework that enhances Vision-Language Models (VLMs) by dynamically selecting optimal graph topology re...

arXiv - AI · 4 min ·
[2602.21855] Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation
Machine Learning

This article explores the challenges of annotation error propagation in endoscopic video segmentation, proposing a framework for optimizi...

arXiv - AI · 3 min ·
[2602.21819] SemVideo: Reconstructs What You Watch from Brain Activity via Hierarchical Semantic Guidance
Machine Learning

The paper presents SemVideo, a novel framework that reconstructs videos from brain activity using hierarchical semantic guidance, address...

arXiv - AI · 4 min ·
[2602.21765] Generalisation of RLHF under Reward Shift and Clipped KL Regularisation
LLMs

This paper explores the generalization of Reinforcement Learning from Human Feedback (RLHF) under conditions of reward shift and clipped ...

arXiv - Machine Learning · 4 min ·
[2602.21772] UniWhisper: Efficient Continual Multi-task Training for Robust Universal Audio Representation
Machine Learning

UniWhisper introduces an efficient framework for continual multi-task training, enhancing audio representation across diverse tasks, outp...

arXiv - AI · 3 min ·
[2602.21720] Evaluating the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning
AI Safety

This article explores the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning, dem...

arXiv - AI · 3 min ·
[2602.21715] Two-Stage Active Distribution Network Voltage Control via LLM-RL Collaboration: A Hybrid Knowledge-Data-Driven Approach
LLMs

This article presents a hybrid approach for voltage control in active distribution networks, combining large language models and reinforc...

arXiv - AI · 4 min ·
[2602.21706] SurGo-R1: Benchmarking and Modeling Contextual Reasoning for Operative Zone in Surgical Video
Machine Learning

The paper presents SurGo-R1, a model designed to enhance contextual reasoning in surgical video analysis, addressing challenges in identi...

arXiv - AI · 4 min ·
[2602.21670] Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning
LLMs

This article presents a novel hierarchical framework for multi-robot task planning using large language models (LLMs) with prompt optimiz...

arXiv - AI · 4 min ·
[2602.21657] Following the Diagnostic Trace: Visual Cognition-guided Cooperative Network for Chest X-Ray Diagnosis
Machine Learning

The paper presents VCC-Net, a visual cognition-guided cooperative network aimed at enhancing chest X-ray diagnosis through improved human...

arXiv - AI · 4 min ·