AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[2603.10062] Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead

Abstract page for arXiv paper 2603.10062: Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead

arXiv - AI · 3 min · about 2 hours ago

Ai Agents

[2601.19066] Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair

Abstract page for arXiv paper 2601.19066: Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair

arXiv - AI · 4 min · about 2 hours ago

Machine Learning

[2510.16187] Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards

Abstract page for arXiv paper 2510.16187: Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards

arXiv - AI · 4 min · about 2 hours ago

All Content

Machine Learning

[2503.14499] Measuring AI Ability to Complete Long Software Tasks

The paper introduces a new metric to evaluate AI's ability to complete long software tasks, revealing significant advancements in AI capa...

arXiv - Machine Learning · 4 min · about 1 month ago

Ai Agents

[2408.05861] Temporal Knowledge-Graph Memory in a Partially Observable Environment

This paper introduces a novel temporal knowledge-graph memory system for agents operating in partially observable environments, enhancing...

arXiv - Machine Learning · 4 min · about 1 month ago

Ai Agents

[2407.20058] Shapley Value Computation in Ontology-Mediated Query Answering

This paper explores the application of the drastic Shapley value in ontology-mediated query answering, presenting a complexity analysis t...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.22190] GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

The paper presents GUI-Libra, a novel training approach for native GUI agents that enhances reasoning and action capabilities through act...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22124] SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents

The paper presents SWE-Protégé, a framework that enhances small language models (SLMs) for software engineering tasks by enabling selecti...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.22072] Understanding Artificial Theory of Mind: Perturbed Tasks and Reasoning in Large Language Models

This article explores the robustness of Theory of Mind (ToM) in large language models (LLMs) through perturbation tasks, revealing signif...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.22039] TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition

The paper presents TG-ASR, a translation-guided framework for improving automatic speech recognition in low-resource languages, specifica...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.22026] RGB-Event HyperGraph Prompt for Kilometer Marker Recognition based on Pre-trained Foundation Models

This article presents a novel approach to Kilometer Marker Recognition (KMR) using RGB-event cameras, enhancing visual perception for aut...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.21997] Enhancing LLM-Based Test Generation by Eliminating Covered Code

This paper presents a novel method for enhancing LLM-based unit test generation by eliminating covered code, addressing challenges in tes...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.21987] PatchDenoiser: Parameter-efficient multi-scale patch learning and fusion denoiser for medical images

PatchDenoiser introduces a lightweight, multi-scale denoising framework for medical images, effectively reducing noise while preserving a...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.21864] DynamicGTR: Leveraging Graph Topology Representation Preferences to Boost VLM Capabilities on Graph QAs

The paper presents DynamicGTR, a framework that enhances Vision-Language Models (VLMs) by dynamically selecting optimal graph topology re...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.21855] Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation

This article explores the challenges of annotation error propagation in endoscopic video segmentation, proposing a framework for optimizi...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.21819] SemVideo: Reconstructs What You Watch from Brain Activity via Hierarchical Semantic Guidance

The paper presents SemVideo, a novel framework that reconstructs videos from brain activity using hierarchical semantic guidance, address...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.21765] Generalisation of RLHF under Reward Shift and Clipped KL Regularisation

This paper explores the generalization of Reinforcement Learning from Human Feedback (RLHF) under conditions of reward shift and clipped ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.21772] UniWhisper: Efficient Continual Multi-task Training for Robust Universal Audio Representation

UniWhisper introduces an efficient framework for continual multi-task training, enhancing audio representation across diverse tasks, outp...

arXiv - AI · 3 min · about 1 month ago

Ai Safety

[2602.21720] Evaluating the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning

This article explores the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning, dem...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.21715] Two-Stage Active Distribution Network Voltage Control via LLM-RL Collaboration: A Hybrid Knowledge-Data-Driven Approach

This article presents a hybrid approach for voltage control in active distribution networks, combining large language models and reinforc...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.21706] SurGo-R1: Benchmarking and Modeling Contextual Reasoning for Operative Zone in Surgical Video

The paper presents SurGo-R1, a model designed to enhance contextual reasoning in surgical video analysis, addressing challenges in identi...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.21670] Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning

This article presents a novel hierarchical framework for multi-robot task planning using large language models (LLMs) with prompt optimiz...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.21657] Following the Diagnostic Trace: Visual Cognition-guided Cooperative Network for Chest X-Ray Diagnosis

The paper presents VCC-Net, a visual cognition-guided cooperative network aimed at enhancing chest X-ray diagnosis through improved human...

arXiv - AI · 4 min · about 1 month ago

Previous Page 49 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

[2603.10062] Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead

[2601.19066] Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair

[2510.16187] Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards

All Content

[2503.14499] Measuring AI Ability to Complete Long Software Tasks

[2408.05861] Temporal Knowledge-Graph Memory in a Partially Observable Environment

[2407.20058] Shapley Value Computation in Ontology-Mediated Query Answering

[2602.22190] GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

[2602.22124] SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents

[2602.22072] Understanding Artificial Theory of Mind: Perturbed Tasks and Reasoning in Large Language Models

[2602.22039] TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition

[2602.22026] RGB-Event HyperGraph Prompt for Kilometer Marker Recognition based on Pre-trained Foundation Models

[2602.21997] Enhancing LLM-Based Test Generation by Eliminating Covered Code

[2602.21987] PatchDenoiser: Parameter-efficient multi-scale patch learning and fusion denoiser for medical images

[2602.21864] DynamicGTR: Leveraging Graph Topology Representation Preferences to Boost VLM Capabilities on Graph QAs

[2602.21855] Understanding Annotation Error Propagation and Learning an Adaptive Policy for Expert Intervention in Barrett's Video Segmentation

[2602.21819] SemVideo: Reconstructs What You Watch from Brain Activity via Hierarchical Semantic Guidance

[2602.21765] Generalisation of RLHF under Reward Shift and Clipped KL Regularisation

[2602.21772] UniWhisper: Efficient Continual Multi-task Training for Robust Universal Audio Representation

[2602.21720] Evaluating the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning

[2602.21715] Two-Stage Active Distribution Network Voltage Control via LLM-RL Collaboration: A Hybrid Knowledge-Data-Driven Approach

[2602.21706] SurGo-R1: Benchmarking and Modeling Contextual Reasoning for Operative Zone in Surgical Video

[2602.21670] Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning

[2602.21657] Following the Diagnostic Trace: Visual Cognition-guided Cooperative Network for Chest X-Ray Diagnosis

Related Topics

Stay updated with AI News