AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

[2603.10062] Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead
Llms

[2603.10062] Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead

Abstract page for arXiv paper 2603.10062: Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead

arXiv - AI · 3 min ·
[2601.19066] Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair
Ai Agents

[2601.19066] Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair

Abstract page for arXiv paper 2601.19066: Dynamic Cogeneration of Bug Reproduction Test in Agentic Program Repair

arXiv - AI · 4 min ·
[2510.16187] Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards
Machine Learning

[2510.16187] Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards

Abstract page for arXiv paper 2510.16187: Zero-Shot Coordination in Ad Hoc Teams with Generalized Policy Improvement and Difference Rewards

arXiv - AI · 4 min ·

All Content

[2510.00024] EpidemIQs: Prompt-to-Paper LLM Agents for Epidemic Modeling and Analysis
Llms

[2510.00024] EpidemIQs: Prompt-to-Paper LLM Agents for Epidemic Modeling and Analysis

The paper presents EpidemIQs, a multi-agent framework utilizing large language models for efficient epidemic modeling, demonstrating impr...

arXiv - AI · 4 min ·
[2509.24072] Uncovering Grounding IDs: How External Cues Shape Multimodal Binding
Llms

[2509.24072] Uncovering Grounding IDs: How External Cues Shape Multimodal Binding

This article explores the concept of Grounding IDs, which are latent identifiers that enhance multimodal binding in large vision-language...

arXiv - AI · 4 min ·
[2509.23744] Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning
Llms

[2509.23744] Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning

This article explores the foundational bottlenecks in multimodal reasoning, highlighting how additional modalities can enhance or hinder ...

arXiv - AI · 4 min ·
[2508.21112] EO-1: An Open Unified Embodied Foundation Model for General Robot Control
Llms

[2508.21112] EO-1: An Open Unified Embodied Foundation Model for General Robot Control

The EO-1 model is introduced as a unified foundation for general robot control, enhancing multimodal reasoning through a large dataset an...

arXiv - AI · 4 min ·
[2509.14537] ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference
Machine Learning

[2509.14537] ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference

The paper introduces ClearFairy, an AI assistant designed to enhance decision-making in creative workflows by structuring reasoning and i...

arXiv - AI · 3 min ·
[2507.17691] CASCADE: LLM-Powered JavaScript Deobfuscator at Google
Llms

[2507.17691] CASCADE: LLM-Powered JavaScript Deobfuscator at Google

The paper presents CASCADE, a novel LLM-powered JavaScript deobfuscator developed by Google, which enhances code comprehension and analys...

arXiv - Machine Learning · 3 min ·
[2506.07477] Premise Selection for a Lean Hammer
Machine Learning

[2506.07477] Premise Selection for a Lean Hammer

The paper presents LeanPremise, a neural premise selection system that enhances LeanHammer, a tool for automated reasoning in proof assis...

arXiv - Machine Learning · 4 min ·
[2506.05154] Resisting Contextual Interference in RAG via Parametric-Knowledge Reinforcement
Llms

[2506.05154] Resisting Contextual Interference in RAG via Parametric-Knowledge Reinforcement

The paper presents Knowledgeable-R1, a reinforcement-learning framework designed to enhance retrieval-augmented generation (RAG) by mitig...

arXiv - AI · 4 min ·
[2506.01085] Learning What Matters: Prioritized Concept Learning via Relative Error-driven Sample Selection
Llms

[2506.01085] Learning What Matters: Prioritized Concept Learning via Relative Error-driven Sample Selection

The paper presents PROGRESS, a framework for prioritized concept learning in vision-language models, enabling efficient sample selection ...

arXiv - AI · 4 min ·
[2503.01927] QCS-ADME: Quantum Circuit Search for Drug Property Prediction with Imbalanced Data and Regression Adaptation
Machine Learning

[2503.01927] QCS-ADME: Quantum Circuit Search for Drug Property Prediction with Imbalanced Data and Regression Adaptation

The paper presents QCS-ADME, a novel quantum circuit search framework for predicting drug properties, addressing challenges in imbalanced...

arXiv - Machine Learning · 4 min ·
[2411.06657] Renaissance: Investigating the Pretraining of Vision-Language Encoders
Machine Learning

[2411.06657] Renaissance: Investigating the Pretraining of Vision-Language Encoders

The paper 'Renaissance' explores the pretraining of vision-language encoders, addressing best practices and introducing a flexible evalua...

arXiv - Machine Learning · 4 min ·
[2401.12455] Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management
Machine Learning

[2401.12455] Multi-agent deep reinforcement learning with centralized training and decentralized execution for transportation infrastructure management

This article presents a novel framework for managing transportation infrastructure using multi-agent deep reinforcement learning, address...

arXiv - Machine Learning · 4 min ·
[2601.10402] Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering
Llms

[2601.10402] Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering

The paper discusses advancements in AI towards ultra-long-horizon autonomy, introducing ML-Master 2.0, which utilizes Hierarchical Cognit...

arXiv - AI · 4 min ·
[2602.12259] Think like a Scientist: Physics-guided LLM Agent for Equation Discovery
Llms

[2602.12259] Think like a Scientist: Physics-guided LLM Agent for Equation Discovery

The paper introduces KeplerAgent, a physics-guided LLM framework designed for symbolic equation discovery, enhancing accuracy and robustn...

arXiv - Machine Learning · 3 min ·
[2509.01350] Error Notebook-Guided, Training-Free Part Retrieval in 3D CAD Assemblies via Vision-Language Models
Llms

[2509.01350] Error Notebook-Guided, Training-Free Part Retrieval in 3D CAD Assemblies via Vision-Language Models

The paper presents a novel framework for part retrieval in 3D CAD assemblies using vision-language models, emphasizing training-free meth...

arXiv - AI · 4 min ·
[2508.07667] 1-2-3 Check: Enhancing Contextual Privacy in LLM via Multi-Agent Reasoning
Llms

[2508.07667] 1-2-3 Check: Enhancing Contextual Privacy in LLM via Multi-Agent Reasoning

The paper presents a multi-agent framework to enhance contextual privacy in large language models (LLMs), demonstrating a significant red...

arXiv - AI · 3 min ·
[2507.14899] InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis
Ai Agents

[2507.14899] InsightX Agent: An LMM-based Agentic Framework with Integrated Tools for Reliable X-ray NDT Analysis

The paper presents InsightX Agent, an LMM-based framework that enhances X-ray non-destructive testing (NDT) by improving reliability, int...

arXiv - AI · 4 min ·
[2506.13793] Med-REFL: Medical Reasoning Enhancement via Self-Corrected Fine-grained Reflection
Machine Learning

[2506.13793] Med-REFL: Medical Reasoning Enhancement via Self-Corrected Fine-grained Reflection

The paper presents Med-REFL, a framework designed to enhance medical reasoning in AI by enabling self-correction through fine-grained ref...

arXiv - AI · 4 min ·
[2506.10947] Spurious Rewards: Rethinking Training Signals in RLVR
Llms

[2506.10947] Spurious Rewards: Rethinking Training Signals in RLVR

The paper explores the impact of spurious rewards in reinforcement learning with verifiable rewards (RLVR), demonstrating how they can en...

arXiv - Machine Learning · 4 min ·
[2505.18502] Knowledge Fusion of Large Language Models Via Modular SkillPacks
Llms

[2505.18502] Knowledge Fusion of Large Language Models Via Modular SkillPacks

The paper presents GraftLLM, a novel method for knowledge fusion in large language models using modular SkillPacks, enhancing cross-capab...

arXiv - Machine Learning · 4 min ·
Previous Page 48 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime