AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

AMD's GAIA now allows building custom AI agents via chat, becomes "true desktop app"

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Llms

Claude code x n8n

Hi everyone, I’ve been exploring MCP and integrating tools like n8n with Claude Code, and I’m trying to understand how practical this rea...

Reddit - Artificial Intelligence · 1 min ·
Ai Agents

Cloudflare just turned Browser Rendering into a lot more powerful MCP infrastructure

Browser Rendering now exposes the Chrome DevTools Protocol, which means MCP clients can access a remote browser directly. That’s a pretty...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.14252] GRAIL: Goal Recognition Alignment through Imitation Learning
Nlp

[2602.14252] GRAIL: Goal Recognition Alignment through Imitation Learning

The paper introduces GRAIL, a method for recognizing agent goals through imitation learning, enhancing goal recognition accuracy in AI sy...

arXiv - Machine Learning · 3 min ·
[2602.13807] AnomaMind: Agentic Time Series Anomaly Detection with Tool-Augmented Reasoning
Ai Agents

[2602.13807] AnomaMind: Agentic Time Series Anomaly Detection with Tool-Augmented Reasoning

AnomaMind presents a novel framework for time series anomaly detection, enhancing traditional methods by incorporating tool-augmented rea...

arXiv - Machine Learning · 4 min ·
[2602.14234] REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents
Llms

[2602.14234] REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

The paper presents REDSearcher, a novel framework designed to optimize long-horizon search agents by addressing the challenges of task sy...

arXiv - AI · 4 min ·
[2602.14229] CORPGEN: Simulating Corporate Environments with Autonomous Digital Employees in Multi-Horizon Task Environments
Robotics

[2602.14229] CORPGEN: Simulating Corporate Environments with Autonomous Digital Employees in Multi-Horizon Task Environments

The paper introduces CORPGEN, a framework for simulating corporate environments using autonomous digital employees, addressing long-horiz...

arXiv - Machine Learning · 4 min ·
[2602.14225] Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding
Machine Learning

[2602.14225] Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding

This paper explores the significance of staged knowledge injection in enhancing agentic reinforcement learning for ultra-high-resolution ...

arXiv - AI · 4 min ·
[2602.13791] MechPert: Mechanistic Consensus as an Inductive Bias for Unseen Perturbation Prediction
Llms

[2602.13791] MechPert: Mechanistic Consensus as an Inductive Bias for Unseen Perturbation Prediction

The paper introduces MechPert, a framework that enhances unseen genetic perturbation prediction by leveraging mechanistic consensus among...

arXiv - AI · 3 min ·
[2602.14160] Process-Supervised Multi-Agent Reinforcement Learning for Reliable Clinical Reasoning
Llms

[2602.14160] Process-Supervised Multi-Agent Reinforcement Learning for Reliable Clinical Reasoning

This paper presents a novel multi-agent reinforcement learning framework aimed at enhancing clinical reasoning by ensuring process-ground...

arXiv - AI · 3 min ·
[2602.14130] Algebraic Quantum Intelligence: A New Framework for Reproducible Machine Creativity
Llms

[2602.14130] Algebraic Quantum Intelligence: A New Framework for Reproducible Machine Creativity

The paper introduces Algebraic Quantum Intelligence (AQI), a framework designed to enhance the creative capabilities of large language mo...

arXiv - Machine Learning · 4 min ·
[2602.14093] GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training
Machine Learning

[2602.14093] GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training

The paper presents GUI-GENESIS, a framework for automating the synthesis of efficient training environments for GUI agents, enhancing per...

arXiv - Machine Learning · 3 min ·
[2602.14083] Plan-MCTS: Plan Exploration for Action Exploitation in Web Navigation
Llms

[2602.14083] Plan-MCTS: Plan Exploration for Action Exploitation in Web Navigation

The article presents Plan-MCTS, a novel framework for enhancing web navigation through improved exploration and state perception, address...

arXiv - AI · 3 min ·
[2602.13706] Near-Optimal Regret for Policy Optimization in Contextual MDPs with General Offline Function Approximation
Machine Learning

[2602.13706] Near-Optimal Regret for Policy Optimization in Contextual MDPs with General Offline Function Approximation

This paper presents OPO-CMDP, a novel policy optimization algorithm for stochastic Contextual Markov Decision Processes (CMDPs) that achi...

arXiv - Machine Learning · 3 min ·
[2602.14035] FloCA: Towards Faithful and Logically Consistent Flowchart Reasoning
Llms

[2602.14035] FloCA: Towards Faithful and Logically Consistent Flowchart Reasoning

The paper introduces FloCA, a flowchart-oriented conversational agent designed to enhance decision-making in dialogue systems by ensuring...

arXiv - AI · 4 min ·
[2602.14038] Choosing How to Remember: Adaptive Memory Structures for LLM Agents
Llms

[2602.14038] Choosing How to Remember: Adaptive Memory Structures for LLM Agents

The paper presents FluxMem, a novel framework for adaptive memory structures in large language model (LLM) agents, addressing limitations...

arXiv - Machine Learning · 3 min ·
[2602.13700] Optimal Regret for Policy Optimization in Contextual Bandits
Machine Learning

[2602.13700] Optimal Regret for Policy Optimization in Contextual Bandits

This paper presents a novel algorithm achieving optimal regret bounds for policy optimization in stochastic contextual multi-armed bandit...

arXiv - Machine Learning · 3 min ·
[2602.14003] Prompt-Driven Low-Altitude Edge Intelligence: Modular Agents and Generative Reasoning
Machine Learning

[2602.14003] Prompt-Driven Low-Altitude Edge Intelligence: Modular Agents and Generative Reasoning

The paper presents a novel framework for low-altitude edge intelligence, addressing limitations of large AI models through a prompt-to-ag...

arXiv - AI · 4 min ·
[2602.13690] Physics Aware Neural Networks: Denoising for Magnetic Navigation
Machine Learning

[2602.13690] Physics Aware Neural Networks: Denoising for Magnetic Navigation

This paper presents a novel framework for denoising magnetic navigation data using physics-aware neural networks, addressing challenges i...

arXiv - Machine Learning · 4 min ·
[2602.13985] Bridging AI and Clinical Reasoning: Abductive Explanations for Alignment on Critical Symptoms
Machine Learning

[2602.13985] Bridging AI and Clinical Reasoning: Abductive Explanations for Alignment on Critical Symptoms

This article discusses the integration of AI in clinical diagnostics, focusing on the use of abductive explanations to enhance AI's align...

arXiv - AI · 3 min ·
[2602.13666] ALMo: Interactive Aim-Limit-Defined, Multi-Objective System for Personalized High-Dose-Rate Brachytherapy Treatment Planning and Visualization for Cervical Cancer
Data Science

[2602.13666] ALMo: Interactive Aim-Limit-Defined, Multi-Objective System for Personalized High-Dose-Rate Brachytherapy Treatment Planning and Visualization for Cervical Cancer

The article presents ALMo, an interactive system for personalized high-dose-rate brachytherapy treatment planning for cervical cancer, en...

arXiv - AI · 4 min ·
[2602.13935] Statistical Early Stopping for Reasoning Models
Llms

[2602.13935] Statistical Early Stopping for Reasoning Models

The paper presents statistical early stopping methods for reasoning models, addressing inefficiencies in large language models (LLMs) tha...

arXiv - Machine Learning · 3 min ·
[2602.13912] From Pixels to Policies: Reinforcing Spatial Reasoning in Language Models for Content-Aware Layout Design
Llms

[2602.13912] From Pixels to Policies: Reinforcing Spatial Reasoning in Language Models for Content-Aware Layout Design

The paper presents LaySPA, a reinforcement learning framework designed to enhance spatial reasoning in large language models for effectiv...

arXiv - AI · 3 min ·
Previous Page 143 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime