AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

AMD's GAIA now allows building custom AI agents via chat, becomes "true desktop app"

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Llms

Claude code x n8n

Hi everyone, I’ve been exploring MCP and integrating tools like n8n with Claude Code, and I’m trying to understand how practical this rea...

Reddit - Artificial Intelligence · 1 min ·
Ai Agents

Cloudflare just turned Browser Rendering into a lot more powerful MCP infrastructure

Browser Rendering now exposes the Chrome DevTools Protocol, which means MCP clients can access a remote browser directly. That’s a pretty...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.13873] Ambient Physics: Training Neural PDE Solvers with Partial Observations
Machine Learning

[2602.13873] Ambient Physics: Training Neural PDE Solvers with Partial Observations

The paper introduces 'Ambient Physics', a novel framework for training neural PDE solvers using partial observations, achieving significa...

arXiv - Machine Learning · 3 min ·
[2602.13855] From Fluent to Verifiable: Claim-Level Auditability for Deep Research Agents
Ai Agents

[2602.13855] From Fluent to Verifiable: Claim-Level Auditability for Deep Research Agents

The paper discusses the need for claim-level auditability in deep research agents, highlighting the shift from factual errors to weak cla...

arXiv - AI · 3 min ·
[2602.13865] Enabling Option Learning in Sparse Rewards with Hindsight Experience Replay
Machine Learning

[2602.13865] Enabling Option Learning in Sparse Rewards with Hindsight Experience Replay

This paper introduces MOC-HER and 2HER, methods that enhance Hierarchical Reinforcement Learning by improving performance in sparse rewar...

arXiv - Machine Learning · 4 min ·
[2602.13852] Experimentation Accelerator: Interpretable Insights and Creative Recommendations for A/B Testing with Content-Aware ranking
Nlp

[2602.13852] Experimentation Accelerator: Interpretable Insights and Creative Recommendations for A/B Testing with Content-Aware ranking

The paper presents the Experimentation Accelerator, a framework that enhances A/B testing by providing interpretable insights and creativ...

arXiv - AI · 4 min ·
[2602.13808] An end-to-end agentic pipeline for smart contract translation and quality evaluation
Llms

[2602.13808] An end-to-end agentic pipeline for smart contract translation and quality evaluation

This article presents a comprehensive framework for evaluating smart contracts generated from natural language specifications, focusing o...

arXiv - AI · 3 min ·
[2602.13769] OR-Agent: Bridging Evolutionary Search and Structured Research for Automated Algorithm Discovery
Machine Learning

[2602.13769] OR-Agent: Bridging Evolutionary Search and Structured Research for Automated Algorithm Discovery

The paper presents OR-Agent, a multi-agent framework designed to automate scientific discovery through structured hypothesis management a...

arXiv - AI · 4 min ·
[2602.13738] OneLatent: Single-Token Compression for Visual Latent Reasoning
Machine Learning

[2602.13738] OneLatent: Single-Token Compression for Visual Latent Reasoning

The paper introduces OneLatent, a framework that compresses reasoning in visual tasks into a single token, significantly reducing output ...

arXiv - AI · 3 min ·
[2602.13695] Can a Lightweight Automated AI Pipeline Solve Research-Level Mathematical Problems?
Llms

[2602.13695] Can a Lightweight Automated AI Pipeline Solve Research-Level Mathematical Problems?

This article explores the potential of a lightweight AI pipeline to solve complex mathematical problems, demonstrating its effectiveness ...

arXiv - AI · 4 min ·
[2602.13691] PhGPO: Pheromone-Guided Policy Optimization for Long-Horizon Tool Planning
Llms

[2602.13691] PhGPO: Pheromone-Guided Policy Optimization for Long-Horizon Tool Planning

The paper presents PhGPO, a novel approach for long-horizon tool planning that utilizes pheromone-guided policy optimization to enhance t...

arXiv - AI · 3 min ·
[2602.13665] HyFunc: Accelerating LLM-based Function Calls for Agentic AI through Hybrid-Model Cascade and Dynamic Templating
Llms

[2602.13665] HyFunc: Accelerating LLM-based Function Calls for Agentic AI through Hybrid-Model Cascade and Dynamic Templating

The paper presents HyFunc, a framework designed to enhance the efficiency of LLM-based function calls in agentic AI by reducing computati...

arXiv - AI · 4 min ·
[2602.13653] Building Autonomous GUI Navigation via Agentic-Q Estimation and Step-Wise Policy Optimization
Llms

[2602.13653] Building Autonomous GUI Navigation via Agentic-Q Estimation and Step-Wise Policy Optimization

The paper presents a novel framework for autonomous GUI navigation using Agentic-Q estimation and step-wise policy optimization, enhancin...

arXiv - AI · 4 min ·
[2602.13639] Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval
Llms

[2602.13639] Guided Collaboration in Heterogeneous LLM-Based Multi-Agent Systems via Entropy-Based Understanding Assessment and Experience Retrieval

The paper discusses a novel Entropy-Based Adaptive Guidance Framework for enhancing collaboration in heterogeneous multi-agent systems us...

arXiv - AI · 4 min ·
[2602.13616] DiffusionRollout: Uncertainty-Aware Rollout Planning in Long-Horizon PDE Solving
Machine Learning

[2602.13616] DiffusionRollout: Uncertainty-Aware Rollout Planning in Long-Horizon PDE Solving

The paper introduces DiffusionRollout, a strategy for improving long-horizon predictions in physical systems governed by PDEs by addressi...

arXiv - Machine Learning · 3 min ·
[2602.13594] Hippocampus: An Efficient and Scalable Memory Module for Agentic AI
Llms

[2602.13594] Hippocampus: An Efficient and Scalable Memory Module for Agentic AI

The paper introduces Hippocampus, a scalable memory module designed for agentic AI, enhancing retrieval speed and storage efficiency comp...

arXiv - AI · 3 min ·
[2602.13359] The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric
Machine Learning

[2602.13359] The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric

This article introduces the Speed-up Factor, a new performance metric for evaluating multi-iteration active learning methods, demonstrati...

arXiv - Machine Learning · 3 min ·
[2602.13587] A First Proof Sprint
Ai Agents

[2602.13587] A First Proof Sprint

This paper presents a multi-agent proof sprint addressing ten research-level problems, utilizing rapid draft generation and adversarial v...

arXiv - AI · 3 min ·
[2602.13583] Differentiable Rule Induction from Raw Sequence Inputs
Machine Learning

[2602.13583] Differentiable Rule Induction from Raw Sequence Inputs

This paper presents a novel approach to differentiable rule induction from raw sequence inputs, enhancing interpretability in machine lea...

arXiv - Machine Learning · 3 min ·
[2602.13559] OpAgent: Operator Agent for Web Navigation
Machine Learning

[2602.13559] OpAgent: Operator Agent for Web Navigation

The paper presents OpAgent, an innovative online reinforcement learning agent designed for effective web navigation, achieving a state-of...

arXiv - AI · 4 min ·
[2602.13516] SPILLage: Agentic Oversharing on the Web
Llms

[2602.13516] SPILLage: Agentic Oversharing on the Web

The paper introduces SPILLage, a framework addressing unintentional oversharing by web agents powered by LLMs, highlighting behavioral ov...

arXiv - AI · 4 min ·
[2602.13530] REMem: Reasoning with Episodic Memory in Language Agent
Ai Agents

[2602.13530] REMem: Reasoning with Episodic Memory in Language Agent

The paper presents REMem, a novel framework for enhancing language agents' episodic memory, enabling better recollection and reasoning ov...

arXiv - AI · 3 min ·
Previous Page 144 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime