AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

Cloudflare just turned Browser Rendering into a lot more powerful MCP infrastructure

Browser Rendering now exposes the Chrome DevTools Protocol, which means MCP clients can access a remote browser directly. That’s a pretty...

Reddit - Artificial Intelligence · 1 min ·
Llms

Anthropic launches Claude Managed Agents — composable APIs for shipping production AI agents 10x faster. Notion, Rakuten, Asana, and Sentry already in production.

Anthropic launches Claude Managed Agents in public beta — composable APIs for shipping production AI agents 10x faster Handles sandboxing...

Reddit - Artificial Intelligence · 1 min ·
Llms

persistent memory system for AI agents — single SQLite file, no external server, no API keys. free and opensource - BrainCTL

Every agent I build forgets everything between sessions. I got tired of it and built brainctl. pip install brainctl, then: from agentmemo...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.13363] Assessing Spear-Phishing Website Generation in Large Language Model Coding Agents
Llms

[2602.13363] Assessing Spear-Phishing Website Generation in Large Language Model Coding Agents

This article evaluates the capabilities of large language models (LLMs) in generating spear-phishing websites, highlighting the potential...

arXiv - AI · 4 min ·
[2602.14761] Universal Algorithm-Implicit Learning
Machine Learning

[2602.14761] Universal Algorithm-Implicit Learning

The paper presents a theoretical framework for meta-learning, introducing the concept of algorithm-implicit learning through a new model ...

arXiv - AI · 3 min ·
[2602.14759] Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training
Machine Learning

[2602.14759] Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training

The paper presents 'Inner Loop Inference,' a method for enhancing pretrained Transformers by iteratively refining outputs during inferenc...

arXiv - Machine Learning · 4 min ·
[2602.13351] A Formal Framework for the Explanation of Finite Automata Decisions
Machine Learning

[2602.13351] A Formal Framework for the Explanation of Finite Automata Decisions

This paper presents a formal framework for explaining the decisions made by finite automata (FA), focusing on minimal input character set...

arXiv - AI · 4 min ·
[2602.13346] CellMaster: Collaborative Cell Type Annotation in Single-Cell Analysis
Llms

[2602.13346] CellMaster: Collaborative Cell Type Annotation in Single-Cell Analysis

CellMaster introduces an AI-driven approach for zero-shot cell-type annotation in single-cell RNA sequencing, improving accuracy signific...

arXiv - AI · 3 min ·
[2602.13347] Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots
Machine Learning

[2602.13347] Visual Foresight for Robotic Stow: A Diffusion-Based World Model from Sparse Snapshots

The paper presents FOREST, a diffusion-based world model for robotic stow operations, enhancing the prediction of post-stow configuration...

arXiv - AI · 3 min ·
[2602.14626] Concepts' Information Bottleneck Models
Machine Learning

[2602.14626] Concepts' Information Bottleneck Models

This article presents the Concepts' Information Bottleneck Models, which enhance the interpretability of predictions in machine learning ...

arXiv - Machine Learning · 3 min ·
[2602.13332] MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling
Llms

[2602.13332] MedScope: Incentivizing "Think with Videos" for Clinical Reasoning via Coarse-to-Fine Tool Calling

The paper presents MedScope, a clinical video reasoning model that enhances decision-making in medical contexts by integrating tool use a...

arXiv - AI · 4 min ·
[2602.13329] HiST-VLA: A Hierarchical Spatio-Temporal Vision-Language-Action Model for End-to-End Autonomous Driving
Machine Learning

[2602.13329] HiST-VLA: A Hierarchical Spatio-Temporal Vision-Language-Action Model for End-to-End Autonomous Driving

The HiST-VLA model enhances autonomous driving by integrating vision, language, and action through improved spatio-temporal reasoning and...

arXiv - AI · 3 min ·
[2602.14587] Decoupled Continuous-Time Reinforcement Learning via Hamiltonian Flow
Llms

[2602.14587] Decoupled Continuous-Time Reinforcement Learning via Hamiltonian Flow

This paper presents a novel decoupled continuous-time reinforcement learning algorithm using Hamiltonian flow, addressing challenges in s...

arXiv - AI · 4 min ·
[2602.14580] Replicable Constrained Bandits
Machine Learning

[2602.14580] Replicable Constrained Bandits

The paper discusses replicable online learning algorithms in constrained multi-armed bandit (MAB) problems, demonstrating that replicabil...

arXiv - Machine Learning · 3 min ·
[2602.14578] RNM-TD3: N:M Semi-structured Sparse Reinforcement Learning From Scratch
Machine Learning

[2602.14578] RNM-TD3: N:M Semi-structured Sparse Reinforcement Learning From Scratch

The paper presents RNM-TD3, a novel approach to reinforcement learning that employs N:M structured sparsity, enhancing performance while ...

arXiv - Machine Learning · 3 min ·
[2602.14559] Fluid-Agent Reinforcement Learning
Ai Agents

[2602.14559] Fluid-Agent Reinforcement Learning

The paper introduces a novel framework for multi-agent reinforcement learning (MARL) that allows agents to create other agents, termed fl...

arXiv - AI · 3 min ·
[2602.13314] Sim2Radar: Toward Bridging the Radar Sim-to-Real Gap with VLM-Guided Scene Reconstruction
Machine Learning

[2602.13314] Sim2Radar: Toward Bridging the Radar Sim-to-Real Gap with VLM-Guided Scene Reconstruction

The paper presents Sim2Radar, a framework that generates synthetic radar data from RGB images, addressing the challenges of limited radar...

arXiv - AI · 3 min ·
[2602.13313] Agentic Spatio-Temporal Grounding via Collaborative Reasoning
Ai Agents

[2602.13313] Agentic Spatio-Temporal Grounding via Collaborative Reasoning

The paper presents the Agentic Spatio-Temporal Grounder (ASTG), a novel framework for Spatio-Temporal Video Grounding (STVG) that enhance...

arXiv - AI · 3 min ·
[2602.14543] Truly Adapting to Adversarial Constraints in Constrained MABs
Machine Learning

[2602.14543] Truly Adapting to Adversarial Constraints in Constrained MABs

This paper presents algorithms for the constrained multi-armed bandit (MAB) problem, addressing both stochastic and adversarial constrain...

arXiv - Machine Learning · 4 min ·
[2602.13312] PeroMAS: A Multi-agent System of Perovskite Material Discovery
Machine Learning

[2602.13312] PeroMAS: A Multi-agent System of Perovskite Material Discovery

PeroMAS introduces a multi-agent system for discovering perovskite materials, enhancing efficiency in photovoltaic research through a com...

arXiv - AI · 4 min ·
[2602.13310] Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension
Llms

[2602.13310] Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension

The paper introduces Visual Para-Thinker, a novel framework for parallel reasoning in visual comprehension, addressing limitations in exi...

arXiv - AI · 3 min ·
[2602.14506] Covariance-Aware Transformers for Quadratic Programming and Decision Making
Machine Learning

[2602.14506] Covariance-Aware Transformers for Quadratic Programming and Decision Making

This paper introduces Covariance-Aware Transformers, a novel approach for solving quadratic programming (QP) problems, enhancing decision...

arXiv - Machine Learning · 4 min ·
[2602.13309] Adaptive Value Decomposition: Coordinating a Varying Number of Agents in Urban Systems
Ai Agents

[2602.13309] Adaptive Value Decomposition: Coordinating a Varying Number of Agents in Urban Systems

The paper presents Adaptive Value Decomposition (AVD), a framework for coordinating multi-agent systems in urban environments, addressing...

arXiv - AI · 4 min ·
Previous Page 139 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime