AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Ai Infrastructure

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

How do people make sense of this? submitted by /u/stvlsn [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

Ai Agents

Spent today at MIT's Open Agentic Web conference. Six things worth thinking about.

We're in the DNS era of agent infrastructure. Before agents can find and trust each other at scale, you need identity, attestation, reput...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

Ai Agents

AMD's GAIA Now Allows Building Custom AI Agents Via Chat, Becomes "True Desktop App"

In addition to their efforts around the Lemonade SDK itself, AMD software engineers working on their AI initiatives continue to be invest...

AI Tools & Products · 4 min · about 5 hours ago

All Content

Machine Learning

[2602.08216] Thermodynamic Isomorphism of Transformers: A Lagrangian Approach to Attention Dynamics

This paper presents a thermodynamic framework for analyzing Transformer attention dynamics, linking it to statistical mechanics through a...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2601.05383] Imitation Learning for Combinatorial Optimisation under Uncertainty

This paper explores imitation learning for combinatorial optimization under uncertainty, introducing a taxonomy of expert types and a new...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Startups

[2511.21537] Context-Specific Causal Graph Discovery with Unobserved Contexts: Non-Stationarity, Regimes and Spatio-Temporal Patterns

This paper presents a framework for context-specific causal graph discovery that addresses non-stationarity and spatio-temporal patterns,...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Agents

[2510.00602] Multi-Agent Stage-wise Conservative Linear Bandits

This paper presents MA-SCLUCB, an algorithm for multi-agent linear bandit problems, focusing on balancing exploration and exploitation wh...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2509.14585] Online reinforcement learning via sparse Gaussian mixture model Q-functions

This paper presents an innovative online reinforcement learning framework using sparse Gaussian mixture model Q-functions, enhancing expl...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2508.14746] MissionHD: Hyperdimensional Refinement of Distribution-Deficient Reasoning Graphs for Video Anomaly Detection

The paper presents MissionHD, a novel approach for video anomaly detection using hyperdimensional refinement of reasoning graphs, address...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2508.11522] Finite-Width Neural Tangent Kernels from Feynman Diagrams

This article presents a novel approach to computing finite-width neural tangent kernels (NTKs) using Feynman diagrams, enhancing the unde...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2011.07687] DART: aDaptive Accept RejecT for non-linear top-K subset identification

The paper presents DART, a novel algorithm for non-linear top-K subset identification in bandit problems, achieving efficient performance...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Agents

[2501.18138] B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning

The paper presents B3C, a novel approach to offline multi-agent reinforcement learning that addresses overestimation issues by integratin...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2409.00730] Generating Physical Dynamics under Priors

This article presents a novel framework for generating physically realistic dynamics in data-driven contexts by incorporating physical pr...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.13003] MASAR: Motion-Appearance Synergy Refinement for Joint Detection and Trajectory Forecasting

The paper presents MASAR, a novel framework for joint 3D detection and trajectory forecasting that enhances performance by integrating mo...

arXiv - Machine Learning · 3 min · about 2 months ago

Ai Infrastructure

[2602.12904] Nonparametric Contextual Online Bilateral Trade

This paper explores nonparametric contextual online bilateral trade, presenting an algorithm that optimizes trade pricing based on contex...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.12916] Reliable Thinking with Images

The paper discusses 'Reliable Thinking with Images,' a method to enhance reasoning in Multi-modal Large Language Models (MLLMs) by addres...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Startups

[2602.12903] Contextual Online Bilateral Trade

This paper explores contextual online bilateral trade, focusing on how agents' valuations depend on context vectors. It presents algorith...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.12901] Blessings of Multiple Good Arms in Multi-Objective Linear Bandits

This paper explores the multi-objective linear bandit problem, revealing that multiple good arms can lead to implicit exploration, enhanc...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.12492] Composable Model-Free RL for Navigation with Input-Affine Systems

This paper presents a novel composable model-free reinforcement learning approach for navigation in dynamic environments, focusing on rea...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.12405] Self-Refining Vision Language Model for Robotic Failure Detection and Reasoning

The paper presents ARMOR, a self-refining vision language model designed for robotic failure detection and reasoning, achieving significa...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Agents

[2602.12386] Provably Convergent Actor-Critic in Risk-averse MARL

This paper presents a novel Actor-Critic algorithm for risk-averse Multi-Agent Reinforcement Learning (MARL), demonstrating global conver...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.13052] Quantization-Aware Collaborative Inference for Large Embodied AI Models

This paper explores quantization-aware collaborative inference for large embodied AI models, addressing challenges in resource-limited en...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.12753] Hierarchical Successor Representation for Robust Transfer

The paper introduces the Hierarchical Successor Representation (HSR), addressing limitations of classical successor representation in dyn...

arXiv - Machine Learning · 3 min · about 2 months ago

Previous Page 148 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

Spent today at MIT's Open Agentic Web conference. Six things worth thinking about.

AMD's GAIA Now Allows Building Custom AI Agents Via Chat, Becomes "True Desktop App"

All Content

[2602.08216] Thermodynamic Isomorphism of Transformers: A Lagrangian Approach to Attention Dynamics

[2601.05383] Imitation Learning for Combinatorial Optimisation under Uncertainty

[2511.21537] Context-Specific Causal Graph Discovery with Unobserved Contexts: Non-Stationarity, Regimes and Spatio-Temporal Patterns

[2510.00602] Multi-Agent Stage-wise Conservative Linear Bandits

[2509.14585] Online reinforcement learning via sparse Gaussian mixture model Q-functions

[2508.14746] MissionHD: Hyperdimensional Refinement of Distribution-Deficient Reasoning Graphs for Video Anomaly Detection

[2508.11522] Finite-Width Neural Tangent Kernels from Feynman Diagrams

[2011.07687] DART: aDaptive Accept RejecT for non-linear top-K subset identification

[2501.18138] B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning

[2409.00730] Generating Physical Dynamics under Priors

[2602.13003] MASAR: Motion-Appearance Synergy Refinement for Joint Detection and Trajectory Forecasting

[2602.12904] Nonparametric Contextual Online Bilateral Trade

[2602.12916] Reliable Thinking with Images

[2602.12903] Contextual Online Bilateral Trade

[2602.12901] Blessings of Multiple Good Arms in Multi-Objective Linear Bandits

[2602.12492] Composable Model-Free RL for Navigation with Input-Affine Systems

[2602.12405] Self-Refining Vision Language Model for Robotic Failure Detection and Reasoning

[2602.12386] Provably Convergent Actor-Critic in Risk-averse MARL

[2602.13052] Quantization-Aware Collaborative Inference for Large Embodied AI Models

[2602.12753] Hierarchical Successor Representation for Robust Transfer

Related Topics

Stay updated with AI News