AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Infrastructure

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

How do people make sense of this? submitted by /u/stvlsn [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Ai Agents

Spent today at MIT's Open Agentic Web conference. Six things worth thinking about.

We're in the DNS era of agent infrastructure. Before agents can find and trust each other at scale, you need identity, attestation, reput...

Reddit - Artificial Intelligence · 1 min ·
AMD's GAIA Now Allows Building Custom AI Agents Via Chat, Becomes "True Desktop App"
Ai Agents

AMD's GAIA Now Allows Building Custom AI Agents Via Chat, Becomes "True Desktop App"

In addition to their efforts around the Lemonade SDK itself, AMD software engineers working on their AI initiatives continue to be invest...

AI Tools & Products · 4 min ·

All Content

[2602.08216] Thermodynamic Isomorphism of Transformers: A Lagrangian Approach to Attention Dynamics
Machine Learning

[2602.08216] Thermodynamic Isomorphism of Transformers: A Lagrangian Approach to Attention Dynamics

This paper presents a thermodynamic framework for analyzing Transformer attention dynamics, linking it to statistical mechanics through a...

arXiv - Machine Learning · 4 min ·
[2601.05383] Imitation Learning for Combinatorial Optimisation under Uncertainty
Machine Learning

[2601.05383] Imitation Learning for Combinatorial Optimisation under Uncertainty

This paper explores imitation learning for combinatorial optimization under uncertainty, introducing a taxonomy of expert types and a new...

arXiv - Machine Learning · 4 min ·
[2511.21537] Context-Specific Causal Graph Discovery with Unobserved Contexts: Non-Stationarity, Regimes and Spatio-Temporal Patterns
Ai Startups

[2511.21537] Context-Specific Causal Graph Discovery with Unobserved Contexts: Non-Stationarity, Regimes and Spatio-Temporal Patterns

This paper presents a framework for context-specific causal graph discovery that addresses non-stationarity and spatio-temporal patterns,...

arXiv - Machine Learning · 4 min ·
[2510.00602] Multi-Agent Stage-wise Conservative Linear Bandits
Ai Agents

[2510.00602] Multi-Agent Stage-wise Conservative Linear Bandits

This paper presents MA-SCLUCB, an algorithm for multi-agent linear bandit problems, focusing on balancing exploration and exploitation wh...

arXiv - Machine Learning · 4 min ·
[2509.14585] Online reinforcement learning via sparse Gaussian mixture model Q-functions
Machine Learning

[2509.14585] Online reinforcement learning via sparse Gaussian mixture model Q-functions

This paper presents an innovative online reinforcement learning framework using sparse Gaussian mixture model Q-functions, enhancing expl...

arXiv - Machine Learning · 3 min ·
[2508.14746] MissionHD: Hyperdimensional Refinement of Distribution-Deficient Reasoning Graphs for Video Anomaly Detection
Llms

[2508.14746] MissionHD: Hyperdimensional Refinement of Distribution-Deficient Reasoning Graphs for Video Anomaly Detection

The paper presents MissionHD, a novel approach for video anomaly detection using hyperdimensional refinement of reasoning graphs, address...

arXiv - Machine Learning · 3 min ·
[2508.11522] Finite-Width Neural Tangent Kernels from Feynman Diagrams
Machine Learning

[2508.11522] Finite-Width Neural Tangent Kernels from Feynman Diagrams

This article presents a novel approach to computing finite-width neural tangent kernels (NTKs) using Feynman diagrams, enhancing the unde...

arXiv - Machine Learning · 4 min ·
[2011.07687] DART: aDaptive Accept RejecT for non-linear top-K subset identification
Machine Learning

[2011.07687] DART: aDaptive Accept RejecT for non-linear top-K subset identification

The paper presents DART, a novel algorithm for non-linear top-K subset identification in bandit problems, achieving efficient performance...

arXiv - Machine Learning · 4 min ·
[2501.18138] B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning
Ai Agents

[2501.18138] B3C: A Minimalist Approach to Offline Multi-Agent Reinforcement Learning

The paper presents B3C, a novel approach to offline multi-agent reinforcement learning that addresses overestimation issues by integratin...

arXiv - Machine Learning · 4 min ·
[2409.00730] Generating Physical Dynamics under Priors
Machine Learning

[2409.00730] Generating Physical Dynamics under Priors

This article presents a novel framework for generating physically realistic dynamics in data-driven contexts by incorporating physical pr...

arXiv - Machine Learning · 4 min ·
[2602.13003] MASAR: Motion-Appearance Synergy Refinement for Joint Detection and Trajectory Forecasting
Machine Learning

[2602.13003] MASAR: Motion-Appearance Synergy Refinement for Joint Detection and Trajectory Forecasting

The paper presents MASAR, a novel framework for joint 3D detection and trajectory forecasting that enhances performance by integrating mo...

arXiv - Machine Learning · 3 min ·
[2602.12904] Nonparametric Contextual Online Bilateral Trade
Ai Infrastructure

[2602.12904] Nonparametric Contextual Online Bilateral Trade

This paper explores nonparametric contextual online bilateral trade, presenting an algorithm that optimizes trade pricing based on contex...

arXiv - Machine Learning · 3 min ·
[2602.12916] Reliable Thinking with Images
Llms

[2602.12916] Reliable Thinking with Images

The paper discusses 'Reliable Thinking with Images,' a method to enhance reasoning in Multi-modal Large Language Models (MLLMs) by addres...

arXiv - Machine Learning · 4 min ·
[2602.12903] Contextual Online Bilateral Trade
Ai Startups

[2602.12903] Contextual Online Bilateral Trade

This paper explores contextual online bilateral trade, focusing on how agents' valuations depend on context vectors. It presents algorith...

arXiv - Machine Learning · 4 min ·
[2602.12901] Blessings of Multiple Good Arms in Multi-Objective Linear Bandits
Machine Learning

[2602.12901] Blessings of Multiple Good Arms in Multi-Objective Linear Bandits

This paper explores the multi-objective linear bandit problem, revealing that multiple good arms can lead to implicit exploration, enhanc...

arXiv - Machine Learning · 3 min ·
[2602.12492] Composable Model-Free RL for Navigation with Input-Affine Systems
Machine Learning

[2602.12492] Composable Model-Free RL for Navigation with Input-Affine Systems

This paper presents a novel composable model-free reinforcement learning approach for navigation in dynamic environments, focusing on rea...

arXiv - Machine Learning · 3 min ·
[2602.12405] Self-Refining Vision Language Model for Robotic Failure Detection and Reasoning
Llms

[2602.12405] Self-Refining Vision Language Model for Robotic Failure Detection and Reasoning

The paper presents ARMOR, a self-refining vision language model designed for robotic failure detection and reasoning, achieving significa...

arXiv - Machine Learning · 4 min ·
[2602.12386] Provably Convergent Actor-Critic in Risk-averse MARL
Ai Agents

[2602.12386] Provably Convergent Actor-Critic in Risk-averse MARL

This paper presents a novel Actor-Critic algorithm for risk-averse Multi-Agent Reinforcement Learning (MARL), demonstrating global conver...

arXiv - Machine Learning · 3 min ·
[2602.13052] Quantization-Aware Collaborative Inference for Large Embodied AI Models
Machine Learning

[2602.13052] Quantization-Aware Collaborative Inference for Large Embodied AI Models

This paper explores quantization-aware collaborative inference for large embodied AI models, addressing challenges in resource-limited en...

arXiv - Machine Learning · 3 min ·
[2602.12753] Hierarchical Successor Representation for Robust Transfer
Machine Learning

[2602.12753] Hierarchical Successor Representation for Robust Transfer

The paper introduces the Hierarchical Successor Representation (HSR), addressing limitations of classical successor representation in dyn...

arXiv - Machine Learning · 3 min ·
Previous Page 148 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime