AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Machine Learning

Why does Multi-Agent RL fail to act like a real society in Spatial Game Theory? [P] [R]

Hey everyone, I’m building a project for my university Machine Learning course called "Social network analysis using iterated game theory...

Reddit - Machine Learning · 1 min ·
AWS turns its S3 storage service into a file system for AI agents
Nlp

AWS turns its S3 storage service into a file system for AI agents

AI News - General ·
Moody’s Integrates AI Agents With Anthropic’s Claude
Llms

Moody’s Integrates AI Agents With Anthropic’s Claude

AI Tools & Products · 4 min ·

All Content

[2602.15112] ResearchGym: Evaluating Language Model Agents on Real-World AI Research
Llms

[2602.15112] ResearchGym: Evaluating Language Model Agents on Real-World AI Research

ResearchGym introduces a benchmark for evaluating AI agents in real-world research scenarios, revealing significant performance gaps and ...

arXiv - AI · 4 min ·
[2602.15817] Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning
Machine Learning

[2602.15817] Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning

This article presents a novel approach using Feasibility-Guided Exploration (FGE) to address parameter-robust avoid problems in reinforce...

arXiv - Machine Learning · 4 min ·
[2602.15763] GLM-5: from Vibe Coding to Agentic Engineering
Llms

[2602.15763] GLM-5: from Vibe Coding to Agentic Engineering

GLM-5 introduces a next-generation foundation model that enhances coding capabilities through agentic engineering, reducing costs while i...

arXiv - Machine Learning · 5 min ·
[2602.15740] MRC-GAT: A Meta-Relational Copula-Based Graph Attention Network for Interpretable Multimodal Alzheimer's Disease Diagnosis
Machine Learning

[2602.15740] MRC-GAT: A Meta-Relational Copula-Based Graph Attention Network for Interpretable Multimodal Alzheimer's Disease Diagnosis

The paper presents the MRC-GAT, a novel Meta-Relational Copula-Based Graph Attention Network designed for accurate and interpretable Alzh...

arXiv - AI · 4 min ·
[2602.15704] Controlled oscillation modeling using port-Hamiltonian neural networks
Machine Learning

[2602.15704] Controlled oscillation modeling using port-Hamiltonian neural networks

This paper presents a novel approach to modeling controlled oscillations using port-Hamiltonian neural networks, emphasizing a second-ord...

arXiv - Machine Learning · 4 min ·
[2602.15676] Relative Geometry of Neural Forecasters: Linking Accuracy and Alignment in Learned Latent Geometry
Machine Learning

[2602.15676] Relative Geometry of Neural Forecasters: Linking Accuracy and Alignment in Learned Latent Geometry

This paper explores how neural networks represent latent geometry in forecasting complex dynamical systems, linking model alignment with ...

arXiv - AI · 3 min ·
[2602.15649] Continuous-Time Piecewise-Linear Recurrent Neural Networks
Machine Learning

[2602.15649] Continuous-Time Piecewise-Linear Recurrent Neural Networks

This article presents Continuous-Time Piecewise-Linear Recurrent Neural Networks (cPLRNNs), a novel approach to modeling dynamical system...

arXiv - Machine Learning · 4 min ·
[2602.15634] Beyond ReLU: Bifurcation, Oversmoothing, and Topological Priors
Machine Learning

[2602.15634] Beyond ReLU: Bifurcation, Oversmoothing, and Topological Priors

This paper explores the limitations of Graph Neural Networks (GNNs) due to oversmoothing and proposes a novel approach using bifurcation ...

arXiv - Machine Learning · 3 min ·
[2602.15595] Multi-Objective Coverage via Constraint Active Search
Nlp

[2602.15595] Multi-Objective Coverage via Constraint Active Search

This paper introduces a novel algorithm, MOC-CAS, for solving the multi-objective coverage problem, enhancing efficiency in applications ...

arXiv - Machine Learning · 4 min ·
[2602.15593] A unified theory of feature learning in RNNs and DNNs
Machine Learning

[2602.15593] A unified theory of feature learning in RNNs and DNNs

This paper presents a unified theory of feature learning in recurrent neural networks (RNNs) and deep neural networks (DNNs), highlightin...

arXiv - Machine Learning · 4 min ·
[2602.15572] Neural Network-Based Parameter Estimation of a Labour Market Agent-Based Model
Machine Learning

[2602.15572] Neural Network-Based Parameter Estimation of a Labour Market Agent-Based Model

This paper presents a neural network-based framework for parameter estimation in agent-based models (ABMs) of the labor market, demonstra...

arXiv - Machine Learning · 3 min ·
[2602.15571] Accelerated Predictive Coding Networks via Direct Kolen-Pollack Feedback Alignment
Machine Learning

[2602.15571] Accelerated Predictive Coding Networks via Direct Kolen-Pollack Feedback Alignment

The paper introduces Direct Kolen-Pollack Predictive Coding (DKP-PC), an innovative approach that enhances the efficiency of predictive c...

arXiv - Machine Learning · 3 min ·
[2602.15515] The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes
Machine Learning

[2602.15515] The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes

The paper explores how AI models can learn to obfuscate deception when trained against white-box deception detectors, introducing a taxon...

arXiv - AI · 4 min ·
[2602.15473] POP: Prior-fitted Optimizer Policies
Machine Learning

[2602.15473] POP: Prior-fitted Optimizer Policies

The paper introduces POP (Prior-fitted Optimizer Policies), a meta-learned optimization method that predicts step sizes based on contextu...

arXiv - Machine Learning · 3 min ·
[2602.15407] Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social Dilemmas
Nlp

[2602.15407] Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social Dilemmas

This paper explores how asymmetric conditions in Sequential Social Dilemmas affect cooperation dynamics in Multi-Agent Reinforcement Lear...

arXiv - Machine Learning · 4 min ·
[2602.15367] CDRL: A Reinforcement Learning Framework Inspired by Cerebellar Circuits and Dendritic Computational Strategies
Machine Learning

[2602.15367] CDRL: A Reinforcement Learning Framework Inspired by Cerebellar Circuits and Dendritic Computational Strategies

The paper presents CDRL, a reinforcement learning framework inspired by cerebellar circuits, aiming to enhance sample efficiency and robu...

arXiv - AI · 3 min ·
[2602.15332] Directional Reasoning Trajectory Change (DRTC): Identifying Critical Trace Segments in Reasoning Models
Llms

[2602.15332] Directional Reasoning Trajectory Change (DRTC): Identifying Critical Trace Segments in Reasoning Models

The paper introduces Directional Reasoning Trajectory Change (DRTC), a framework for interpreting long-horizon reasoning in language mode...

arXiv - Machine Learning · 4 min ·
[2602.15293] The Information Geometry of Softmax: Probing and Steering
Machine Learning

[2602.15293] The Information Geometry of Softmax: Probing and Steering

This paper explores the information geometry of softmax distributions, focusing on how AI systems encode semantic structures and the deve...

arXiv - AI · 3 min ·
[2602.15260] Fast and Effective On-policy Distillation from Reasoning Prefixes
Machine Learning

[2602.15260] Fast and Effective On-policy Distillation from Reasoning Prefixes

This paper presents an innovative approach to on-policy distillation (OPD) in machine learning, focusing on the effective use of reasonin...

arXiv - AI · 3 min ·
[2602.15239] Size Transferability of Graph Transformers with Convolutional Positional Encodings
Machine Learning

[2602.15239] Size Transferability of Graph Transformers with Convolutional Positional Encodings

This paper explores the size transferability of Graph Transformers (GTs) with convolutional positional encodings, demonstrating their abi...

arXiv - Machine Learning · 3 min ·
Previous Page 126 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime