AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Sierra's Bret Taylor says the era of clicking buttons is over | TechCrunch
Ai Agents

Sierra's Bret Taylor says the era of clicking buttons is over | TechCrunch

Co-founder of Sierra predicts that AI agents will make software interfaces obsolete.

TechCrunch - AI · 4 min ·
Ai Agents

Visa rolls out AI agent shopping infrastructure

submitted by /u/tekz [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Llms

I compiled every major AI agent security incident from 2024-2026 in one place - 90 incidents, all sourced, updated weekly

After tracking AI agent security incidents for the past year, I put together a single reference covering every major breach, vulnerabilit...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.16301] Multi-agent cooperation through in-context co-player inference
Machine Learning

[2602.16301] Multi-agent cooperation through in-context co-player inference

This paper explores multi-agent cooperation in reinforcement learning through in-context learning, demonstrating how sequence models can ...

arXiv - AI · 4 min ·
[2602.16229] Factored Latent Action World Models
Machine Learning

[2602.16229] Factored Latent Action World Models

The paper presents the Factored Latent Action Model (FLAM), a new framework for modeling complex dynamics in action-free video generation...

arXiv - Machine Learning · 3 min ·
[2602.16246] Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents
Llms

[2602.16246] Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents

This paper presents a Proxy State-Based Evaluation framework for assessing multi-turn tool-calling LLM agents, offering a scalable altern...

arXiv - AI · 4 min ·
[2602.16192] Revolutionizing Long-Term Memory in AI: New Horizons with High-Capacity and High-Speed Storage
Nlp

[2602.16192] Revolutionizing Long-Term Memory in AI: New Horizons with High-Capacity and High-Speed Storage

This article discusses innovative approaches to long-term memory in AI, emphasizing the importance of retaining raw experiences for bette...

arXiv - Machine Learning · 4 min ·
[2602.16179] EnterpriseGym Corecraft: Training Generalizable Agents on High-Fidelity RL Environments
Machine Learning

[2602.16179] EnterpriseGym Corecraft: Training Generalizable Agents on High-Fidelity RL Environments

The paper presents EnterpriseGym Corecraft, a novel high-fidelity reinforcement learning environment designed to train AI agents for gene...

arXiv - Machine Learning · 4 min ·
[2602.16173] Learning Personalized Agents from Human Feedback
Machine Learning

[2602.16173] Learning Personalized Agents from Human Feedback

The paper presents a framework, Personalized Agents from Human Feedback (PAHF), which enables AI agents to adapt to individual user prefe...

arXiv - Machine Learning · 4 min ·
[2602.16105] GPSBench: Do Large Language Models Understand GPS Coordinates?
Llms

[2602.16105] GPSBench: Do Large Language Models Understand GPS Coordinates?

The paper introduces GPSBench, a dataset designed to evaluate the geospatial reasoning capabilities of large language models (LLMs) using...

arXiv - AI · 3 min ·
[2602.16050] Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinical Intelligence Layer on the 2025 Endocrinology Board-Style Examination
Llms

[2602.16050] Evidence-Grounded Subspecialty Reasoning: Evaluating a Curated Clinical Intelligence Layer on the 2025 Endocrinology Board-Style Examination

This article evaluates the performance of the January Mirror, an evidence-grounded clinical reasoning system, against leading large langu...

arXiv - AI · 4 min ·
[2602.16066] Improving Interactive In-Context Learning from Natural Language Feedback
Llms

[2602.16066] Improving Interactive In-Context Learning from Natural Language Feedback

This paper presents a novel framework for improving interactive in-context learning in large language models by utilizing natural languag...

arXiv - AI · 4 min ·
[2602.16213] Graph neural network for colliding particles with an application to sea ice floe modeling
Machine Learning

[2602.16213] Graph neural network for colliding particles with an application to sea ice floe modeling

This article presents a novel Graph Neural Network approach for modeling sea ice dynamics, focusing on particle collisions and data assim...

arXiv - AI · 3 min ·
[2602.16037] Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection
Robotics

[2602.16037] Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection

This paper explores optimization instability in autonomous workflows for clinical symptom detection, revealing critical failure modes and...

arXiv - AI · 4 min ·
[2602.16012] Towards Efficient Constraint Handling in Neural Solvers for Routing Problems
Machine Learning

[2602.16012] Towards Efficient Constraint Handling in Neural Solvers for Routing Problems

The paper presents Construct-and-Refine (CaR), a novel framework for efficiently handling constraints in neural solvers for routing probl...

arXiv - Machine Learning · 4 min ·
[2602.16196] Graphon Mean-Field Subsampling for Cooperative Heterogeneous Multi-Agent Reinforcement Learning
Ai Agents

[2602.16196] Graphon Mean-Field Subsampling for Cooperative Heterogeneous Multi-Agent Reinforcement Learning

This paper introduces Graphon Mean-Field Subsampling (GMFS), a framework for scalable cooperative multi-agent reinforcement learning (MAR...

arXiv - AI · 3 min ·
[2602.16193] Rethinking Input Domains in Physics-Informed Neural Networks via Geometric Compactification Mappings
Machine Learning

[2602.16193] Rethinking Input Domains in Physics-Informed Neural Networks via Geometric Compactification Mappings

This article presents a novel approach to enhance Physics-Informed Neural Networks (PINNs) by utilizing geometric compactification mappin...

arXiv - AI · 3 min ·
[2602.16165] HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents
Llms

[2602.16165] HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents

The HiPER framework introduces a hierarchical approach to reinforcement learning for large language model agents, enhancing decision-maki...

arXiv - AI · 4 min ·
[2602.16147] ASPEN: Spectral-Temporal Fusion for Cross-Subject Brain Decoding
Machine Learning

[2602.16147] ASPEN: Spectral-Temporal Fusion for Cross-Subject Brain Decoding

The paper presents ASPEN, a novel architecture that enhances cross-subject brain decoding by integrating spectral and temporal features, ...

arXiv - AI · 3 min ·
[2602.15997] Anatomy of Capability Emergence: Scale-Invariant Representation Collapse and Top-Down Reorganization in Neural Networks
Llms

[2602.15997] Anatomy of Capability Emergence: Scale-Invariant Representation Collapse and Top-Down Reorganization in Neural Networks

This article explores the mechanisms of capability emergence in neural networks, revealing a scale-invariant representation collapse and ...

arXiv - Machine Learning · 4 min ·
[2602.15971] B-DENSE: Branching For Dense Ensemble Network Learning
Machine Learning

[2602.15971] B-DENSE: Branching For Dense Ensemble Network Learning

The paper presents B-DENSE, a novel framework for improving dense ensemble network learning by leveraging multi-branch trajectory alignme...

arXiv - AI · 3 min ·
[2602.15955] Adaptive Semi-Supervised Training of P300 ERP-BCI Speller System with Minimum Calibration Effort
Machine Learning

[2602.15955] Adaptive Semi-Supervised Training of P300 ERP-BCI Speller System with Minimum Calibration Effort

This article presents a novel adaptive semi-supervised training method for a P300 ERP-based Brain-Computer Interface (BCI) speller system...

arXiv - Machine Learning · 4 min ·
[2602.15879] BamaER: A Behavior-Aware Memory-Augmented Model for Exercise Recommendation
Machine Learning

[2602.15879] BamaER: A Behavior-Aware Memory-Augmented Model for Exercise Recommendation

The paper presents BamaER, a memory-augmented model designed for personalized exercise recommendations based on students' learning behavi...

arXiv - Machine Learning · 4 min ·
Previous Page 118 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime