AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Ai Agents

Sierra's Bret Taylor says the era of clicking buttons is over | TechCrunch

Co-founder of Sierra predicts that AI agents will make software interfaces obsolete.

TechCrunch - AI · 4 min · about 2 hours ago

Ai Agents

Visa rolls out AI agent shopping infrastructure

submitted by /u/tekz [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

Llms

I compiled every major AI agent security incident from 2024-2026 in one place - 90 incidents, all sourced, updated weekly

After tracking AI agent security incidents for the past year, I put together a single reference covering every major breach, vulnerabilit...

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

All Content

Llms

[2602.15875] Fly0: Decoupling Semantic Grounding from Geometric Planning for Zero-Shot Aerial Navigation

The paper presents Fly0, a novel framework that separates semantic grounding from geometric planning to enhance zero-shot aerial navigati...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.15873] Test-Time Adaptation for Tactile-Vision-Language Models

This paper presents a novel approach to test-time adaptation (TTA) for tactile-vision-language (TVL) models, addressing challenges posed ...

arXiv - AI · 3 min · about 2 months ago

Ai Safety

[2602.16543] Vulnerability Analysis of Safe Reinforcement Learning via Inverse Constrained Reinforcement Learning

This paper presents a framework for analyzing the vulnerabilities of Safe Reinforcement Learning (Safe RL) policies against adversarial a...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.15867] Playing With AI: How Do State-Of-The-Art Large Language Models Perform in the 1977 Text-Based Adventure Game Zork?

This paper evaluates the performance of state-of-the-art Large Language Models (LLMs) in the 1977 text-based adventure game Zork, reveali...

arXiv - AI · 4 min · about 2 months ago

Ai Agents

[2602.15865] AI as Teammate or Tool? A Review of Human-AI Interaction in Decision Support

This article reviews the role of AI in decision support, analyzing whether AI systems act as tools or collaborative teammates. It highlig...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16525] Capacity-constrained demand response in smart grids using deep reinforcement learning

This paper explores a capacity-constrained demand response strategy for smart grids using deep reinforcement learning, aiming to optimize...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16523] Reinforcement Learning for Parameterized Quantum State Preparation: A Comparative Study

This paper explores reinforcement learning techniques for parameterized quantum state preparation, comparing one-stage and two-stage trai...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.15858] State Design Matters: How Representations Shape Dynamic Reasoning in Large Language Models

This paper explores how state representations impact the reasoning capabilities of large language models (LLMs) in dynamic environments, ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.15854] Decoupling Strategy and Execution in Task-Focused Dialogue via Goal-Oriented Preference Optimization

This paper presents Goal-Oriented Preference Optimization (GOPO), a new framework for enhancing task-oriented dialogue systems by decoupl...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.15848] Can LLMs Assess Personality? Validating Conversational AI for Trait Profiling

This study evaluates the effectiveness of Large Language Models (LLMs) in assessing personality traits compared to traditional questionna...

arXiv - AI · 3 min · about 2 months ago

Ai Infrastructure

[2602.16363] Improved Bounds for Reward-Agnostic and Reward-Free Exploration

This paper presents improved algorithms for reward-free and reward-agnostic exploration in Markov decision processes, enhancing the abili...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15832] What Persona Are We Missing? Identifying Unknown Relevant Personas for Faithful User Simulation

This article explores the identification of unknown user personas in simulations, introducing the PICQ dataset and evaluating leading LLM...

arXiv - AI · 3 min · about 2 months ago

Ai Agents

[2602.16666] Towards a Science of AI Agent Reliability

This paper explores the reliability of AI agents, proposing twelve metrics to evaluate their performance across dimensions like consisten...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.16653] Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments

The article explores the Agent Skill Framework, assessing its effectiveness in enhancing small language models (SLMs) for industrial appl...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.16512] Framework of Thoughts: A Foundation Framework for Dynamic and Optimized Reasoning based on Chains, Trees, and Graphs

The article presents the Framework of Thoughts (FoT), a new foundation framework designed to enhance the reasoning capabilities of large ...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16316] A Graph Meta-Network for Learning on Kolmogorov-Arnold Networks

This paper introduces WS-KAN, a novel weight-space architecture for Kolmogorov-Arnold Networks (KANs), demonstrating its superior perform...

arXiv - Machine Learning · 4 min · about 2 months ago

Robotics

[2602.16435] Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning

The paper presents CAFE, a novel framework for automated feature engineering that combines causal discovery with multi-agent reinforcemen...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.16274] Regret and Sample Complexity of Online Q-Learning via Concentration of Stochastic Approximation with Time-Inhomogeneous Markov Chains

This paper presents a high-probability regret bound for online Q-learning in infinite-horizon discounted Markov decision processes, analy...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16424] Verifiable Semantics for Agent-to-Agent Communication

This paper introduces a certification protocol for agent-to-agent communication in multiagent AI systems, addressing semantic drift and e...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16264] Prediction of Major Solar Flares Using Interpretable Class-dependent Reward Framework with Active Region Magnetograms and Domain Knowledge

This article presents a novel supervised classification framework for predicting major solar flares using class-dependent rewards and dee...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 117 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

Sierra's Bret Taylor says the era of clicking buttons is over | TechCrunch

Visa rolls out AI agent shopping infrastructure

I compiled every major AI agent security incident from 2024-2026 in one place - 90 incidents, all sourced, updated weekly

All Content

[2602.15875] Fly0: Decoupling Semantic Grounding from Geometric Planning for Zero-Shot Aerial Navigation

[2602.15873] Test-Time Adaptation for Tactile-Vision-Language Models

[2602.16543] Vulnerability Analysis of Safe Reinforcement Learning via Inverse Constrained Reinforcement Learning

[2602.15867] Playing With AI: How Do State-Of-The-Art Large Language Models Perform in the 1977 Text-Based Adventure Game Zork?

[2602.15865] AI as Teammate or Tool? A Review of Human-AI Interaction in Decision Support

[2602.16525] Capacity-constrained demand response in smart grids using deep reinforcement learning

[2602.16523] Reinforcement Learning for Parameterized Quantum State Preparation: A Comparative Study

[2602.15858] State Design Matters: How Representations Shape Dynamic Reasoning in Large Language Models

[2602.15854] Decoupling Strategy and Execution in Task-Focused Dialogue via Goal-Oriented Preference Optimization

[2602.15848] Can LLMs Assess Personality? Validating Conversational AI for Trait Profiling

[2602.16363] Improved Bounds for Reward-Agnostic and Reward-Free Exploration

[2602.15832] What Persona Are We Missing? Identifying Unknown Relevant Personas for Faithful User Simulation

[2602.16666] Towards a Science of AI Agent Reliability

[2602.16653] Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments

[2602.16512] Framework of Thoughts: A Foundation Framework for Dynamic and Optimized Reasoning based on Chains, Trees, and Graphs

[2602.16316] A Graph Meta-Network for Learning on Kolmogorov-Arnold Networks

[2602.16435] Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning

[2602.16274] Regret and Sample Complexity of Online Q-Learning via Concentration of Stochastic Approximation with Time-Inhomogeneous Markov Chains

[2602.16424] Verifiable Semantics for Agent-to-Agent Communication

[2602.16264] Prediction of Major Solar Flares Using Interpretable Class-dependent Reward Framework with Active Region Magnetograms and Domain Knowledge

Related Topics

Stay updated with AI News