AI Agents

Autonomous agents, tool use, and agentic systems


Top This Week

AI Agents

Deepmind's 'AI Agent Traps' Paper Maps How Hackers Could Weaponize AI Agents Against Users

Google DeepMind's "AI Agent Traps" paper maps six attack types targeting autonomous AI agents, with exploit rates reaching 86% in tests.

AI Tools & Products · 7 min · 14 minutes ago

LLMs

Agentic AI in Beauty: How ChatGPT Is Reshaping Discovery, Trust, and Conversion

Agentic AI is transforming beauty shopping, shifting discovery from search to intent-driven recommendations where relevance, trust, and c...

AI Tools & Products · 7 min · 14 minutes ago

LLMs

Claude, OpenClaw and the new reality: AI agents are here — and so is the chaos

AI Tools & Products · 14 minutes ago

All Content

LLMs

[2602.18600] MapTab: Can MLLMs Master Constrained Route Planning?

The paper introduces MapTab, a benchmark for evaluating Multimodal Large Language Models (MLLMs) on constrained route planning tasks, hig...

arXiv - Machine Learning · 3 min · about 1 month ago

Robotics

[2602.18985] InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

InfEngine is an innovative autonomous engine designed to enhance infrared radiation computing by automating workflows, achieving a 92.7% ...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18584] GIST: Targeted Data Selection for Instruction Tuning via Coupled Optimization Geometry

The paper presents GIST, a method for targeted data selection in instruction tuning, improving efficiency by aligning training gradients ...

arXiv - AI · 4 min · about 1 month ago

Computer Vision

[2602.18981] How Far Can We Go with Pixels Alone? A Pilot Study on Screen-Only Navigation in Commercial 3D ARPGs

This study explores the effectiveness of screen-only navigation in 3D ARPGs, demonstrating how visual affordances can guide gameplay, whi...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18581] Learning Beyond Optimization: Stress-Gated Dynamical Regime Regulation in Autonomous Systems

The paper explores a novel framework for autonomous systems that enables learning without explicit objectives, focusing on self-regulatio...

arXiv - Machine Learning · 4 min · about 1 month ago

AI Agents

[2602.18968] Robust and Efficient Tool Orchestration via Layered Execution Structures with Reflective Correction

This article presents a novel approach to tool orchestration in agentic systems, emphasizing a layered execution structure that enhances ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18531] Deep Reinforcement Learning for Optimizing Energy Consumption in Smart Grid Systems

This paper explores the use of Deep Reinforcement Learning (DRL) combined with Physics-Informed Neural Networks (PINNs) to optimize energy...

arXiv - AI · 4 min · about 1 month ago

AI Agents

[2602.18960] Modularity is the Bedrock of Natural and Artificial Intelligence

The paper discusses the importance of modularity in both natural and artificial intelligence, highlighting its role in efficient learning...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18956] INDUCTION: Finite-Structure Concept Synthesis in First-Order Logic

The paper introduces INDUCTION, a benchmark for finite structure concept synthesis in first-order logic, focusing on generating logical f...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18528] Audio-Visual Continual Test-Time Adaptation without Forgetting

The paper presents a novel method, AV-CTTA, for audio-visual continual test-time adaptation that minimizes catastrophic forgetting while ...

arXiv - Machine Learning · 4 min · about 1 month ago

AI Agents

[2602.18947] (Perlin) Noise as AI coordinator

The paper explores using Perlin noise as a coordinator for AI in large-scale game environments, addressing challenges in balancing behavi...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18523] The Geometry of Multi-Task Grokking: Transverse Instability, Superposition, and Weight Decay Phase Structure

This article explores the geometric analysis of multi-task grokking in machine learning, detailing five key phenomena observed during tra...

arXiv - AI · 4 min · about 1 month ago

Generative AI

[2602.18943] High Dimensional Procedural Content Generation

The paper introduces High-Dimensional Procedural Content Generation (HDPCG), a framework that enhances gameplay mechanics by treating non...

arXiv - AI · 3 min · about 1 month ago

NLP

[2602.18940] DREAM: Deep Research Evaluation with Agentic Metrics

The paper presents DREAM, a framework for evaluating Deep Research Agents, addressing challenges in assessing research quality through ag...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.18884] TPRU: Advancing Temporal and Procedural Understanding in Large Multimodal Models

The paper introduces TPRU, a dataset aimed at improving temporal and procedural understanding in Multimodal Large Language Models (MLLMs)...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18843] ABD: Default Exception Abduction in Finite First Order Worlds

The paper introduces ABD, a benchmark for default-exception abduction in finite first-order worlds, evaluating LLMs on their ability to d...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.18493] Learning to Remember: End-to-End Training of Memory Agents for Long-Context Reasoning

The paper presents the Unified Memory Agent (UMA), an end-to-end reinforcement learning framework designed for long-context reasoning, en...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18812] GenPlanner: From Noise to Plans -- Emergent Reasoning in Flow Matching and Diffusion Models

The paper presents GenPlanner, a novel approach to path planning in complex environments using generative models, specifically diffusion ...

arXiv - AI · 3 min · about 1 month ago

AI Agents

[2602.18773] LAMMI-Pathology: A Tool-Centric Bottom-Up LVLM-Agent Framework for Molecularly Informed Medical Intelligence in Pathology

The LAMMI-Pathology framework proposes a novel tool-centric approach for enhancing molecularly informed medical intelligence in pathology...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.18764] The Convergence of Schema-Guided Dialogue Systems and the Model Context Protocol

This paper discusses the convergence of Schema-Guided Dialogue Systems and the Model Context Protocol, proposing five foundational princi...

arXiv - AI · 3 min · about 1 month ago
