AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

[P] Easily provide Wandb logs as context to agents for analysis and planning.

It is frustrating to use the Wandb CLI and MCP tools with my agents. For one, the MCP tool basically floods the context window and freque...

Reddit - Machine Learning · 1 min ·
Deepmind's 'AI Agent Traps' Paper Maps How Hackers Could Weaponize AI Agents Against Users
Ai Agents

Deepmind's 'AI Agent Traps' Paper Maps How Hackers Could Weaponize AI Agents Against Users

AI Tools & Products · 7 min ·
Llms

Claude, OpenClaw and the new reality: AI agents are here — and so is the chaos

AI Tools & Products ·

All Content

[2602.17773] Learning Flow Distributions via Projection-Constrained Diffusion on Manifolds
Machine Learning

[2602.17773] Learning Flow Distributions via Projection-Constrained Diffusion on Manifolds

The paper presents a novel generative modeling framework for synthesizing physically feasible two-dimensional incompressible flows, addre...

arXiv - Machine Learning · 3 min ·
[2602.17747] AgriVariant: Variant Effect Prediction using DeepChem-Variant for Precision Breeding in Rice
Machine Learning

[2602.17747] AgriVariant: Variant Effect Prediction using DeepChem-Variant for Precision Breeding in Rice

The article presents AgriVariant, a deep learning-based pipeline for predicting the effects of genetic variants in rice, enhancing precis...

arXiv - Machine Learning · 3 min ·
[2602.17999] Aurora: Neuro-Symbolic AI Driven Advising Agent
Nlp

[2602.17999] Aurora: Neuro-Symbolic AI Driven Advising Agent

Aurora is a neuro-symbolic AI advising agent designed to enhance academic advising in higher education by providing timely, policy-compli...

arXiv - AI · 4 min ·
[2602.17737] Nested Training for Mutual Adaptation in Human-AI Teaming
Machine Learning

[2602.17737] Nested Training for Mutual Adaptation in Human-AI Teaming

This paper presents a novel nested training approach for enhancing mutual adaptation in human-AI teaming, addressing challenges in agent ...

arXiv - Machine Learning · 4 min ·
[2602.17951] ROCKET: Residual-Oriented Multi-Layer Alignment for Spatially-Aware Vision-Language-Action Models
Llms

[2602.17951] ROCKET: Residual-Oriented Multi-Layer Alignment for Spatially-Aware Vision-Language-Action Models

The paper presents ROCKET, a novel framework for enhancing Vision-Language-Action models by employing residual-oriented multi-layer align...

arXiv - AI · 4 min ·
[2602.17949] CUICurate: A GraphRAG-based Framework for Automated Clinical Concept Curation for NLP applications
Machine Learning

[2602.17949] CUICurate: A GraphRAG-based Framework for Automated Clinical Concept Curation for NLP applications

CUICurate introduces a GraphRAG framework for automated curation of clinical concepts in NLP, enhancing efficiency and accuracy in clinic...

arXiv - AI · 4 min ·
[2602.17913] From Lossy to Verified: A Provenance-Aware Tiered Memory for Agents
Ai Agents

[2602.17913] From Lossy to Verified: A Provenance-Aware Tiered Memory for Agents

The paper presents TierMem, a novel memory framework for agents that balances the need for accurate evidence with efficiency, reducing la...

arXiv - AI · 4 min ·
[2602.18409] Unifying approach to uniform expressivity of graph neural networks
Machine Learning

[2602.18409] Unifying approach to uniform expressivity of graph neural networks

This paper presents a unified framework for enhancing the expressivity of Graph Neural Networks (GNNs) through Template GNNs (T-GNNs), es...

arXiv - Machine Learning · 3 min ·
[2602.17911] Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering
Machine Learning

[2602.17911] Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering

The paper introduces Condition-Gated Reasoning (CGR) for context-dependent biomedical question answering, addressing the limitations of e...

arXiv - AI · 3 min ·
[2602.18401] Leakage and Second-Order Dynamics Improve Hippocampal RNN Replay
Machine Learning

[2602.18401] Leakage and Second-Order Dynamics Improve Hippocampal RNN Replay

This paper explores how leakage and second-order dynamics can enhance replay mechanisms in hippocampal recurrent neural networks (RNNs), ...

arXiv - Machine Learning · 4 min ·
[2602.17875] MultiVer: Zero-Shot Multi-Agent Vulnerability Detection
Llms

[2602.17875] MultiVer: Zero-Shot Multi-Agent Vulnerability Detection

The paper presents MultiVer, a zero-shot multi-agent system for vulnerability detection that outperforms fine-tuned models in recall, ach...

arXiv - AI · 3 min ·
[2602.17871] Understanding the Fine-Grained Knowledge Capabilities of Vision-Language Models
Llms

[2602.17871] Understanding the Fine-Grained Knowledge Capabilities of Vision-Language Models

This paper explores the fine-grained knowledge capabilities of vision-language models (VLMs), highlighting their performance on visual qu...

arXiv - Machine Learning · 3 min ·
[2602.17856] Enhancing Scientific Literature Chatbots with Retrieval-Augmented Generation: A Performance Evaluation of Vector and Graph-Based Systems
Nlp

[2602.17856] Enhancing Scientific Literature Chatbots with Retrieval-Augmented Generation: A Performance Evaluation of Vector and Graph-Based Systems

This paper evaluates the enhancement of scientific literature chatbots using retrieval-augmented generation (RAG), comparing vector and g...

arXiv - AI · 3 min ·
[2602.18277] PRISM: Parallel Reward Integration with Symmetry for MORL
Ai Safety

[2602.18277] PRISM: Parallel Reward Integration with Symmetry for MORL

The paper presents PRISM, a novel algorithm for Multi-Objective Reinforcement Learning (MORL) that addresses the challenges of heterogene...

arXiv - Machine Learning · 4 min ·
[2602.17850] Mind the Style: Impact of Communication Style on Human-Chatbot Interaction
Nlp

[2602.17850] Mind the Style: Impact of Communication Style on Human-Chatbot Interaction

This article examines how different communication styles of chatbots affect user experience and task success, revealing insights from a u...

arXiv - AI · 3 min ·
[2602.18266] A Probabilistic Framework for LLM-Based Model Discovery
Llms

[2602.18266] A Probabilistic Framework for LLM-Based Model Discovery

This article presents a probabilistic framework for discovering mechanistic models using large language models (LLMs), introducing an alg...

arXiv - Machine Learning · 3 min ·
[2602.17753] The 2025 AI Agent Index: Documenting Technical and Safety Features of Deployed Agentic AI Systems
Ai Agents

[2602.17753] The 2025 AI Agent Index: Documenting Technical and Safety Features of Deployed Agentic AI Systems

The 2025 AI Agent Index presents a comprehensive overview of 30 deployed agentic AI systems, detailing their technical and safety feature...

arXiv - AI · 3 min ·
[2602.18250] Variational Distributional Neuron
Machine Learning

[2602.18250] Variational Distributional Neuron

The paper introduces the concept of a Variational Distributional Neuron, a compute unit that incorporates uncertainty in its operations, ...

arXiv - Machine Learning · 4 min ·
[2602.18230] [Re] Benchmarking LLM Capabilities in Negotiation through Scoreable Games
Llms

[2602.18230] [Re] Benchmarking LLM Capabilities in Negotiation through Scoreable Games

This paper evaluates the benchmarking of Large Language Models (LLMs) in negotiation tasks using Scoreable Games, assessing the reproduci...

arXiv - Machine Learning · 4 min ·
[2602.17734] Five Fatal Assumptions: Why T-Shirt Sizing Systematically Fails for AI Projects
Llms

[2602.17734] Five Fatal Assumptions: Why T-Shirt Sizing Systematically Fails for AI Projects

This paper critiques the T-shirt sizing estimation method in AI projects, highlighting five key assumptions that often lead to failure an...

arXiv - AI · 4 min ·
Previous Page 90 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime