AI Agents

Autonomous agents, tool use, and agentic systems


Top This Week

AI Agents

[P] Easily provide Wandb logs as context to agents for analysis and planning.

It is frustrating to use the Wandb CLI and MCP tools with my agents. For one, the MCP tool basically floods the context window and freque...

Reddit - Machine Learning · 1 min · about 1 hour ago

AI Agents

DeepMind's 'AI Agent Traps' Paper Maps How Hackers Could Weaponize AI Agents Against Users

AI Tools & Products · 7 min · about 8 hours ago

LLMs

Claude, OpenClaw and the new reality: AI agents are here — and so is the chaos

AI Tools & Products · about 8 hours ago

All Content

Machine Learning

[2602.17773] Learning Flow Distributions via Projection-Constrained Diffusion on Manifolds

The paper presents a novel generative modeling framework for synthesizing physically feasible two-dimensional incompressible flows, addre...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.17747] AgriVariant: Variant Effect Prediction using DeepChem-Variant for Precision Breeding in Rice

The article presents AgriVariant, a deep learning-based pipeline for predicting the effects of genetic variants in rice, enhancing precis...

arXiv - Machine Learning · 3 min · about 1 month ago

NLP

[2602.17999] Aurora: Neuro-Symbolic AI Driven Advising Agent

Aurora is a neuro-symbolic AI advising agent designed to enhance academic advising in higher education by providing timely, policy-compli...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.17737] Nested Training for Mutual Adaptation in Human-AI Teaming

This paper presents a novel nested training approach for enhancing mutual adaptation in human-AI teaming, addressing challenges in agent ...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.17951] ROCKET: Residual-Oriented Multi-Layer Alignment for Spatially-Aware Vision-Language-Action Models

The paper presents ROCKET, a novel framework for enhancing Vision-Language-Action models by employing residual-oriented multi-layer align...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.17949] CUICurate: A GraphRAG-based Framework for Automated Clinical Concept Curation for NLP applications

CUICurate introduces a GraphRAG framework for automated curation of clinical concepts in NLP, enhancing efficiency and accuracy in clinic...

arXiv - AI · 4 min · about 1 month ago

AI Agents

[2602.17913] From Lossy to Verified: A Provenance-Aware Tiered Memory for Agents

The paper presents TierMem, a novel memory framework for agents that balances the need for accurate evidence with efficiency, reducing la...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18409] Unifying approach to uniform expressivity of graph neural networks

This paper presents a unified framework for enhancing the expressivity of Graph Neural Networks (GNNs) through Template GNNs (T-GNNs), es...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.17911] Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering

The paper introduces Condition-Gated Reasoning (CGR) for context-dependent biomedical question answering, addressing the limitations of e...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18401] Leakage and Second-Order Dynamics Improve Hippocampal RNN Replay

This paper explores how leakage and second-order dynamics can enhance replay mechanisms in hippocampal recurrent neural networks (RNNs), ...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.17875] MultiVer: Zero-Shot Multi-Agent Vulnerability Detection

The paper presents MultiVer, a zero-shot multi-agent system for vulnerability detection that outperforms fine-tuned models in recall, ach...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.17871] Understanding the Fine-Grained Knowledge Capabilities of Vision-Language Models

This paper explores the fine-grained knowledge capabilities of vision-language models (VLMs), highlighting their performance on visual qu...

arXiv - Machine Learning · 3 min · about 1 month ago

NLP

[2602.17856] Enhancing Scientific Literature Chatbots with Retrieval-Augmented Generation: A Performance Evaluation of Vector and Graph-Based Systems

This paper evaluates the enhancement of scientific literature chatbots using retrieval-augmented generation (RAG), comparing vector and g...

arXiv - AI · 3 min · about 1 month ago

AI Safety

[2602.18277] PRISM: Parallel Reward Integration with Symmetry for MORL

The paper presents PRISM, a novel algorithm for Multi-Objective Reinforcement Learning (MORL) that addresses the challenges of heterogene...

arXiv - Machine Learning · 4 min · about 1 month ago

NLP

[2602.17850] Mind the Style: Impact of Communication Style on Human-Chatbot Interaction

This article examines how different communication styles of chatbots affect user experience and task success, revealing insights from a u...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.18266] A Probabilistic Framework for LLM-Based Model Discovery

This article presents a probabilistic framework for discovering mechanistic models using large language models (LLMs), introducing an alg...

arXiv - Machine Learning · 3 min · about 1 month ago

AI Agents

[2602.17753] The 2025 AI Agent Index: Documenting Technical and Safety Features of Deployed Agentic AI Systems

The 2025 AI Agent Index presents a comprehensive overview of 30 deployed agentic AI systems, detailing their technical and safety feature...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18250] Variational Distributional Neuron

The paper introduces the concept of a Variational Distributional Neuron, a compute unit that incorporates uncertainty in its operations, ...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.18230] [Re] Benchmarking LLM Capabilities in Negotiation through Scoreable Games

This paper evaluates the benchmarking of Large Language Models (LLMs) in negotiation tasks using Scoreable Games, assessing the reproduci...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.17734] Five Fatal Assumptions: Why T-Shirt Sizing Systematically Fails for AI Projects

This paper critiques the T-shirt sizing estimation method in AI projects, highlighting five key assumptions that often lead to failure an...

arXiv - AI · 4 min · about 1 month ago


