AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Ai Agents

[P] Easily provide Wandb logs as context to agents for analysis and planning.

It is frustrating to use the Wandb CLI and MCP tools with my agents. For one, the MCP tool basically floods the context window and freque...

Reddit - Machine Learning · 1 min ·
Deepmind's 'AI Agent Traps' Paper Maps How Hackers Could Weaponize AI Agents Against Users
Ai Agents

Deepmind's 'AI Agent Traps' Paper Maps How Hackers Could Weaponize AI Agents Against Users

AI Tools & Products · 7 min ·
Llms

Claude, OpenClaw and the new reality: AI agents are here — and so is the chaos

AI Tools & Products ·

All Content

[2602.17902] El Agente Gráfico: Structured Execution Graphs for Scientific Agents
Llms

[2602.17902] El Agente Gráfico: Structured Execution Graphs for Scientific Agents

The paper introduces El Agente Gráfico, a framework that enhances scientific workflows by integrating LLMs with structured execution grap...

arXiv - AI · 4 min ·
[2602.17831] The Token Games: Evaluating Language Model Reasoning with Puzzle Duels
Llms

[2602.17831] The Token Games: Evaluating Language Model Reasoning with Puzzle Duels

The Token Games introduces a novel evaluation framework for language models, using puzzle duels to assess reasoning capabilities without ...

arXiv - AI · 4 min ·
[2602.17826] Ontology-Guided Neuro-Symbolic Inference: Grounding Language Models with Mathematical Domain Knowledge
Llms

[2602.17826] Ontology-Guided Neuro-Symbolic Inference: Grounding Language Models with Mathematical Domain Knowledge

This article explores the integration of formal domain ontologies into language models to enhance their reliability in mathematical reaso...

arXiv - Machine Learning · 3 min ·
[2602.17676] Epistemic Traps: Rational Misalignment Driven by Model Misspecification
Llms

[2602.17676] Epistemic Traps: Rational Misalignment Driven by Model Misspecification

This paper explores how model misspecification leads to rational misalignments in AI behavior, presenting a new framework for understandi...

arXiv - Machine Learning · 4 min ·
[2602.17931] Memory-Based Advantage Shaping for LLM-Guided Reinforcement Learning
Llms

[2602.17931] Memory-Based Advantage Shaping for LLM-Guided Reinforcement Learning

This article presents a novel approach to reinforcement learning (RL) using memory-based advantage shaping, leveraging large language mod...

arXiv - Machine Learning · 3 min ·
[2602.17930] MIRA: Memory-Integrated Reinforcement Learning Agent with Limited LLM Guidance
Llms

[2602.17930] MIRA: Memory-Integrated Reinforcement Learning Agent with Limited LLM Guidance

The paper presents MIRA, a Memory-Integrated Reinforcement Learning Agent that reduces reliance on large language models (LLMs) by utiliz...

arXiv - Machine Learning · 4 min ·
[2602.17898] Breaking the Correlation Plateau: On the Optimization and Capacity Limits of Attention-Based Regressors
Machine Learning

[2602.17898] Breaking the Correlation Plateau: On the Optimization and Capacity Limits of Attention-Based Regressors

This paper explores the limitations of attention-based regression models, particularly the phenomenon of the Pearson correlation coeffici...

arXiv - Machine Learning · 4 min ·
[2602.17832] MePoly: Max Entropy Polynomial Policy Optimization
Generative Ai

[2602.17832] MePoly: Max Entropy Polynomial Policy Optimization

MePoly introduces a novel polynomial energy-based model for policy optimization in stochastic control, enhancing multi-modality represent...

arXiv - Machine Learning · 3 min ·
[2602.17827] Avoid What You Know: Divergent Trajectory Balance for GFlowNets
Machine Learning

[2602.17827] Avoid What You Know: Divergent Trajectory Balance for GFlowNets

The paper presents Adaptive Complementary Exploration (ACE), an algorithm designed to enhance the efficiency of Generative Flow Networks ...

arXiv - Machine Learning · 4 min ·
[2602.17798] Grassmannian Mixture-of-Experts: Concentration-Controlled Routing on Subspace Manifolds
Machine Learning

[2602.17798] Grassmannian Mixture-of-Experts: Concentration-Controlled Routing on Subspace Manifolds

The paper presents Grassmannian Mixture-of-Experts (GrMoE), a novel routing framework that enhances expert assignment in machine learning...

arXiv - Machine Learning · 4 min ·
[2602.17744] Bayesian Optimality of In-Context Learning with Selective State Spaces
Machine Learning

[2602.17744] Bayesian Optimality of In-Context Learning with Selective State Spaces

This paper introduces Bayesian optimal sequential prediction as a framework for understanding in-context learning (ICL), demonstrating it...

arXiv - Machine Learning · 4 min ·
[2602.17695] EXACT: Explicit Attribute-Guided Decoding-Time Personalization
Llms

[2602.17695] EXACT: Explicit Attribute-Guided Decoding-Time Personalization

The paper presents EXACT, a novel approach for decoding-time personalization in large language models, enhancing user alignment through i...

arXiv - Machine Learning · 3 min ·
[2602.17692] Agentic Unlearning: When LLM Agent Meets Machine Unlearning
Llms

[2602.17692] Agentic Unlearning: When LLM Agent Meets Machine Unlearning

The paper introduces 'agentic unlearning,' a novel approach to remove sensitive information from both model parameters and memory in AI a...

arXiv - Machine Learning · 3 min ·
[2602.17685] Optimal Multi-Debris Mission Planning in LEO: A Deep Reinforcement Learning Approach with Co-Elliptic Transfers and Refueling
Machine Learning

[2602.17685] Optimal Multi-Debris Mission Planning in LEO: A Deep Reinforcement Learning Approach with Co-Elliptic Transfers and Refueling

This paper presents a novel approach to multi-target active debris removal in Low Earth Orbit using deep reinforcement learning, co-ellip...

arXiv - Machine Learning · 3 min ·
Samsung is adding Perplexity to Galaxy AI for its upcoming S26 series
Ai Startups

Samsung is adding Perplexity to Galaxy AI for its upcoming S26 series

Samsung is integrating Perplexity's AI agent into its Galaxy AI for the upcoming S26 series, enhancing user experience with multiple AI f...

AI Tools & Products · 2 min ·
What Happened When We Let AI Agents Cross-Examine Each Other
Ai Agents

What Happened When We Let AI Agents Cross-Examine Each Other

The article explores an experiment where AI agents cross-examined each other after a summit, revealing insights about their interactions ...

AI Tools & Products · 7 min ·
AI agent invasion leaves people scrambling to pick winners | Daily Sabah
Ai Agents

AI agent invasion leaves people scrambling to pick winners | Daily Sabah

The rise of AI agents capable of performing complex tasks is reshaping the tech landscape, prompting investors to reassess their strategi...

AI Tools & Products · 4 min ·
Samsung is adding Perplexity to Galaxy AI | The Verge
Llms

Samsung is adding Perplexity to Galaxy AI | The Verge

Samsung is integrating Perplexity into its Galaxy AI, enhancing its multi-agent ecosystem to allow users to interact with various AI agen...

The Verge - AI · 4 min ·
Machine Learning

[P] Ai Learns to play Street Fighter 6

This article details the process of training an AI to play Street Fighter 6 using imitation learning, showcasing both the gameplay and te...

Reddit - Machine Learning · 1 min ·
Ai Agents

We have HR for managing human capital. What's the equivalent for AI agents?

The article discusses the need for a management framework for AI agents, similar to HR for human capital, as organizations increasingly d...

Reddit - Artificial Intelligence · 1 min ·
Previous Page 92 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime