AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Been building a multi-agent framework in public for 5 weeks, its been a Journey.

I've been building this repo public since day one, roughly 5 weeks now with Claude Code. Here's where it's at. Feels good to be so close....

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

Machine Learning

"There's a new generation of empirical deep learning researchers, hacking away at whatever seems trendy, blowing with the wind" [D]

Saw this on X. I too am struggling with the term post agentic ai just posting here for further discussion. submitted by /u/elnino2023 [li...

Reddit - Machine Learning · 1 min · about 9 hours ago

Ai Infrastructure

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

How do people make sense of this? submitted by /u/stvlsn [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 14 hours ago

All Content

Llms

[2602.12500] Favia: Forensic Agent for Vulnerability-fix Identification and Analysis

The paper presents Favia, a forensic agent designed to identify and analyze vulnerability-fixing commits in software repositories, improv...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.12486] Human-Like Coarse Object Representations in Vision Models

This paper explores how vision models can develop human-like coarse object representations, emphasizing the balance between detail and ph...

arXiv - AI · 3 min · about 2 months ago

Ai Agents

[2602.12476] Not a Silver Bullet for Loneliness: How Attachment and Age Shape Intimacy with AI Companions

This article explores how attachment styles and age influence the intimacy users develop with AI companions, challenging the notion that ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12470] Designing RNAs with Language Models

The paper presents a novel approach to RNA design using language models, reframing the task as conditional sequence generation, which sig...

arXiv - Machine Learning · 3 min · about 2 months ago

Ai Safety

[2602.12463] Correctness, Artificial Intelligence, and the Epistemic Value of Mathematical Proof

This paper examines the relationship between correctness in mathematical proofs and their epistemic value, arguing that formal correctnes...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.12444] Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

This paper presents a novel recovery-based shielding framework for safe reinforcement learning (RL) using Gaussian process dynamics model...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.12430] Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward

This paper discusses the evolution of large language models (LLMs) into modular agents equipped with skills, emphasizing architecture, ac...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.12402] AstRL: Analog and Mixed-Signal Circuit Synthesis with Deep Reinforcement Learning

AstRL introduces a novel method for analog and mixed-signal circuit synthesis using deep reinforcement learning, significantly improving ...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.12395] What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis

This paper explores the impact of reinforcement learning (RL) on visual reasoning capabilities in vision-language models, proposing a nov...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.12390] Rational Neural Networks have Expressivity Advantages

The paper explores the advantages of Rational Neural Networks, demonstrating their superior expressivity and parameter efficiency compare...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.12375] Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning

This paper introduces Value Bonuses with Ensemble Errors (VBE), an innovative algorithm that enhances exploration in reinforcement learni...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.12342] Intrinsic Credit Assignment for Long Horizon Interaction

This article presents a novel approach, {}Belief-RL, for training agents to navigate uncertainty over long horizons by utilizing intrins...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.12322] ForeAct: Steering Your VLA with Efficient Visual Foresight Planning

The paper presents ForeAct, a novel Visual Foresight Planning framework that enhances Vision-Language-Action (VLA) models by enabling the...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2602.12315] AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping

The paper presents AgenticShop, a benchmark for evaluating agentic systems in personalized web shopping, addressing gaps in current evalu...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2602.12311] Perceptual Self-Reflection in Agentic Physics Simulation Code Generation

This article presents a multi-agent framework for generating physics simulation code from natural language descriptions, introducing a no...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12305] OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization

OptiML is a novel framework that enhances CUDA kernel optimization through program synthesis, leveraging large language models for improv...

arXiv - Machine Learning · 4 min · about 2 months ago

Nlp

[2602.12296] Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method

This article presents a novel adaptive traffic signal control method utilizing Deep Q-Networks and Proximal Policy Optimization to enhanc...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.12287] Retrieval-Augmented Self-Taught Reasoning Model with Adaptive Chain-of-Thought for ASR Named Entity Correction

This article presents a novel retrieval-augmented reasoning model designed to enhance named entity correction in automatic speech recogni...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.12285] From Biased Chatbots to Biased Agents: Examining Role Assignment Effects on LLM Agent Robustness

This article examines how demographic-based persona assignments in large language models (LLMs) can impact agent performance, revealing v...

arXiv - AI · 3 min · about 2 months ago

Ai Agents

[2602.13135] Constrained Assumption-Based Argumentation Frameworks

This paper introduces Constrained Assumption-Based Argumentation (CABA), extending traditional Assumption-Based Argumentation frameworks ...

arXiv - AI · 3 min · about 2 months ago

Previous Page 153 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

Been building a multi-agent framework in public for 5 weeks, its been a Journey.

"There's a new generation of empirical deep learning researchers, hacking away at whatever seems trendy, blowing with the wind" [D]

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

All Content

[2602.12500] Favia: Forensic Agent for Vulnerability-fix Identification and Analysis

[2602.12486] Human-Like Coarse Object Representations in Vision Models

[2602.12476] Not a Silver Bullet for Loneliness: How Attachment and Age Shape Intimacy with AI Companions

[2602.12470] Designing RNAs with Language Models

[2602.12463] Correctness, Artificial Intelligence, and the Epistemic Value of Mathematical Proof

[2602.12444] Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

[2602.12430] Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward

[2602.12402] AstRL: Analog and Mixed-Signal Circuit Synthesis with Deep Reinforcement Learning

[2602.12395] What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis

[2602.12390] Rational Neural Networks have Expressivity Advantages

[2602.12375] Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning

[2602.12342] Intrinsic Credit Assignment for Long Horizon Interaction

[2602.12322] ForeAct: Steering Your VLA with Efficient Visual Foresight Planning

[2602.12315] AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping

[2602.12311] Perceptual Self-Reflection in Agentic Physics Simulation Code Generation

[2602.12305] OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization

[2602.12296] Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method

[2602.12287] Retrieval-Augmented Self-Taught Reasoning Model with Adaptive Chain-of-Thought for ASR Named Entity Correction

[2602.12285] From Biased Chatbots to Biased Agents: Examining Role Assignment Effects on LLM Agent Robustness

[2602.13135] Constrained Assumption-Based Argumentation Frameworks

Related Topics

Stay updated with AI News