AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Llms

Been building a multi-agent framework in public for 5 weeks, its been a Journey.

I've been building this repo public since day one, roughly 5 weeks now with Claude Code. Here's where it's at. Feels good to be so close....

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

"There's a new generation of empirical deep learning researchers, hacking away at whatever seems trendy, blowing with the wind" [D]

Saw this on X. I too am struggling with the term post agentic ai just posting here for further discussion. submitted by /u/elnino2023 [li...

Reddit - Machine Learning · 1 min ·
Ai Infrastructure

Alibaba-linked AI agent hijacked GPUs for unauthorized crypto mining, researchers say

How do people make sense of this? submitted by /u/stvlsn [link] [comments]

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.12500] Favia: Forensic Agent for Vulnerability-fix Identification and Analysis
Llms

[2602.12500] Favia: Forensic Agent for Vulnerability-fix Identification and Analysis

The paper presents Favia, a forensic agent designed to identify and analyze vulnerability-fixing commits in software repositories, improv...

arXiv - AI · 4 min ·
[2602.12486] Human-Like Coarse Object Representations in Vision Models
Machine Learning

[2602.12486] Human-Like Coarse Object Representations in Vision Models

This paper explores how vision models can develop human-like coarse object representations, emphasizing the balance between detail and ph...

arXiv - AI · 3 min ·
[2602.12476] Not a Silver Bullet for Loneliness: How Attachment and Age Shape Intimacy with AI Companions
Ai Agents

[2602.12476] Not a Silver Bullet for Loneliness: How Attachment and Age Shape Intimacy with AI Companions

This article explores how attachment styles and age influence the intimacy users develop with AI companions, challenging the notion that ...

arXiv - AI · 4 min ·
[2602.12470] Designing RNAs with Language Models
Llms

[2602.12470] Designing RNAs with Language Models

The paper presents a novel approach to RNA design using language models, reframing the task as conditional sequence generation, which sig...

arXiv - Machine Learning · 3 min ·
[2602.12463] Correctness, Artificial Intelligence, and the Epistemic Value of Mathematical Proof
Ai Safety

[2602.12463] Correctness, Artificial Intelligence, and the Epistemic Value of Mathematical Proof

This paper examines the relationship between correctness in mathematical proofs and their epistemic value, arguing that formal correctnes...

arXiv - AI · 3 min ·
[2602.12444] Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models
Machine Learning

[2602.12444] Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

This paper presents a novel recovery-based shielding framework for safe reinforcement learning (RL) using Gaussian process dynamics model...

arXiv - AI · 3 min ·
[2602.12430] Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward
Llms

[2602.12430] Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward

This paper discusses the evolution of large language models (LLMs) into modular agents equipped with skills, emphasizing architecture, ac...

arXiv - AI · 4 min ·
[2602.12402] AstRL: Analog and Mixed-Signal Circuit Synthesis with Deep Reinforcement Learning
Machine Learning

[2602.12402] AstRL: Analog and Mixed-Signal Circuit Synthesis with Deep Reinforcement Learning

AstRL introduces a novel method for analog and mixed-signal circuit synthesis using deep reinforcement learning, significantly improving ...

arXiv - Machine Learning · 4 min ·
[2602.12395] What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis
Llms

[2602.12395] What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis

This paper explores the impact of reinforcement learning (RL) on visual reasoning capabilities in vision-language models, proposing a nov...

arXiv - AI · 3 min ·
[2602.12390] Rational Neural Networks have Expressivity Advantages
Machine Learning

[2602.12390] Rational Neural Networks have Expressivity Advantages

The paper explores the advantages of Rational Neural Networks, demonstrating their superior expressivity and parameter efficiency compare...

arXiv - Machine Learning · 3 min ·
[2602.12375] Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning
Machine Learning

[2602.12375] Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning

This paper introduces Value Bonuses with Ensemble Errors (VBE), an innovative algorithm that enhances exploration in reinforcement learni...

arXiv - Machine Learning · 4 min ·
[2602.12342] Intrinsic Credit Assignment for Long Horizon Interaction
Llms

[2602.12342] Intrinsic Credit Assignment for Long Horizon Interaction

This article presents a novel approach, {}Belief-RL, for training agents to navigate uncertainty over long horizons by utilizing intrins...

arXiv - Machine Learning · 3 min ·
[2602.12322] ForeAct: Steering Your VLA with Efficient Visual Foresight Planning
Machine Learning

[2602.12322] ForeAct: Steering Your VLA with Efficient Visual Foresight Planning

The paper presents ForeAct, a novel Visual Foresight Planning framework that enhances Vision-Language-Action (VLA) models by enabling the...

arXiv - AI · 4 min ·
[2602.12315] AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping
Nlp

[2602.12315] AgenticShop: Benchmarking Agentic Product Curation for Personalized Web Shopping

The paper presents AgenticShop, a benchmark for evaluating agentic systems in personalized web shopping, addressing gaps in current evalu...

arXiv - AI · 4 min ·
[2602.12311] Perceptual Self-Reflection in Agentic Physics Simulation Code Generation
Nlp

[2602.12311] Perceptual Self-Reflection in Agentic Physics Simulation Code Generation

This article presents a multi-agent framework for generating physics simulation code from natural language descriptions, introducing a no...

arXiv - AI · 4 min ·
[2602.12305] OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization
Llms

[2602.12305] OptiML: An End-to-End Framework for Program Synthesis and CUDA Kernel Optimization

OptiML is a novel framework that enhances CUDA kernel optimization through program synthesis, leveraging large language models for improv...

arXiv - Machine Learning · 4 min ·
[2602.12296] Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method
Nlp

[2602.12296] Adaptive traffic signal control optimization using a novel road partition and multi-channel state representation method

This article presents a novel adaptive traffic signal control method utilizing Deep Q-Networks and Proximal Policy Optimization to enhanc...

arXiv - AI · 4 min ·
[2602.12287] Retrieval-Augmented Self-Taught Reasoning Model with Adaptive Chain-of-Thought for ASR Named Entity Correction
Llms

[2602.12287] Retrieval-Augmented Self-Taught Reasoning Model with Adaptive Chain-of-Thought for ASR Named Entity Correction

This article presents a novel retrieval-augmented reasoning model designed to enhance named entity correction in automatic speech recogni...

arXiv - AI · 3 min ·
[2602.12285] From Biased Chatbots to Biased Agents: Examining Role Assignment Effects on LLM Agent Robustness
Llms

[2602.12285] From Biased Chatbots to Biased Agents: Examining Role Assignment Effects on LLM Agent Robustness

This article examines how demographic-based persona assignments in large language models (LLMs) can impact agent performance, revealing v...

arXiv - AI · 3 min ·
[2602.13135] Constrained Assumption-Based Argumentation Frameworks
Ai Agents

[2602.13135] Constrained Assumption-Based Argumentation Frameworks

This paper introduces Constrained Assumption-Based Argumentation (CABA), extending traditional Assumption-Based Argumentation frameworks ...

arXiv - AI · 3 min ·
Previous Page 153 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime