AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Llms

persistent memory system for AI agents — single SQLite file, no external server, no API keys. free and opensource - BrainCTL

Every agent I build forgets everything between sessions. I got tired of it and built brainctl. pip install brainctl, then: from agentmemo...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Why does Multi-Agent RL fail to act like a real society in Spatial Game Theory? [P] [R]

Hey everyone, I’m building a project for my university Machine Learning course called "Social network analysis using iterated game theory...

Reddit - Machine Learning · 1 min ·
AWS turns its S3 storage service into a file system for AI agents
Nlp

AWS turns its S3 storage service into a file system for AI agents

AI News - General ·

All Content

[2505.21723] Are Statistical Methods Obsolete in the Era of Deep Learning? A Study of ODE Inverse Problems
Machine Learning

[2505.21723] Are Statistical Methods Obsolete in the Era of Deep Learning? A Study of ODE Inverse Problems

This article examines the relevance of statistical methods in the age of deep learning, using ordinary differential equation (ODE) invers...

arXiv - Machine Learning · 4 min ·
[2509.25275] VoiceBridge: General Speech Restoration with One-step Latent Bridge Models
Machine Learning

[2509.25275] VoiceBridge: General Speech Restoration with One-step Latent Bridge Models

VoiceBridge introduces a novel one-step latent bridge model for general speech restoration, enhancing audio quality from various distorti...

arXiv - AI · 4 min ·
[2509.18008] Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration
Llms

[2509.18008] Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration

This article presents a configurable research platform aimed at enhancing human-agent collaboration, exploring the dynamics of human-huma...

arXiv - AI · 4 min ·
[2509.13550] Complexity Bounds for Smooth Multiobjective Optimization
Machine Learning

[2509.13550] Complexity Bounds for Smooth Multiobjective Optimization

This paper investigates the oracle complexity of finding ε-Pareto stationary points in smooth multiobjective optimization, presenting new...

arXiv - AI · 3 min ·
[2509.12456] Reinforcement Learning-Based Market Making as a Stochastic Control on Non-Stationary Limit Order Book Dynamics
Machine Learning

[2509.12456] Reinforcement Learning-Based Market Making as a Stochastic Control on Non-Stationary Limit Order Book Dynamics

This paper explores the use of reinforcement learning for market making in non-stationary limit order book dynamics, presenting a practic...

arXiv - AI · 4 min ·
[2509.05311] Large Language Model Integration with Reinforcement Learning to Augment Decision-Making in Autonomous Cyber Operations
Llms

[2509.05311] Large Language Model Integration with Reinforcement Learning to Augment Decision-Making in Autonomous Cyber Operations

This article explores the integration of Large Language Models (LLMs) with Reinforcement Learning (RL) to enhance decision-making in auto...

arXiv - Machine Learning · 3 min ·
[2410.22009] On uniqueness in structured model learning
Machine Learning

[2410.22009] On uniqueness in structured model learning

This paper explores the uniqueness in structured model learning for systems of partial differential equations (PDEs), proposing a framewo...

arXiv - Machine Learning · 4 min ·
[2508.19278] Towards Production-Worthy Simulation for Autonomous Cyber Operations
Machine Learning

[2508.19278] Towards Production-Worthy Simulation for Autonomous Cyber Operations

This article presents a framework for enhancing simulation environments in Autonomous Cyber Operations (ACO) by implementing new actions ...

arXiv - Machine Learning · 3 min ·
[2508.18210] Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation
Generative Ai

[2508.18210] Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation

This article presents a diagnostic framework for evaluating synthetic dialogue generation in contact centers, highlighting the limitation...

arXiv - AI · 4 min ·
[2508.07514] Robust MultiSpecies Agricultural Segmentation Across Devices, Seasons, and Sensors Using Hierarchical DINOv2 Models
Machine Learning

[2508.07514] Robust MultiSpecies Agricultural Segmentation Across Devices, Seasons, and Sensors Using Hierarchical DINOv2 Models

This article presents a robust segmentation framework using Hierarchical DINOv2 models for reliable plant species and damage identificati...

arXiv - AI · 4 min ·
[2507.19457] GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
Llms

[2507.19457] GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

The paper introduces GEPA, a novel prompt optimizer that leverages natural language reflection to enhance learning efficiency in large la...

arXiv - Machine Learning · 4 min ·
[2507.16713] A Pragmatist Robot: Learning to Plan Tasks by Experiencing the Real World
Llms

[2507.16713] A Pragmatist Robot: Learning to Plan Tasks by Experiencing the Real World

The paper presents PragmaBot, a framework for robotic task planning that utilizes real-world experiences and self-reflection to enhance l...

arXiv - AI · 4 min ·
[2506.20430] An Agentic System for Rare Disease Diagnosis with Traceable Reasoning
Ai Agents

[2506.20430] An Agentic System for Rare Disease Diagnosis with Traceable Reasoning

The paper presents DeepRare, a multi-agent system utilizing large language models for the differential diagnosis of rare diseases, demons...

arXiv - AI · 4 min ·
[2506.08672] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
Machine Learning

[2506.08672] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

RuleReasoner introduces a novel approach to rule-based reasoning using domain-aware dynamic sampling, enhancing reinforcement learning fo...

arXiv - Machine Learning · 3 min ·
[2602.06130] Self-Improving World Modelling with Latent Actions
Llms

[2602.06130] Self-Improving World Modelling with Latent Actions

The paper presents SWIRL, a framework for self-improving world modeling in machine learning, focusing on latent actions to enhance predic...

arXiv - AI · 4 min ·
[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective
Machine Learning

[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective

This paper introduces Accelerated Sequential Flow Matching, a Bayesian filtering framework that enhances real-time inference in stochasti...

arXiv - Machine Learning · 4 min ·
[2602.04942] Privileged Information Distillation for Language Models
Llms

[2602.04942] Privileged Information Distillation for Language Models

This paper presents methods for distilling privileged information in language models, focusing on improving performance in multi-turn env...

arXiv - AI · 4 min ·
[2602.03901] NeuroPareto: Calibrated Acquisition for Costly Many-Goal Search in Vast Parameter Spaces
Machine Learning

[2602.03901] NeuroPareto: Calibrated Acquisition for Costly Many-Goal Search in Vast Parameter Spaces

NeuroPareto introduces a novel architecture for optimizing multi-objective problems in high-dimensional spaces, leveraging Bayesian class...

arXiv - Machine Learning · 4 min ·
[2506.00494] Multi-Objective Neural Network-Assisted Design Optimization of Soft Fin-Ray Fingers for Enhanced Grasping Performance
Machine Learning

[2506.00494] Multi-Objective Neural Network-Assisted Design Optimization of Soft Fin-Ray Fingers for Enhanced Grasping Performance

This article presents a multi-objective optimization approach using neural networks to enhance the design of soft Fin-Ray fingers for imp...

arXiv - AI · 4 min ·
[2602.03195] Reinforcement Learning with Promising Tokens for Large Language Models
Llms

[2602.03195] Reinforcement Learning with Promising Tokens for Large Language Models

This article presents a novel framework called Reinforcement Learning with Promising Tokens (RLPT) designed to optimize large language mo...

arXiv - AI · 4 min ·
Previous Page 130 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime