AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Machine Learning

[P] Run Karpathy's Autoresearch for $0.44 instead of $24 — Open-source parallel evolution pipeline on SageMaker Spot

TL;DR: I built an open-source pipeline that runs Karpathy's autoresearch on SageMaker Spot instances — 25 autonomous ML experiments for $...

Reddit - Machine Learning · 1 min ·
ProCap Financial Acquires AI Agent Lab
AI Agents

ProCap Financial, a leading financial services firm, has successfully acquired AI Agent Lab, a pioneering artificial intelligence company...

AI News - General · 4 min ·
When Agentic AI Browsers Outrun Governance
AI Safety

Agentic AI browsers introduce new enterprise risk. Learn how AI governance helps leaders assess exposure, oversight gaps, and safe adopti...

AI Tools & Products · 14 min ·

All Content

[2602.22609] EvolveGen: Algorithmic Level Hardware Model Checking Benchmark Generation through Reinforcement Learning
Machine Learning

EvolveGen introduces a novel framework for generating hardware model checking benchmarks using reinforcement learning, addressing the ben...

arXiv - Machine Learning · 4 min ·
[2602.22549] DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation
Machine Learning

DrivePTS introduces a progressive learning framework for generating diverse driving scenes, enhancing fidelity and controllability in aut...

arXiv - AI · 4 min ·
[2602.22576] Search-P1: Path-Centric Reward Shaping for Stable and Efficient Agentic RAG Training
LLMs

The paper presents Search-P1, a framework for enhancing Retrieval-Augmented Generation (RAG) training through path-centric reward shaping...

arXiv - Machine Learning · 3 min ·
[2602.22529] Generative Agents Navigating Digital Libraries
LLMs

The paper introduces Agent4DL, a simulator for user search behavior in digital libraries, leveraging large language models to generate re...

arXiv - AI · 3 min ·
[2602.22533] A Synergistic Approach: Dynamics-AI Ensemble in Tropical Cyclone Forecasting
Machine Learning

This article presents a novel AI-driven ensemble forecasting system for tropical cyclones, optimizing computational efficiency while main...

arXiv - Machine Learning · 3 min ·
[2602.22514] SignVLA: A Gloss-Free Vision-Language-Action Framework for Real-Time Sign Language-Guided Robotic Manipulation
Robotics

The paper presents SignVLA, a novel gloss-free Vision-Language-Action framework for real-time robotic manipulation guided by sign languag...

arXiv - AI · 4 min ·
[2602.22481] Sydney Telling Fables on AI and Humans: A Corpus Tracing Memetic Transfer of Persona between LLMs
LLMs

This article explores the relationship between AI and humans through the lens of large language models (LLMs), focusing on the Sydney per...

arXiv - AI · 4 min ·
[2602.22474] When to Act, Ask, or Learn: Uncertainty-Aware Policy Steering
LLMs

This article presents a framework for uncertainty-aware policy steering in robotics, enabling adaptive robot behavior by addressing task ...

arXiv - Machine Learning · 4 min ·
[2602.22456] Automating the Detection of Requirement Dependencies Using Large Language Models
LLMs

This article presents LEREDD, a novel approach utilizing Large Language Models to automate the detection of requirement dependencies in s...

arXiv - AI · 4 min ·
[2602.22469] Beyond Dominant Patches: Spatial Credit Redistribution For Grounded Vision-Language Models
LLMs

This paper introduces Spatial Credit Redistribution (SCR) to address hallucinations in vision-language models by redistributing activatio...

arXiv - AI · 4 min ·
[2602.22450] Silent Egress: When Implicit Prompt Injection Makes LLM Agents Leak Without a Trace
LLMs

The paper discusses the security risks posed by implicit prompt injection in large language model (LLM) agents, demonstrating how adversa...

arXiv - AI · 4 min ·
[2602.22431] mmWave Radar Aware Dual-Conditioned GAN for Speech Reconstruction of Signals With Low SNR
Machine Learning

This article presents a novel approach using a Dual-Conditioned Generative Adversarial Network (GAN) for reconstructing speech signals ca...

arXiv - Machine Learning · 3 min ·
[2602.22430] TopoEdit: Fast Post-Optimization Editing of Topology Optimized Structures
Machine Learning

TopoEdit presents a novel approach for fast post-optimization editing of topology optimized structures, enhancing mechanical performance ...

arXiv - Machine Learning · 4 min ·
[2602.22426] SimpleOCR: Rendering Visualized Questions to Teach MLLMs to Read
LLMs

The paper introduces SimpleOCR, a method to enhance Multimodal Large Language Models (MLLMs) by rendering visualized questions, addressin...

arXiv - Machine Learning · 4 min ·
[2602.22402] Contextual Memory Virtualisation: DAG-Based State Management and Structurally Lossless Trimming for LLM Agents
LLMs

The paper presents Contextual Memory Virtualisation (CMV), a novel system for managing state in large language models (LLMs) using a Dire...

arXiv - AI · 4 min ·
[2602.22368] EyeLayer: Integrating Human Attention Patterns into LLM-Based Code Summarization
LLMs

The paper presents EyeLayer, a novel module that integrates human attention patterns into LLM-based code summarization, enhancing model p...

arXiv - AI · 4 min ·
[2602.22241] Stochastic Neural Networks for Quantum Devices
Machine Learning

This paper explores the integration of stochastic neural networks into quantum devices, presenting a novel approach to optimize these net...

arXiv - Machine Learning · 3 min ·
[2602.22226] SEGB: Self-Evolved Generative Bidding with Local Autoregressive Diffusion
Generative AI

The paper presents Self-Evolved Generative Bidding (SEGB), a novel framework for automated online advertising that enhances bidding strat...

arXiv - Machine Learning · 3 min ·
[2602.21988] Solving stiff dark matter equations via Jacobian Normalization with Physics-Informed Neural Networks
Machine Learning

This article presents a novel method for solving stiff dark matter equations using Jacobian Normalization within Physics-Informed Neural ...

arXiv - Machine Learning · 3 min ·
[2602.23360] Model Agreement via Anchoring
Machine Learning

The paper presents a method for reducing model disagreement in machine learning by using an anchoring technique, demonstrating its effect...

arXiv - AI · 4 min ·