AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Robotics

What happens when you let AI agents run a sitcom 24/7 with zero human involvement

Ran an experiment — gave AI agents full control over writing, character creation, and performing a sitcom. Left it running nonstop for ov...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Ai Agents

Microsoft's newest open-source project: Runtime security for AI agents

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

Llms

[2510.16609] Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods

Abstract page for arXiv paper 2510.16609: Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods

arXiv - Machine Learning · 4 min · about 14 hours ago

All Content

Machine Learning

[2602.20530] Memory-guided Prototypical Co-occurrence Learning for Mixed Emotion Recognition

The paper presents a novel framework, Memory-guided Prototypical Co-occurrence Learning (MPCL), aimed at improving mixed emotion recognit...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.20461] Nonparametric Teaching of Attention Learners

This article presents a novel teaching paradigm called Attention Neural Teaching (AtteNT) that enhances the efficiency of attention learn...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.20404] $κ$-Explorer: A Unified Framework for Active Model Estimation in MDPs

$κ$-Explorer presents a novel framework for active model estimation in Markov decision processes (MDPs), focusing on optimizing explorati...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.20399] GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training

GeoPT introduces a novel approach to scaling physics simulations by utilizing lifted geometric pre-training, enhancing model efficiency a...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.20370] Quantitative Approximation Rates for Group Equivariant Learning

This paper explores quantitative approximation rates for group equivariant learning, demonstrating that equivariant architectures maintai...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.20175] Tensor Network Generator-Enhanced Optimization for Traveling Salesman Problem

This article presents a novel optimization framework using tensor networks to tackle the Traveling Salesman Problem (TSP), demonstrating ...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.10693] VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

The paper introduces VESPO, a novel approach for stable off-policy training of large language models (LLMs) that addresses training stabi...

arXiv - Machine Learning · 3 min · about 1 month ago

Ai Safety

[2602.09050] SAS-Net: Scene-Appearance Separation Network for Robust Spatiotemporal Registration in Bidirectional Photoacoustic Microscopy

The paper introduces SAS-Net, a novel framework for robust spatiotemporal registration in bidirectional photoacoustic microscopy, address...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.09082] UI-Venus-1.5 Technical Report

The UI-Venus-1.5 Technical Report presents advancements in GUI agents, detailing a unified model that enhances task performance across va...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.07906] AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

The paper presents AceGRPO, a novel approach for enhancing autonomous machine learning engineering through adaptive curriculum and group ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.07729] Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs

This paper explores the effectiveness of the SGD optimizer in reinforcement learning for large language models, challenging the dominance...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2601.19001] FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning

The paper presents FROST, an innovative method that utilizes attention mechanisms to filter out reasoning outliers, enhancing the efficie...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2601.11675] Generating metamers of human scene understanding

This article presents MetamerGen, a novel tool that generates metamers of human scene understanding by combining low-resolution gist info...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2601.09768] CLiMB: A Domain-Informed Novelty Detection Clustering Framework for Galactic Archaeology and Scientific Discovery

The paper presents CLiMB, a novel framework for novelty detection in galactic archaeology, enhancing clustering methods to identify unkno...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2601.09708] Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

The paper presents Fast-ThinkAct, a novel framework for efficient Vision-Language-Action reasoning that reduces inference latency while m...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2601.03868] What Matters For Safety Alignment?

This paper investigates safety alignment in large language models (LLMs) and large reasoning models (LRMs), identifying key factors that ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2601.01874] CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving

CogFlow introduces a novel framework for visual mathematical problem solving, enhancing perception and reasoning through knowledge intern...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2512.24787] HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment

The paper presents HiGR, a novel framework for generative slate recommendation that enhances efficiency and user preference alignment thr...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2511.02565] A Cognitive Process-Inspired Architecture for Subject-Agnostic Brain Visual Decoding

The paper presents VCFlow, a novel architecture for subject-agnostic brain visual decoding, enhancing the reconstruction of visual experi...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.23587] A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

This survey explores the concept of data agents, autonomous systems that manage complex data tasks. It introduces a hierarchical taxonomy...

arXiv - AI · 4 min · about 1 month ago

Previous Page 58 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

What happens when you let AI agents run a sitcom 24/7 with zero human involvement

Microsoft's newest open-source project: Runtime security for AI agents

[2510.16609] Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods

All Content

[2602.20530] Memory-guided Prototypical Co-occurrence Learning for Mixed Emotion Recognition

[2602.20461] Nonparametric Teaching of Attention Learners

[2602.20404] $κ$-Explorer: A Unified Framework for Active Model Estimation in MDPs

[2602.20399] GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training

[2602.20370] Quantitative Approximation Rates for Group Equivariant Learning

[2602.20175] Tensor Network Generator-Enhanced Optimization for Traveling Salesman Problem

[2602.10693] VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

[2602.09050] SAS-Net: Scene-Appearance Separation Network for Robust Spatiotemporal Registration in Bidirectional Photoacoustic Microscopy

[2602.09082] UI-Venus-1.5 Technical Report

[2602.07906] AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

[2602.07729] Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs

[2601.19001] FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning

[2601.11675] Generating metamers of human scene understanding

[2601.09768] CLiMB: A Domain-Informed Novelty Detection Clustering Framework for Galactic Archaeology and Scientific Discovery

[2601.09708] Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

[2601.03868] What Matters For Safety Alignment?

[2601.01874] CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving

[2512.24787] HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment

[2511.02565] A Cognitive Process-Inspired Architecture for Subject-Agnostic Brain Visual Decoding

[2510.23587] A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

Related Topics

Stay updated with AI News