AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

Robotics

What happens when you let AI agents run a sitcom 24/7 with zero human involvement

Ran an experiment — gave AI agents full control over writing, character creation, and performing a sitcom. Left it running nonstop for ov...

Reddit - Artificial Intelligence · 1 min ·
Ai Agents

Microsoft's newest open-source project: Runtime security for AI agents

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
[2510.16609] Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods
Llms

[2510.16609] Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods

Abstract page for arXiv paper 2510.16609: Prior Knowledge Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods

arXiv - Machine Learning · 4 min ·

All Content

[2602.20530] Memory-guided Prototypical Co-occurrence Learning for Mixed Emotion Recognition
Machine Learning

[2602.20530] Memory-guided Prototypical Co-occurrence Learning for Mixed Emotion Recognition

The paper presents a novel framework, Memory-guided Prototypical Co-occurrence Learning (MPCL), aimed at improving mixed emotion recognit...

arXiv - Machine Learning · 4 min ·
[2602.20461] Nonparametric Teaching of Attention Learners
Machine Learning

[2602.20461] Nonparametric Teaching of Attention Learners

This article presents a novel teaching paradigm called Attention Neural Teaching (AtteNT) that enhances the efficiency of attention learn...

arXiv - Machine Learning · 4 min ·
[2602.20404] $κ$-Explorer: A Unified Framework for Active Model Estimation in MDPs
Machine Learning

[2602.20404] $κ$-Explorer: A Unified Framework for Active Model Estimation in MDPs

$κ$-Explorer presents a novel framework for active model estimation in Markov decision processes (MDPs), focusing on optimizing explorati...

arXiv - Machine Learning · 3 min ·
[2602.20399] GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training
Machine Learning

[2602.20399] GeoPT: Scaling Physics Simulation via Lifted Geometric Pre-Training

GeoPT introduces a novel approach to scaling physics simulations by utilizing lifted geometric pre-training, enhancing model efficiency a...

arXiv - Machine Learning · 3 min ·
[2602.20370] Quantitative Approximation Rates for Group Equivariant Learning
Machine Learning

[2602.20370] Quantitative Approximation Rates for Group Equivariant Learning

This paper explores quantitative approximation rates for group equivariant learning, demonstrating that equivariant architectures maintai...

arXiv - Machine Learning · 4 min ·
[2602.20175] Tensor Network Generator-Enhanced Optimization for Traveling Salesman Problem
Machine Learning

[2602.20175] Tensor Network Generator-Enhanced Optimization for Traveling Salesman Problem

This article presents a novel optimization framework using tensor networks to tackle the Traveling Salesman Problem (TSP), demonstrating ...

arXiv - Machine Learning · 3 min ·
[2602.10693] VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
Llms

[2602.10693] VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

The paper introduces VESPO, a novel approach for stable off-policy training of large language models (LLMs) that addresses training stabi...

arXiv - Machine Learning · 3 min ·
[2602.09050] SAS-Net: Scene-Appearance Separation Network for Robust Spatiotemporal Registration in Bidirectional Photoacoustic Microscopy
Ai Safety

[2602.09050] SAS-Net: Scene-Appearance Separation Network for Robust Spatiotemporal Registration in Bidirectional Photoacoustic Microscopy

The paper introduces SAS-Net, a novel framework for robust spatiotemporal registration in bidirectional photoacoustic microscopy, address...

arXiv - AI · 4 min ·
[2602.09082] UI-Venus-1.5 Technical Report
Machine Learning

[2602.09082] UI-Venus-1.5 Technical Report

The UI-Venus-1.5 Technical Report presents advancements in GUI agents, detailing a unified model that enhances task performance across va...

arXiv - Machine Learning · 4 min ·
[2602.07906] AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering
Llms

[2602.07906] AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering

The paper presents AceGRPO, a novel approach for enhancing autonomous machine learning engineering through adaptive curriculum and group ...

arXiv - Machine Learning · 4 min ·
[2602.07729] Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs
Llms

[2602.07729] Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs

This paper explores the effectiveness of the SGD optimizer in reinforcement learning for large language models, challenging the dominance...

arXiv - Machine Learning · 4 min ·
[2601.19001] FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning
Machine Learning

[2601.19001] FROST: Filtering Reasoning Outliers with Attention for Efficient Reasoning

The paper presents FROST, an innovative method that utilizes attention mechanisms to filter out reasoning outliers, enhancing the efficie...

arXiv - Machine Learning · 3 min ·
[2601.11675] Generating metamers of human scene understanding
Machine Learning

[2601.11675] Generating metamers of human scene understanding

This article presents MetamerGen, a novel tool that generates metamers of human scene understanding by combining low-resolution gist info...

arXiv - AI · 4 min ·
[2601.09768] CLiMB: A Domain-Informed Novelty Detection Clustering Framework for Galactic Archaeology and Scientific Discovery
Machine Learning

[2601.09768] CLiMB: A Domain-Informed Novelty Detection Clustering Framework for Galactic Archaeology and Scientific Discovery

The paper presents CLiMB, a novel framework for novelty detection in galactic archaeology, enhancing clustering methods to identify unkno...

arXiv - AI · 4 min ·
[2601.09708] Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
Machine Learning

[2601.09708] Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

The paper presents Fast-ThinkAct, a novel framework for efficient Vision-Language-Action reasoning that reduces inference latency while m...

arXiv - Machine Learning · 3 min ·
[2601.03868] What Matters For Safety Alignment?
Llms

[2601.03868] What Matters For Safety Alignment?

This paper investigates safety alignment in large language models (LLMs) and large reasoning models (LRMs), identifying key factors that ...

arXiv - AI · 4 min ·
[2601.01874] CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving
Llms

[2601.01874] CogFlow: Bridging Perception and Reasoning through Knowledge Internalization for Visual Mathematical Problem Solving

CogFlow introduces a novel framework for visual mathematical problem solving, enhancing perception and reasoning through knowledge intern...

arXiv - AI · 4 min ·
[2512.24787] HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment
Machine Learning

[2512.24787] HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment

The paper presents HiGR, a novel framework for generative slate recommendation that enhances efficiency and user preference alignment thr...

arXiv - AI · 4 min ·
[2511.02565] A Cognitive Process-Inspired Architecture for Subject-Agnostic Brain Visual Decoding
Machine Learning

[2511.02565] A Cognitive Process-Inspired Architecture for Subject-Agnostic Brain Visual Decoding

The paper presents VCFlow, a novel architecture for subject-agnostic brain visual decoding, enhancing the reconstruction of visual experi...

arXiv - AI · 4 min ·
[2510.23587] A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
Llms

[2510.23587] A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

This survey explores the concept of data agents, autonomous systems that manage complex data tasks. It introduces a hierarchical taxonomy...

arXiv - AI · 4 min ·
Previous Page 58 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime