AI Agents

Autonomous agents, tool use, and agentic systems

Top This Week

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica
Ai Agents

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica

The viral AI agentic tool let attackers silently gain admin unauthenticated access.

Ars Technica - AI · 5 min ·
Robotics

What happens when you let AI agents run a sitcom 24/7 with zero human involvement

Ran an experiment — gave AI agents full control over writing, character creation, and performing a sitcom. Left it running nonstop for ov...

Reddit - Artificial Intelligence · 1 min ·
Ai Agents

Microsoft's newest open-source project: Runtime security for AI agents

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min ·

All Content

[2510.23587] A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
Llms

[2510.23587] A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

This survey explores the concept of data agents, autonomous systems that manage complex data tasks. It introduces a hierarchical taxonomy...

arXiv - AI · 4 min ·
[2510.24694] Repurposing Synthetic Data for Fine-grained Search Agent Supervision
Llms

[2510.24694] Repurposing Synthetic Data for Fine-grained Search Agent Supervision

The paper presents E-GRPO, a novel framework for training search agents using synthetic data, enhancing their ability to learn from near-...

arXiv - AI · 4 min ·
[2510.22620] Breaking Agent Backbones: Evaluating the Security of Backbone LLMs in AI Agents
Llms

[2510.22620] Breaking Agent Backbones: Evaluating the Security of Backbone LLMs in AI Agents

This article evaluates the security of large language models (LLMs) used in AI agents, introducing a framework for identifying vulnerabil...

arXiv - Machine Learning · 4 min ·
[2510.22500] Towards Scalable Oversight via Partitioned Human Supervision
Machine Learning

[2510.22500] Towards Scalable Oversight via Partitioned Human Supervision

The paper proposes a scalable oversight framework for AI systems using partitioned human supervision, addressing challenges in obtaining ...

arXiv - Machine Learning · 4 min ·
[2509.23115] RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility
Llms

[2509.23115] RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility

The paper presents RHYTHM, a framework utilizing hierarchical temporal tokenization to enhance human mobility predictions by leveraging l...

arXiv - Machine Learning · 4 min ·
[2509.22566] From Parameters to Behaviors: Unsupervised Compression of the Policy Space
Machine Learning

[2509.22566] From Parameters to Behaviors: Unsupervised Compression of the Policy Space

This paper presents an unsupervised method for compressing the policy parameter space in Deep Reinforcement Learning, enhancing sample ef...

arXiv - Machine Learning · 4 min ·
[2509.15796] Monte Carlo Tree Diffusion with Multiple Experts for Protein Design
Llms

[2509.15796] Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

The paper presents MCTD-ME, a novel approach combining Monte Carlo Tree Search and masked diffusion models for efficient protein design, ...

arXiv - Machine Learning · 4 min ·
[2507.03043] K-Function: Joint Pronunciation Transcription and Feedback for Evaluating Kids Language Function
Llms

[2507.03043] K-Function: Joint Pronunciation Transcription and Feedback for Evaluating Kids Language Function

The K-Function framework enhances children's language evaluation by integrating precise phoneme transcription with LLM-driven scoring, im...

arXiv - AI · 4 min ·
[2507.05306] Enjoying Non-linearity in Multinomial Logistic Bandits: A Minimax-Optimal Algorithm
Machine Learning

[2507.05306] Enjoying Non-linearity in Multinomial Logistic Bandits: A Minimax-Optimal Algorithm

This paper presents a minimax-optimal algorithm for the multinomial logistic bandit problem, enhancing existing regret guarantees by leve...

arXiv - Machine Learning · 4 min ·
[2506.14856] Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction
Computer Vision

[2506.14856] Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction

This article presents a novel approach to active view selection (AVS) for 3D reconstruction using neural uncertainty maps, significantly ...

arXiv - AI · 4 min ·
[2506.01666] Synthesis of discrete-continuous quantum circuits with multimodal diffusion models
Machine Learning

[2506.01666] Synthesis of discrete-continuous quantum circuits with multimodal diffusion models

This paper presents a multimodal denoising diffusion model for synthesizing discrete-continuous quantum circuits, improving efficiency in...

arXiv - Machine Learning · 4 min ·
[2505.19698] Performance Asymmetry in Model-Based Reinforcement Learning
Machine Learning

[2505.19698] Performance Asymmetry in Model-Based Reinforcement Learning

The paper explores performance asymmetry in Model-Based Reinforcement Learning (MBRL), highlighting significant disparities in agent perf...

arXiv - Machine Learning · 4 min ·
[2505.11963] MARVEL: Multi-Agent RTL Vulnerability Extraction using Large Language Models
Llms

[2505.11963] MARVEL: Multi-Agent RTL Vulnerability Extraction using Large Language Models

The paper presents MARVEL, a multi-agent framework utilizing Large Language Models for extracting vulnerabilities in RTL hardware designs...

arXiv - AI · 4 min ·
[2505.17645] HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning
Llms

[2505.17645] HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning

HoloLLM introduces a Multimodal Large Language Model that enhances human sensing and reasoning by integrating diverse sensory inputs, out...

arXiv - Machine Learning · 4 min ·
[2504.13647] An Efficient LiDAR-Camera Fusion Network for Multi-Class 3D Dynamic Object Detection and Trajectory Prediction
Computer Vision

[2504.13647] An Efficient LiDAR-Camera Fusion Network for Multi-Class 3D Dynamic Object Detection and Trajectory Prediction

The paper presents a novel LiDAR-camera fusion framework for real-time 3D dynamic object detection and trajectory prediction, enhancing s...

arXiv - AI · 4 min ·
[2502.17457] MoEMba: A Mamba-based Mixture of Experts for High-Density EMG-based Hand Gesture Recognition
Machine Learning

[2502.17457] MoEMba: A Mamba-based Mixture of Experts for High-Density EMG-based Hand Gesture Recognition

The paper presents MoEMba, a novel framework utilizing Mamba-based Mixture of Experts for enhancing high-density EMG-based hand gesture r...

arXiv - Machine Learning · 4 min ·
[2502.01310] A Statistical Learning Perspective on Semi-dual Adversarial Neural Optimal Transport Solvers
Machine Learning

[2502.01310] A Statistical Learning Perspective on Semi-dual Adversarial Neural Optimal Transport Solvers

This article presents a statistical learning perspective on semi-dual adversarial neural optimal transport solvers, addressing theoretica...

arXiv - Machine Learning · 4 min ·
[2406.11935] A Problem-Oriented Perspective and Anchor Verification for Code Optimization
Llms

[2406.11935] A Problem-Oriented Perspective and Anchor Verification for Code Optimization

This paper explores the use of Large Language Models (LLMs) for code optimization, proposing a problem-oriented approach and an anchor ve...

arXiv - AI · 4 min ·
[2602.03022] STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models
Llms

[2602.03022] STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models

The paper presents STAR, a novel framework for transferring capabilities from large language models to super-tiny function calling models...

arXiv - AI · 4 min ·
[2512.03005] From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?
Llms

[2512.03005] From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?

This article explores the potential of large language models (LLMs) to act as mediators in online conflicts, moving beyond moderation to ...

arXiv - AI · 4 min ·
Previous Page 59 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime