AI Agents

Autonomous agents, tool use, and agentic systems

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Ai Agents

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica

The viral AI agentic tool let attackers silently gain admin unauthenticated access.

Ars Technica - AI · 5 min · 38 minutes ago

Robotics

What happens when you let AI agents run a sitcom 24/7 with zero human involvement

Ran an experiment — gave AI agents full control over writing, character creation, and performing a sitcom. Left it running nonstop for ov...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Ai Agents

Microsoft's newest open-source project: Runtime security for AI agents

submitted by /u/Fcking_Chuck [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 9 hours ago

All Content

Llms

[2510.23587] A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

This survey explores the concept of data agents, autonomous systems that manage complex data tasks. It introduces a hierarchical taxonomy...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.24694] Repurposing Synthetic Data for Fine-grained Search Agent Supervision

The paper presents E-GRPO, a novel framework for training search agents using synthetic data, enhancing their ability to learn from near-...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.22620] Breaking Agent Backbones: Evaluating the Security of Backbone LLMs in AI Agents

This article evaluates the security of large language models (LLMs) used in AI agents, introducing a framework for identifying vulnerabil...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2510.22500] Towards Scalable Oversight via Partitioned Human Supervision

The paper proposes a scalable oversight framework for AI systems using partitioned human supervision, addressing challenges in obtaining ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.23115] RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility

The paper presents RHYTHM, a framework utilizing hierarchical temporal tokenization to enhance human mobility predictions by leveraging l...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2509.22566] From Parameters to Behaviors: Unsupervised Compression of the Policy Space

This paper presents an unsupervised method for compressing the policy parameter space in Deep Reinforcement Learning, enhancing sample ef...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.15796] Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

The paper presents MCTD-ME, a novel approach combining Monte Carlo Tree Search and masked diffusion models for efficient protein design, ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2507.03043] K-Function: Joint Pronunciation Transcription and Feedback for Evaluating Kids Language Function

The K-Function framework enhances children's language evaluation by integrating precise phoneme transcription with LLM-driven scoring, im...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2507.05306] Enjoying Non-linearity in Multinomial Logistic Bandits: A Minimax-Optimal Algorithm

This paper presents a minimax-optimal algorithm for the multinomial logistic bandit problem, enhancing existing regret guarantees by leve...

arXiv - Machine Learning · 4 min · about 1 month ago

Computer Vision

[2506.14856] Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction

This article presents a novel approach to active view selection (AVS) for 3D reconstruction using neural uncertainty maps, significantly ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2506.01666] Synthesis of discrete-continuous quantum circuits with multimodal diffusion models

This paper presents a multimodal denoising diffusion model for synthesizing discrete-continuous quantum circuits, improving efficiency in...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2505.19698] Performance Asymmetry in Model-Based Reinforcement Learning

The paper explores performance asymmetry in Model-Based Reinforcement Learning (MBRL), highlighting significant disparities in agent perf...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2505.11963] MARVEL: Multi-Agent RTL Vulnerability Extraction using Large Language Models

The paper presents MARVEL, a multi-agent framework utilizing Large Language Models for extracting vulnerabilities in RTL hardware designs...

arXiv - AI · 4 min · about 1 month ago

Llms

[2505.17645] HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning

HoloLLM introduces a Multimodal Large Language Model that enhances human sensing and reasoning by integrating diverse sensory inputs, out...

arXiv - Machine Learning · 4 min · about 1 month ago

Computer Vision

[2504.13647] An Efficient LiDAR-Camera Fusion Network for Multi-Class 3D Dynamic Object Detection and Trajectory Prediction

The paper presents a novel LiDAR-camera fusion framework for real-time 3D dynamic object detection and trajectory prediction, enhancing s...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2502.17457] MoEMba: A Mamba-based Mixture of Experts for High-Density EMG-based Hand Gesture Recognition

The paper presents MoEMba, a novel framework utilizing Mamba-based Mixture of Experts for enhancing high-density EMG-based hand gesture r...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2502.01310] A Statistical Learning Perspective on Semi-dual Adversarial Neural Optimal Transport Solvers

This article presents a statistical learning perspective on semi-dual adversarial neural optimal transport solvers, addressing theoretica...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2406.11935] A Problem-Oriented Perspective and Anchor Verification for Code Optimization

This paper explores the use of Large Language Models (LLMs) for code optimization, proposing a problem-oriented approach and an anchor ve...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.03022] STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models

The paper presents STAR, a novel framework for transferring capabilities from large language models to super-tiny function calling models...

arXiv - AI · 4 min · about 1 month ago

Llms

[2512.03005] From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?

This article explores the potential of large language models (LLMs) to act as mediators in online conflicts, moving beyond moderation to ...

arXiv - AI · 4 min · about 1 month ago

Previous Page 59 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Agents

Top This Week

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica

What happens when you let AI agents run a sitcom 24/7 with zero human involvement

Microsoft's newest open-source project: Runtime security for AI agents

All Content

[2510.23587] A Survey of Data Agents: Emerging Paradigm or Overstated Hype?

[2510.24694] Repurposing Synthetic Data for Fine-grained Search Agent Supervision

[2510.22620] Breaking Agent Backbones: Evaluating the Security of Backbone LLMs in AI Agents

[2510.22500] Towards Scalable Oversight via Partitioned Human Supervision

[2509.23115] RHYTHM: Reasoning with Hierarchical Temporal Tokenization for Human Mobility

[2509.22566] From Parameters to Behaviors: Unsupervised Compression of the Policy Space

[2509.15796] Monte Carlo Tree Diffusion with Multiple Experts for Protein Design

[2507.03043] K-Function: Joint Pronunciation Transcription and Feedback for Evaluating Kids Language Function

[2507.05306] Enjoying Non-linearity in Multinomial Logistic Bandits: A Minimax-Optimal Algorithm

[2506.14856] Peering into the Unknown: Active View Selection with Neural Uncertainty Maps for 3D Reconstruction

[2506.01666] Synthesis of discrete-continuous quantum circuits with multimodal diffusion models

[2505.19698] Performance Asymmetry in Model-Based Reinforcement Learning

[2505.11963] MARVEL: Multi-Agent RTL Vulnerability Extraction using Large Language Models

[2505.17645] HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning

[2504.13647] An Efficient LiDAR-Camera Fusion Network for Multi-Class 3D Dynamic Object Detection and Trajectory Prediction

[2502.17457] MoEMba: A Mamba-based Mixture of Experts for High-Density EMG-based Hand Gesture Recognition

[2502.01310] A Statistical Learning Perspective on Semi-dual Adversarial Neural Optimal Transport Solvers

[2406.11935] A Problem-Oriented Perspective and Anchor Verification for Code Optimization

[2602.03022] STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models

[2512.03005] From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?

Related Topics

Stay updated with AI News