Robotics & Embodied AI

Physical AI, robots, and autonomous systems

Top This Week

Robotics

[D] Awesome AI Agent Incidents - A curated list of incidents, attack vectors, failure modes, and defensive tools for autonomous AI agents.

https://github.com/h5i-dev/awesome-ai-agent-incidents (submitted by /u/Living_Impression_37)

Reddit - Machine Learning · 1 min ·

LLMs

An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

https://shapingrooms.com/research I published a paper today on something I've been calling postural manipulation. The short version: ordi...

Reddit - Artificial Intelligence · 1 min ·

LLMs

[R] An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

https://shapingrooms.com/research I've been documenting what I'm calling postural manipulation: a specific class of language that install...

Reddit - Machine Learning · 1 min ·

All Content

[2602.19313] TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics
Machine Learning

The paper introduces TOPReward, a novel method leveraging token probabilities from Vision-Language Models to enhance reinforcement learni...

arXiv - Machine Learning · 4 min ·
[2602.19304] Safe and Interpretable Multimodal Path Planning for Multi-Agent Cooperation
AI Agents

The paper presents CaPE, a multimodal path planning method that enhances cooperation among decentralized agents through language communic...

arXiv - AI · 4 min ·
[2602.18813] Habilis-$β$: A Fast-Motion and Long-Lasting On-Device Vision-Language-Action Model
Machine Learning

Habilis-$β$ is a new on-device vision-language-action model that excels in fast-motion tasks, demonstrating superior performance in real-...

arXiv - Machine Learning · 4 min ·
[2602.19193] Visual Prompt Guided Unified Pushing Policy
Robotics

The paper presents a novel unified pushing policy that utilizes visual prompts to enhance the efficiency and versatility of robotic pushi...

arXiv - AI · 3 min ·
[2602.18663] Toward AI Autonomous Navigation for Mechanical Thrombectomy using Hierarchical Modular Multi-agent Reinforcement Learning (HM-MARL)
Robotics

This article presents a novel Hierarchical Modular Multi-Agent Reinforcement Learning (HM-MARL) framework aimed at enhancing autonomous n...

arXiv - Machine Learning · 4 min ·
[2602.18603] Enhancing Goal Inference via Correction Timing
Machine Learning

This article explores how the timing of human corrections can enhance robot learning by providing insights into task objectives and impro...

arXiv - Machine Learning · 4 min ·
[2602.18489] DCInject: Persistent Backdoor Attacks via Frequency Manipulation in Personal Federated Learning
Machine Learning

The paper presents DCInject, a novel backdoor attack method targeting personalized federated learning (PFL) systems, demonstrating high a...

arXiv - Machine Learning · 3 min ·
[2602.18920] DeepInnovator: Triggering the Innovative Capabilities of LLMs
LLMs

DeepInnovator proposes a novel training framework to enhance the innovative capabilities of Large Language Models (LLMs) for scientific r...

arXiv - AI · 4 min ·
[2602.18850] When the Inference Meets the Explicitness or Why Multimodality Can Make Us Forget About the Perfect Predictor
Machine Learning

This paper explores the effectiveness of multimodal communication systems in human-robot collaboration, analyzing how explicit communicat...

arXiv - AI · 4 min ·
[2602.18832] OpenClaw AI Agents as Informal Learners at Moltbook: Characterizing an Emergent Learning Community at Scale
Robotics

This article presents an empirical study of Moltbook, a large-scale informal learning community composed entirely of AI agents, highlight...

arXiv - AI · 4 min ·
[2602.19917] Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning
Machine Learning

This paper presents an Uncertainty-Aware Rank-One MIMO Q Network framework designed to enhance offline reinforcement learning by effectiv...

arXiv - Machine Learning · 4 min ·
[2602.18742] RoboCurate: Harnessing Diversity with Action-Verified Neural Trajectory for Robot Learning
LLMs

The paper presents RoboCurate, a framework for generating synthetic robot data that enhances action quality through simulation replay and...

arXiv - AI · 4 min ·
[2602.18716] Temporal Action Representation Learning for Tactical Resource Control and Subsequent Maneuver Generation
Robotics

This paper presents TART, a Temporal Action Representation learning framework designed for tactical resource control and maneuver generat...

arXiv - AI · 4 min ·
[2602.19634] Compositional Planning with Jumpy World Models
Machine Learning

This paper presents a novel approach to compositional planning using jumpy world models, enhancing long-horizon predictive accuracy and i...

arXiv - AI · 4 min ·
[2602.18460] The Doctor Will (Still) See You Now: On the Structural Limits of Agentic AI in Healthcare
Robotics

This article examines the limitations of agentic AI in healthcare, highlighting the gap between commercial promises and operational reali...

arXiv - AI · 4 min ·
[2602.18458] The Story is Not the Science: Execution-Grounded Evaluation of Mechanistic Interpretability Research
Robotics

The article presents a novel evaluation framework for mechanistic interpretability research, utilizing AI agents to enhance research rigo...

arXiv - Machine Learning · 3 min ·
[2602.18456] Beyond single-channel agentic benchmarking
Robotics

This paper critiques the current single-channel benchmarking of AI safety, advocating for a more holistic approach that considers the int...

arXiv - AI · 3 min ·
[2602.18296] Context-Aware Mapping of 2D Drawing Annotations to 3D CAD Features Using LLM-Assisted Reasoning for Manufacturing Automation
LLMs

This article presents a framework for mapping 2D drawing annotations to 3D CAD features using context-aware reasoning, enhancing manufact...

arXiv - AI · 4 min ·
[2602.20117] ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models
LLMs

The paper presents ReSyn, a novel pipeline for autonomously generating diverse synthetic environments for training reasoning language mod...

arXiv - Machine Learning · 3 min ·
[2602.20059] Interaction Theater: A case of LLM Agents Interacting at Scale
LLMs

The paper explores the interactions of autonomous LLM agents on a social platform, revealing that while agents produce varied text, meani...

arXiv - AI · 4 min ·