Robotics & Embodied AI

Physical AI, robots, and autonomous systems

Top This Week

[2601.07855] RoAD Benchmark: How LiDAR Models Fail under Coupled Domain Shifts and Label Evolution
Machine Learning

arXiv - AI · 3 min ·
[2502.00262] INSIGHT: Enhancing Autonomous Driving Safety through Vision-Language Models on Context-Aware Hazard Detection and Edge Case Evaluation
LLMs

arXiv - AI · 4 min ·
[2508.00500] ProbGuard: Probabilistic Runtime Monitoring for LLM Agent Safety
LLMs

arXiv - AI · 4 min ·

All Content

[2510.10932] DropVLA: An Action-Level Backdoor Attack on Vision-Language-Action Models
Machine Learning

The paper presents DropVLA, an action-level backdoor attack on Vision-Language-Action models, demonstrating how minimal data poisoning ca...

arXiv - AI · 4 min ·
[2506.01392] Sparse Imagination for Efficient Visual World Model Planning
Machine Learning

The paper presents a novel approach called Sparse Imagination for enhancing visual world model planning in robotics, improving computatio...

arXiv - AI · 3 min ·
[2505.04317] Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning
Robotics

This paper presents a novel approach to multi-drone volleyball using a hierarchical reinforcement learning framework, achieving high perf...

arXiv - AI · 4 min ·
[2602.23331] Utilizing LLMs for Industrial Process Automation
LLMs

This article explores the application of Large Language Models (LLMs) in industrial process automation, focusing on their potential to en...

arXiv - AI · 3 min ·
[2602.23259] Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving
Machine Learning

This paper presents the Risk-aware World Model Predictive Control (RaWMPC) framework aimed at enhancing generalization in end-to-end auto...

arXiv - AI · 4 min ·
[2602.23172] Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking
Robotics

The paper presents Latent Gaussian Splatting (LaGS) for 4D panoptic occupancy tracking, enhancing robot perception in dynamic environment...

arXiv - AI · 3 min ·
[2602.23073] Accelerated Online Risk-Averse Policy Evaluation in POMDPs with Theoretical Guarantees and Novel CVaR Bounds
Robotics

This paper presents a theoretical framework for accelerating risk-averse policy evaluation in partially observable Markov decision proces...

arXiv - AI · 4 min ·
[2602.23321] Deep ensemble graph neural networks for probabilistic cosmic-ray direction and energy reconstruction in autonomous radio arrays
Machine Learning

This paper presents a novel method using deep ensemble graph neural networks to accurately reconstruct the direction and energy of cosmic...

arXiv - Machine Learning · 4 min ·
[2602.23312] Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction
LLMs

This paper evaluates the effectiveness of small language models (SLMs) in leader-follower interactions, comparing zero-shot and one-shot ...

arXiv - Machine Learning · 4 min ·
[2602.22724] AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification
LLMs

AgentSentry introduces a novel framework to mitigate indirect prompt injection (IPI) in LLM agents, enhancing their security while mainta...

arXiv - AI · 4 min ·
[2602.22801] Unleashing the Potential of Diffusion Models for End-to-End Autonomous Driving
Machine Learning

This article explores the application of diffusion models in end-to-end autonomous driving, demonstrating their effectiveness through ext...

arXiv - Machine Learning · 4 min ·
[2602.22630] HyperKKL: Enabling Non-Autonomous State Estimation through Dynamic Weight Conditioning
Robotics

The paper presents HyperKKL, a novel approach for designing KKL observers for non-autonomous nonlinear systems, leveraging hypernetwork a...

arXiv - Machine Learning · 3 min ·
[2602.22549] DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation
Machine Learning

DrivePTS introduces a progressive learning framework for generating diverse driving scenes, enhancing fidelity and controllability in aut...

arXiv - AI · 4 min ·
[2602.22514] SignVLA: A Gloss-Free Vision-Language-Action Framework for Real-Time Sign Language-Guided Robotic Manipulation
Robotics

The paper presents SignVLA, a novel gloss-free Vision-Language-Action framework for real-time robotic manipulation guided by sign languag...

arXiv - AI · 4 min ·
[2602.22474] When to Act, Ask, or Learn: Uncertainty-Aware Policy Steering
LLMs

This article presents a framework for uncertainty-aware policy steering in robotics, enabling adaptive robot behavior by addressing task ...

arXiv - Machine Learning · 4 min ·
[2602.22289] What Topological and Geometric Structure Do Biological Foundation Models Learn? Evidence from 141 Hypotheses
LLMs

The paper investigates the geometric and topological structures learned by biological foundation models, analyzing 141 hypotheses through...

arXiv - Machine Learning · 4 min ·
[2602.22376] AeroDGS: Physically Consistent Dynamic Gaussian Splatting for Single-Sequence Aerial 4D Reconstruction
Machine Learning

AeroDGS presents a novel framework for 4D reconstruction from monocular UAV videos, addressing challenges in depth ambiguity and motion e...

arXiv - AI · 4 min ·
[2602.23280] Physics Informed Viscous Value Representations
NLP

This paper presents a novel approach to offline goal-conditioned reinforcement learning by introducing a physics-informed regularization ...

arXiv - Machine Learning · 3 min ·
[2602.23330] Toward Expert Investment Teams: A Multi-Agent LLM System with Fine-Grained Trading Tasks
LLMs

This article presents a multi-agent LLM framework for financial trading, emphasizing fine-grained task decomposition to enhance decision-...

arXiv - AI · 4 min ·
[2602.23193] ESAA: Event Sourcing for Autonomous Agents in LLM-Based Software Engineering
LLMs

The paper presents ESAA, an architecture for autonomous agents using event sourcing to enhance state management and execution in LLM-base...

arXiv - AI · 4 min ·