Robotics & Embodied AI

Physical AI, robots, and autonomous systems

Top This Week

[2512.19576] LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller
Machine Learning

[2512.19576] LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller

Abstract page for arXiv paper 2512.19576: LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller

arXiv - AI · 4 min ·
[2511.14565] Masked IRL: LLM-Guided Reward Disambiguation from Demonstrations and Language
Llms

[2511.14565] Masked IRL: LLM-Guided Reward Disambiguation from Demonstrations and Language

Abstract page for arXiv paper 2511.14565: Masked IRL: LLM-Guided Reward Disambiguation from Demonstrations and Language

arXiv - AI · 4 min ·
[2511.12882] Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos
Machine Learning

[2511.12882] Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos

Abstract page for arXiv paper 2511.12882: Towards High-Consistency Embodied World Model with Multi-View Trajectory Videos

arXiv - AI · 4 min ·

All Content

[2602.13473] NeuroWeaver: An Autonomous Evolutionary Agent for Exploring the Programmatic Space of EEG Analysis Pipelines
Llms

[2602.13473] NeuroWeaver: An Autonomous Evolutionary Agent for Exploring the Programmatic Space of EEG Analysis Pipelines

NeuroWeaver is an autonomous evolutionary agent designed to optimize EEG analysis pipelines, addressing data constraints and computationa...

arXiv - AI · 3 min ·
[2602.13323] Contrastive explanations of BDI agents
Robotics

[2602.13323] Contrastive explanations of BDI agents

This article discusses the extension of Belief-Desire-Intention (BDI) agents to provide contrastive explanations, enhancing transparency ...

arXiv - AI · 3 min ·
[2602.13248] X-Blocks: Linguistic Building Blocks of Natural Language Explanations for Automated Vehicles
Nlp

[2602.13248] X-Blocks: Linguistic Building Blocks of Natural Language Explanations for Automated Vehicles

The paper introduces X-Blocks, a framework for analyzing natural language explanations in automated vehicles, enhancing user trust and un...

arXiv - AI · 4 min ·
Robotics

[D] We found 18K+ exposed OpenClaw instances and ~15% of community skills contain malicious instructionsc

A security audit reveals over 18,000 exposed OpenClaw instances and alarming findings of malicious instructions in 15% of community-built...

Reddit - Machine Learning · 1 min ·
[2602.10727] Rising Multi-Armed Bandits with Known Horizons
Machine Learning

[2602.10727] Rising Multi-Armed Bandits with Known Horizons

The paper presents a novel approach to the Rising Multi-Armed Bandit (RMAB) problem, introducing CUmulative Reward Estimation UCB (CURE-U...

arXiv - Machine Learning · 3 min ·
[2602.13197] Imitating What Works: Simulation-Filtered Modular Policy Learning from Human Videos
Robotics

[2602.13197] Imitating What Works: Simulation-Filtered Modular Policy Learning from Human Videos

This article presents a framework called Perceive-Simulate-Imitate (PSI) for training robots to learn manipulation skills from human vide...

arXiv - Machine Learning · 4 min ·
[2602.13003] MASAR: Motion-Appearance Synergy Refinement for Joint Detection and Trajectory Forecasting
Machine Learning

[2602.13003] MASAR: Motion-Appearance Synergy Refinement for Joint Detection and Trajectory Forecasting

The paper presents MASAR, a novel framework for joint 3D detection and trajectory forecasting that enhances performance by integrating mo...

arXiv - Machine Learning · 3 min ·
[2602.12684] Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution
Machine Learning

[2602.12684] Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution

Xiaomi-Robotics-0 is an advanced open-sourced vision-language-action model designed for real-time execution, showcasing state-of-the-art ...

arXiv - Machine Learning · 4 min ·
[2602.12492] Composable Model-Free RL for Navigation with Input-Affine Systems
Machine Learning

[2602.12492] Composable Model-Free RL for Navigation with Input-Affine Systems

This paper presents a novel composable model-free reinforcement learning approach for navigation in dynamic environments, focusing on rea...

arXiv - Machine Learning · 3 min ·
[2602.12487] Gradient-Enhanced Partitioned Gaussian Processes for Real-Time Quadrotor Dynamics Modeling
Machine Learning

[2602.12487] Gradient-Enhanced Partitioned Gaussian Processes for Real-Time Quadrotor Dynamics Modeling

This paper introduces a novel Gaussian Process model for quadrotor dynamics that integrates gradient information, enabling real-time infe...

arXiv - Machine Learning · 4 min ·
[2602.12407] MiDAS: A Multimodal Data Acquisition System and Dataset for Robot-Assisted Minimally Invasive Surgery
Robotics

[2602.12407] MiDAS: A Multimodal Data Acquisition System and Dataset for Robot-Assisted Minimally Invasive Surgery

The paper presents MiDAS, an open-source multimodal data acquisition system for robot-assisted minimally invasive surgery, enabling synch...

arXiv - Machine Learning · 3 min ·
[2602.12405] Self-Refining Vision Language Model for Robotic Failure Detection and Reasoning
Llms

[2602.12405] Self-Refining Vision Language Model for Robotic Failure Detection and Reasoning

The paper presents ARMOR, a self-refining vision language model designed for robotic failure detection and reasoning, achieving significa...

arXiv - Machine Learning · 4 min ·
[2602.13052] Quantization-Aware Collaborative Inference for Large Embodied AI Models
Machine Learning

[2602.13052] Quantization-Aware Collaborative Inference for Large Embodied AI Models

This paper explores quantization-aware collaborative inference for large embodied AI models, addressing challenges in resource-limited en...

arXiv - Machine Learning · 3 min ·
[2602.13040] TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios
Machine Learning

[2602.13040] TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios

The paper presents TCRL, a novel framework for robust constrained reinforcement learning that addresses challenges posed by temporally co...

arXiv - Machine Learning · 4 min ·
[2602.12636] Dual-Granularity Contrastive Reward via Generated Episodic Guidance for Efficient Embodied RL
Machine Learning

[2602.12636] Dual-Granularity Contrastive Reward via Generated Episodic Guidance for Efficient Embodied RL

This paper introduces the Dual-Granularity Contrastive Reward framework, which enhances sample efficiency in reinforcement learning (RL) ...

arXiv - Machine Learning · 4 min ·
[2602.12520] Multi-Agent Model-Based Reinforcement Learning with Joint State-Action Learned Embeddings
Machine Learning

[2602.12520] Multi-Agent Model-Based Reinforcement Learning with Joint State-Action Learned Embeddings

This paper presents a novel framework for multi-agent model-based reinforcement learning, integrating joint state-action representation l...

arXiv - Machine Learning · 3 min ·
[2602.10915] Blind Gods and Broken Screens: Architecting a Secure, Intent-Centric Mobile Agent Operating System
Llms

[2602.10915] Blind Gods and Broken Screens: Architecting a Secure, Intent-Centric Mobile Agent Operating System

The paper presents Aura, a secure mobile agent operating system designed to address vulnerabilities in current app-centric models by impl...

arXiv - AI · 4 min ·
[2602.10234] Transforming Policy-Car Swerving for Mitigating Stop-and-Go Traffic Waves: A Practice-Oriented Jam-Absorption Driving Strategy
Ai Agents

[2602.10234] Transforming Policy-Car Swerving for Mitigating Stop-and-Go Traffic Waves: A Practice-Oriented Jam-Absorption Driving Strategy

This article presents a novel driving strategy to mitigate stop-and-go traffic waves using a jam-absorption technique inspired by police-...

arXiv - AI · 4 min ·
[2602.08543] GISA: A Benchmark for General Information-Seeking Assistant
Llms

[2602.08543] GISA: A Benchmark for General Information-Seeking Assistant

The paper introduces GISA, a benchmark designed for evaluating General Information-Seeking Assistants, addressing limitations in existing...

arXiv - AI · 4 min ·
[2601.09605] Sim2real Image Translation Enables Viewpoint-Robust Policies from Fixed-Camera Datasets
Nlp

[2601.09605] Sim2real Image Translation Enables Viewpoint-Robust Policies from Fixed-Camera Datasets

The paper presents MANGO, a novel image translation method that enhances viewpoint robustness in robot manipulation policies using fixed-...

arXiv - AI · 4 min ·
Previous Page 51 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime