Robotics & Embodied AI

Physical AI, robots, and autonomous systems

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

https://shapingrooms.com/research I published a paper today on something I've been calling postural manipulation. The short version: ordi...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

[R] An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

https://shapingrooms.com/research I've been documenting what I'm calling postural manipulation: a specific class of language that install...

Reddit - Machine Learning · 1 min · about 3 hours ago

Machine Learning

[2601.07855] RoAD Benchmark: How LiDAR Models Fail under Coupled Domain Shifts and Label Evolution

Abstract page for arXiv paper 2601.07855: RoAD Benchmark: How LiDAR Models Fail under Coupled Domain Shifts and Label Evolution

arXiv - AI · 3 min · about 13 hours ago

All Content

Computer Vision

[2504.13647] An Efficient LiDAR-Camera Fusion Network for Multi-Class 3D Dynamic Object Detection and Trajectory Prediction

The paper presents a novel LiDAR-camera fusion framework for real-time 3D dynamic object detection and trajectory prediction, enhancing s...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2501.16613] Safe Reinforcement Learning for Real-World Engine Control

This article presents a novel toolchain for implementing safe reinforcement learning in real-world engine control, specifically for trans...

arXiv - Machine Learning · 4 min · about 1 month ago

Robotics

[2511.23055] MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents

The paper presents MindPower, a framework that enhances embodied agents' decision-making by integrating Theory of Mind (ToM) reasoning, o...

arXiv - AI · 3 min · about 1 month ago

Llms

[2510.12462] Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems

This article evaluates biases in Large Language Models (LLMs) used as judges in communication systems, assessing their reliability and pr...

arXiv - AI · 4 min · about 1 month ago

Llms

[2506.04867] Sensory-Motor Control with Large Language Models via Iterative Policy Refinement

This paper presents a novel method for enabling large language models (LLMs) to control embodied agents through iterative policy refineme...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2506.04500] "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation

This paper presents STPR, a framework that utilizes large language models to convert complex natural language constraints into executable...

arXiv - AI · 4 min · about 1 month ago

Llms

[2503.12434] A Survey on the Optimization of Large Language Model-based Agents

This survey reviews optimization techniques for Large Language Model (LLM)-based agents, categorizing methods into parameter-driven and p...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.21198] Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs

This article presents a novel approach called Reflective Test-Time Planning for embodied LLMs, enabling robots to learn from mistakes thr...

arXiv - Machine Learning · 4 min · about 1 month ago

Robotics

[2602.21174] Efficient Hierarchical Any-Angle Path Planning on Multi-Resolution 3D Grids

This paper presents an efficient hierarchical approach for any-angle path planning on multi-resolution 3D grids, addressing scalability i...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.21119] Cooperative-Competitive Team Play of Real-World Craft Robots

The paper explores advancements in multi-agent reinforcement learning for training cooperative and competitive robots, introducing a nove...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.20958] EKF-Based Depth Camera and Deep Learning Fusion for UAV-Person Distance Estimation and Following in SAR Operations

This paper presents a novel system that integrates depth camera measurements and deep learning for accurate distance estimation in UAV-as...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20636] SurgAtt-Tracker: Online Surgical Attention Tracking via Temporal Proposal Reranking and Motion-Aware Refinement

The paper presents SurgAtt-Tracker, a novel framework for online surgical attention tracking that enhances minimally invasive surgery thr...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20323] Learning Physical Principles from Interaction: Self-Evolving Planning via Test-Time Memory

This article presents PhysMem, a memory framework that allows vision-language model planners to learn physical principles through interac...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.20220] What Matters for Simulation to Online Reinforcement Learning on Real Robots

This paper explores design choices that enhance online reinforcement learning (RL) on physical robots, presenting findings from 100 train...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20219] An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction

This article presents a novel multimodal framework for human-robot interaction that integrates video and speech processing with large lan...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.20200] Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

The paper presents OptimusVLA, a dual-memory framework for robotic manipulation that enhances efficiency and robustness in action generat...

arXiv - AI · 4 min · about 1 month ago

Robotics

[2602.20169] Autonomous AI and Ownership Rules

This article explores the ownership rules surrounding AI-generated outputs, examining how they are linked to their creators and the impli...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.21201] Aletheia tackles FirstProof autonomously

The paper presents Aletheia, an autonomous mathematics research agent that successfully solved 6 out of 10 problems in the FirstProof cha...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.21172] NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning

The paper presents NoRD, a data-efficient Vision-Language-Action model that enhances autonomous driving without requiring extensive datas...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20934] Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence

The paper introduces AgentOS, a conceptual framework that transitions Large Language Models from static inference engines to dynamic cogn...

arXiv - AI · 3 min · about 1 month ago

Previous Page 28 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Robotics & Embodied AI

Top This Week

An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

[R] An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

[2601.07855] RoAD Benchmark: How LiDAR Models Fail under Coupled Domain Shifts and Label Evolution

All Content

[2504.13647] An Efficient LiDAR-Camera Fusion Network for Multi-Class 3D Dynamic Object Detection and Trajectory Prediction

[2501.16613] Safe Reinforcement Learning for Real-World Engine Control

[2511.23055] MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents

[2510.12462] Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems

[2506.04867] Sensory-Motor Control with Large Language Models via Iterative Policy Refinement

[2506.04500] "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation

[2503.12434] A Survey on the Optimization of Large Language Model-based Agents

[2602.21198] Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs

[2602.21174] Efficient Hierarchical Any-Angle Path Planning on Multi-Resolution 3D Grids

[2602.21119] Cooperative-Competitive Team Play of Real-World Craft Robots

[2602.20958] EKF-Based Depth Camera and Deep Learning Fusion for UAV-Person Distance Estimation and Following in SAR Operations

[2602.20636] SurgAtt-Tracker: Online Surgical Attention Tracking via Temporal Proposal Reranking and Motion-Aware Refinement

[2602.20323] Learning Physical Principles from Interaction: Self-Evolving Planning via Test-Time Memory

[2602.20220] What Matters for Simulation to Online Reinforcement Learning on Real Robots

[2602.20219] An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction

[2602.20200] Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

[2602.20169] Autonomous AI and Ownership Rules

[2602.21201] Aletheia tackles FirstProof autonomously

[2602.21172] NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning

[2602.20934] Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence

Related Topics

Stay updated with AI News