Robotics & Embodied AI

Physical AI, robots, and autonomous systems

Top This Week

Llms

An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

https://shapingrooms.com/research I published a paper today on something I've been calling postural manipulation. The short version: ordi...

Reddit - Artificial Intelligence · 1 min ·
Llms

[R] An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

https://shapingrooms.com/research I've been documenting what I'm calling postural manipulation: a specific class of language that install...

Reddit - Machine Learning · 1 min ·
[2601.07855] RoAD Benchmark: How LiDAR Models Fail under Coupled Domain Shifts and Label Evolution
Machine Learning

[2601.07855] RoAD Benchmark: How LiDAR Models Fail under Coupled Domain Shifts and Label Evolution

Abstract page for arXiv paper 2601.07855: RoAD Benchmark: How LiDAR Models Fail under Coupled Domain Shifts and Label Evolution

arXiv - AI · 3 min ·

All Content

[2504.13647] An Efficient LiDAR-Camera Fusion Network for Multi-Class 3D Dynamic Object Detection and Trajectory Prediction
Computer Vision

[2504.13647] An Efficient LiDAR-Camera Fusion Network for Multi-Class 3D Dynamic Object Detection and Trajectory Prediction

The paper presents a novel LiDAR-camera fusion framework for real-time 3D dynamic object detection and trajectory prediction, enhancing s...

arXiv - AI · 4 min ·
[2501.16613] Safe Reinforcement Learning for Real-World Engine Control
Machine Learning

[2501.16613] Safe Reinforcement Learning for Real-World Engine Control

This article presents a novel toolchain for implementing safe reinforcement learning in real-world engine control, specifically for trans...

arXiv - Machine Learning · 4 min ·
[2511.23055] MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents
Robotics

[2511.23055] MindPower: Enabling Theory-of-Mind Reasoning in VLM-based Embodied Agents

The paper presents MindPower, a framework that enhances embodied agents' decision-making by integrating Theory of Mind (ToM) reasoning, o...

arXiv - AI · 3 min ·
[2510.12462] Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems
Llms

[2510.12462] Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems

This article evaluates biases in Large Language Models (LLMs) used as judges in communication systems, assessing their reliability and pr...

arXiv - AI · 4 min ·
[2506.04867] Sensory-Motor Control with Large Language Models via Iterative Policy Refinement
Llms

[2506.04867] Sensory-Motor Control with Large Language Models via Iterative Policy Refinement

This paper presents a novel method for enabling large language models (LLMs) to control embodied agents through iterative policy refineme...

arXiv - Machine Learning · 4 min ·
[2506.04500] "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation
Llms

[2506.04500] "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation

This paper presents STPR, a framework that utilizes large language models to convert complex natural language constraints into executable...

arXiv - AI · 4 min ·
[2503.12434] A Survey on the Optimization of Large Language Model-based Agents
Llms

[2503.12434] A Survey on the Optimization of Large Language Model-based Agents

This survey reviews optimization techniques for Large Language Model (LLM)-based agents, categorizing methods into parameter-driven and p...

arXiv - AI · 4 min ·
[2602.21198] Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs
Llms

[2602.21198] Learning from Trials and Errors: Reflective Test-Time Planning for Embodied LLMs

This article presents a novel approach called Reflective Test-Time Planning for embodied LLMs, enabling robots to learn from mistakes thr...

arXiv - Machine Learning · 4 min ·
[2602.21174] Efficient Hierarchical Any-Angle Path Planning on Multi-Resolution 3D Grids
Robotics

[2602.21174] Efficient Hierarchical Any-Angle Path Planning on Multi-Resolution 3D Grids

This paper presents an efficient hierarchical approach for any-angle path planning on multi-resolution 3D grids, addressing scalability i...

arXiv - AI · 3 min ·
[2602.21119] Cooperative-Competitive Team Play of Real-World Craft Robots
Machine Learning

[2602.21119] Cooperative-Competitive Team Play of Real-World Craft Robots

The paper explores advancements in multi-agent reinforcement learning for training cooperative and competitive robots, introducing a nove...

arXiv - AI · 3 min ·
[2602.20958] EKF-Based Depth Camera and Deep Learning Fusion for UAV-Person Distance Estimation and Following in SAR Operations
Machine Learning

[2602.20958] EKF-Based Depth Camera and Deep Learning Fusion for UAV-Person Distance Estimation and Following in SAR Operations

This paper presents a novel system that integrates depth camera measurements and deep learning for accurate distance estimation in UAV-as...

arXiv - AI · 4 min ·
[2602.20636] SurgAtt-Tracker: Online Surgical Attention Tracking via Temporal Proposal Reranking and Motion-Aware Refinement
Machine Learning

[2602.20636] SurgAtt-Tracker: Online Surgical Attention Tracking via Temporal Proposal Reranking and Motion-Aware Refinement

The paper presents SurgAtt-Tracker, a novel framework for online surgical attention tracking that enhances minimally invasive surgery thr...

arXiv - AI · 4 min ·
[2602.20323] Learning Physical Principles from Interaction: Self-Evolving Planning via Test-Time Memory
Llms

[2602.20323] Learning Physical Principles from Interaction: Self-Evolving Planning via Test-Time Memory

This article presents PhysMem, a memory framework that allows vision-language model planners to learn physical principles through interac...

arXiv - AI · 3 min ·
[2602.20220] What Matters for Simulation to Online Reinforcement Learning on Real Robots
Machine Learning

[2602.20220] What Matters for Simulation to Online Reinforcement Learning on Real Robots

This paper explores design choices that enhance online reinforcement learning (RL) on physical robots, presenting findings from 100 train...

arXiv - AI · 3 min ·
[2602.20219] An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction
Llms

[2602.20219] An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction

This article presents a novel multimodal framework for human-robot interaction that integrates video and speech processing with large lan...

arXiv - AI · 3 min ·
[2602.20200] Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation
Machine Learning

[2602.20200] Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

The paper presents OptimusVLA, a dual-memory framework for robotic manipulation that enhances efficiency and robustness in action generat...

arXiv - AI · 4 min ·
[2602.20169] Autonomous AI and Ownership Rules
Robotics

[2602.20169] Autonomous AI and Ownership Rules

This article explores the ownership rules surrounding AI-generated outputs, examining how they are linked to their creators and the impli...

arXiv - AI · 3 min ·
[2602.21201] Aletheia tackles FirstProof autonomously
Llms

[2602.21201] Aletheia tackles FirstProof autonomously

The paper presents Aletheia, an autonomous mathematics research agent that successfully solved 6 out of 10 problems in the FirstProof cha...

arXiv - Machine Learning · 3 min ·
[2602.21172] NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning
Machine Learning

[2602.21172] NoRD: A Data-Efficient Vision-Language-Action Model that Drives without Reasoning

The paper presents NoRD, a data-efficient Vision-Language-Action model that enhances autonomous driving without requiring extensive datas...

arXiv - AI · 3 min ·
[2602.20934] Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence
Llms

[2602.20934] Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence

The paper introduces AgentOS, a conceptual framework that transitions Large Language Models from static inference engines to dynamic cogn...

arXiv - AI · 3 min ·
Previous Page 28 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime