[2604.00557] Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

[2604.00557] Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

arXiv - Machine Learning 4 min read

About this article

Abstract page for arXiv paper 2604.00557: Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

Computer Science > Robotics arXiv:2604.00557 (cs) [Submitted on 1 Apr 2026] Title:Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning Authors:Yichen Xie, Yixiao Wang, Shuqi Zhao, Cheng-En Wu, Masayoshi Tomizuka, Jianwen Xie, Hao-Shu Fang View a PDF of the paper titled Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning, by Yichen Xie and 6 other authors View PDF HTML (experimental) Abstract:The generalization ability of imitation learning policies for robotic manipulation is fundamentally constrained by the diversity of expert demonstrations, while collecting demonstrations across varied environments is costly and difficult in practice. In this paper, we propose a practical framework that exploits inherent scene diversity without additional human effort by scaling camera views during demonstration collection. Instead of acquiring more trajectories, multiple synchronized camera perspectives are used to generate pseudo-demonstrations from each expert trajectory, which enriches the training distribution and improves viewpoint invariance in visual representations. We analyze how different action spaces interact with view scaling and show that camera-space representations further enhance diversity. In addition, we introduce a multiview action aggregation method that allows single-view policies to benefit from multiple cameras during deployment. Extensive experiments in simulation and real-world manipulation tasks demonstrate significant gains...

Originally published on April 02, 2026. Curated by AI News.

Related Articles

Robotics

Eyes in the sky: Drones and AI set to revolutionize forest carbon accounting

AI News - General ·
Llms

Anthropic found emergent emotional states in Claude. I'm seeing the same phenomenon in simple trading agents. Is emergence universal under optimization pressure?

Anthropic researchers recently found that Claude develops internal representations of emotional concepts that aren't decorative. They inf...

Reddit - Artificial Intelligence · 1 min ·
[2604.01676] GPA: Learning GUI Process Automation from Demonstrations
Llms

[2604.01676] GPA: Learning GUI Process Automation from Demonstrations

Abstract page for arXiv paper 2604.01676: GPA: Learning GUI Process Automation from Demonstrations

arXiv - AI · 3 min ·
[2603.13842] Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving
Machine Learning

[2603.13842] Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving

Abstract page for arXiv paper 2603.13842: Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement L...

arXiv - AI · 4 min ·
More in Robotics: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime