[2604.00557] Multi-Camera View Scaling for Data-Efficient Robot

[2604.00557] Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

arXiv - Machine Learning April 02, 2026 4 min read

About this article

Abstract page for arXiv paper 2604.00557: Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

Computer Science > Robotics arXiv:2604.00557 (cs) [Submitted on 1 Apr 2026] Title:Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning Authors:Yichen Xie, Yixiao Wang, Shuqi Zhao, Cheng-En Wu, Masayoshi Tomizuka, Jianwen Xie, Hao-Shu Fang View a PDF of the paper titled Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning, by Yichen Xie and 6 other authors View PDF HTML (experimental) Abstract:The generalization ability of imitation learning policies for robotic manipulation is fundamentally constrained by the diversity of expert demonstrations, while collecting demonstrations across varied environments is costly and difficult in practice. In this paper, we propose a practical framework that exploits inherent scene diversity without additional human effort by scaling camera views during demonstration collection. Instead of acquiring more trajectories, multiple synchronized camera perspectives are used to generate pseudo-demonstrations from each expert trajectory, which enriches the training distribution and improves viewpoint invariance in visual representations. We analyze how different action spaces interact with view scaling and show that camera-space representations further enhance diversity. In addition, we introduce a multiview action aggregation method that allows single-view policies to benefit from multiple cameras during deployment. Extensive experiments in simulation and real-world manipulation tasks demonstrate significant gains...

Originally published on April 02, 2026. Curated by AI News.

Robotics

Eyes in the sky: Drones and AI set to revolutionize forest carbon accounting

AI News - General · about 2 hours ago

Llms

Anthropic found emergent emotional states in Claude. I'm seeing the same phenomenon in simple trading agents. Is emergence universal under optimization pressure?

Anthropic researchers recently found that Claude develops internal representations of emotional concepts that aren't decorative. They inf...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

[2604.01676] GPA: Learning GUI Process Automation from Demonstrations

Abstract page for arXiv paper 2604.01676: GPA: Learning GUI Process Automation from Demonstrations

arXiv - AI · 3 min · about 3 hours ago

Machine Learning

[2603.13842] Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving

Abstract page for arXiv paper 2603.13842: Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement L...

arXiv - AI · 4 min · about 3 hours ago

[2604.00557] Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

About this article

Related Articles

Eyes in the sky: Drones and AI set to revolutionize forest carbon accounting

Anthropic found emergent emotional states in Claude. I'm seeing the same phenomenon in simple trading agents. Is emergence universal under optimization pressure?

[2604.01676] GPA: Learning GUI Process Automation from Demonstrations

[2603.13842] Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving

No comments

Stay updated with AI News