[2603.29387] Extend3D: Town-Scale 3D Generation
Computer Science > Computer Vision and Pattern Recognition
arXiv:2603.29387 (cs)
[Submitted on 31 Mar 2026]

Title: Extend3D: Town-Scale 3D Generation
Authors: Seungwoo Yoon, Jinmo Kim, Jaesik Park

Abstract: In this paper, we propose Extend3D, a training-free pipeline for 3D scene generation from a single image, built upon an object-centric 3D generative model. To overcome the limitations of fixed-size latent spaces in object-centric models for representing wide scenes, we extend the latent space in the $x$ and $y$ directions. We then divide the extended latent space into overlapping patches, apply the object-centric 3D generative model to each patch, and couple the patches at each time step. Since patch-wise 3D generation with image conditioning requires strict spatial alignment between image and latent patches, we initialize the scene using a point cloud prior from a monocular depth estimator and iteratively refine occluded regions through SDEdit. We discovered that treating the incompleteness of the 3D structure as noise during 3D refinement enables 3D completion via a mechanism we term under-noising. Furthermore, to address the sub-optimality of object-centric models for sub-scene generation, we optimize the extended latent during denoising, ensuring that the denoising trajectories remain consistent with the sub-scene dynamics. To this end, we...
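The overlapping-patch coupling described in the abstract can be illustrated with a minimal sketch: the extended latent is split into overlapping fixed-size patches along $x$ and $y$, each patch is denoised independently, and overlapping predictions are averaged so neighboring patches agree at every time step. The denoiser below is a placeholder stub, and all function names and sizes are hypothetical; this is not the paper's implementation.

```python
import numpy as np

def denoise_patch(patch, t):
    # Placeholder for one denoising step of the object-centric 3D
    # generative model on a fixed-size latent patch (hypothetical).
    return patch * (1.0 - 0.1 * t)

def coupled_denoise_step(latent, t, patch=8, stride=4):
    """One coupled denoising step over an extended 2D latent grid.

    Each overlapping patch is denoised separately; per-position
    predictions are accumulated and averaged by their coverage count,
    a common way to couple patch-wise diffusion outputs.
    """
    H, W = latent.shape[:2]
    acc = np.zeros_like(latent)
    # Coverage counter, broadcastable over trailing feature dims.
    cnt = np.zeros(latent.shape[:2] + (1,) * (latent.ndim - 2))
    for y in range(0, H - patch + 1, stride):
        for x in range(0, W - patch + 1, stride):
            out = denoise_patch(latent[y:y+patch, x:x+patch], t)
            acc[y:y+patch, x:x+patch] += out
            cnt[y:y+patch, x:x+patch] += 1.0
    return acc / cnt

latent = np.ones((16, 24, 4))  # latent extended beyond one 8x8 patch
step = coupled_denoise_step(latent, t=1.0)
print(step.shape)  # (16, 24, 4)
```

With `stride < patch`, every latent position is covered by at least one patch and interior positions by several, so the averaging enforces agreement in the overlap regions rather than leaving visible seams between independently denoised patches.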