Extend3D: Town-Scale 3D Generation

📰 ArXiv cs.AI

Extend3D is a training-free pipeline for 3D scene generation from a single image using an object-centric 3D generative model

advanced Published 1 Apr 2026

Action Steps

Extend the latent space in the x and y directions to overcome limitations of fixed-size latent spaces
Divide the extended latent space into overlapping patches
Apply the object-centric 3D generative model to each patch to generate 3D scenes

Who Needs to Know This

Computer vision engineers and researchers on a team can benefit from Extend3D to generate 3D scenes for various applications, and product managers can utilize this technology to enhance user experience in fields like gaming and architecture

Key Insight

💡 Extend3D enables training-free 3D scene generation from a single image by extending the latent space and applying an object-centric 3D generative model