Extend3D: Town-Scale 3D Generation

📰 ArXiv cs.AI

Extend3D is a training-free pipeline for 3D scene generation from a single image using an object-centric 3D generative model

advanced Published 1 Apr 2026
Action Steps
  1. Extend the latent space in the x and y directions to overcome limitations of fixed-size latent spaces
  2. Divide the extended latent space into overlapping patches
  3. Apply the object-centric 3D generative model to each patch to generate 3D scenes
Who Needs to Know This

Computer vision engineers and researchers on a team can benefit from Extend3D to generate 3D scenes for various applications, and product managers can utilize this technology to enhance user experience in fields like gaming and architecture

Key Insight

💡 Extend3D enables training-free 3D scene generation from a single image by extending the latent space and applying an object-centric 3D generative model

Share This
🌆 Generate 3D towns from a single image with Extend3D!
Read full paper → ← Back to News