With just a single image as input, SV3D lets you effortlessly obtain views from multiple angles and generate intricate 3D videos.

Introducing Stable Video 3D (SV3D)

Quality Novel View Synthesis and 3D Generation from Single Images

Stable Video 3D (SV3D) is a generative model based on Stable Video Diffusion. It delivers significantly improved quality and view consistency for novel view synthesis and 3D generation from single images.

SV3D Advantages

Significantly improved quality and view-consistency

SV3D surpasses existing open-source alternatives in generating detailed and faithful multi-views.

Novel view synthesis advancements

Offers superior pose-controllability and consistent object appearance across multiple views.

High-quality 3D mesh generation

Produces accurate and realistic 3D meshes directly from single image inputs.

Commercially available

Can be used for commercial purposes with a Stability AI Membership.

Open-source access

Model weights available on Hugging Face for non-commercial use.


Feature List

Generates novel multi-views from single images

SV3D can create realistic and consistent views of an object from any angle, even those not present in the original image.

Improved 3D optimization

Leverages multi-view consistency to generate high-quality 3D meshes directly from novel views.

Masked score distillation sampling loss

Enhances 3D quality in regions not visible in the predicted views.

Disentangled illumination model

Reduces baked-in lighting issues, leading to more realistic 3D models.

Two variants available

SV3D_u: Generates orbital videos from single images without camera conditioning.
SV3D_p: Creates 3D video along specified camera paths using either single images or orbital views.
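To make the masking idea from the feature list more concrete, here is a minimal Python sketch. It is only an illustration: it does not implement score distillation sampling or SV3D's actual loss. The function name `masked_mean_loss`, the per-pixel loss map, and the visibility mask are all hypothetical. The only point is that a loss term can be restricted to pixels that the predicted views do not cover.

```python
def masked_mean_loss(per_pixel_loss, visible_mask):
    """Average a per-pixel loss over pixels NOT covered by the predicted views.

    per_pixel_loss: 2D list of floats (hypothetical loss map for one render).
    visible_mask:   2D list of bools, True where a predicted view covers the pixel.
    Both inputs are assumptions made for this sketch, not SV3D data structures.
    """
    total = 0.0
    count = 0
    for loss_row, mask_row in zip(per_pixel_loss, visible_mask):
        for loss, visible in zip(loss_row, mask_row):
            if not visible:  # only unseen pixels contribute
                total += loss
                count += 1
    # With no unseen pixels there is nothing to penalize.
    return total / count if count else 0.0


if __name__ == "__main__":
    loss_map = [[1.0, 2.0], [3.0, 4.0]]
    mask = [[True, False], [True, False]]  # right column is unseen
    print(masked_mean_loss(loss_map, mask))
```

In this toy example only the right-hand column is unseen, so only those two values are averaged.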


Novel-View Generation

SV3D introduces significant advancements in 3D generation, particularly in novel view synthesis (NVS).

3D Generation

Stable Video 3D (SV3D) leverages its multi-view consistency to optimize 3D Neural Radiance Fields (NeRF) and mesh representations, improving the quality of 3D meshes generated directly from novel views.

Two Variants of Stable Video 3D (SV3D)

SV3D_u and SV3D_p

Stable Video 3D has two different variants: SV3D_u and SV3D_p.
SV3D_u generates orbital videos based on single image inputs without camera conditioning. This means that it can create a video of an object rotating in space, even if the original image only shows the object from one viewpoint.
SV3D_p extends the capability of SV3D_u by accommodating both single images and orbital views. This allows for the creation of 3D video along specified camera paths. For example, you could use SV3D_p to create a video of a drone flying around an object, or a video of a camera panning across a scene.
Both SV3D_u and SV3D_p are capable of generating high-quality, realistic 3D videos. However, SV3D_p offers more flexibility and control over the camera path, which can be useful for certain applications.
The table below summarizes the key differences between the two Stable Video 3D variants:

| Variant | Input | Output | Camera path control |
| --- | --- | --- | --- |
| SV3D_u | Single image | Orbital video | No |
| SV3D_p | Single image or orbital views | 3D video | Yes |
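
To illustrate what a "specified camera path" might look like, here is a small Python sketch that builds two orbits as lists of (azimuth, elevation) pairs in degrees. It rests on several assumptions: this page does not say how SV3D_p parameterizes camera poses, so the azimuth/elevation representation, the default of 21 frames, the 10 degree elevation, and the function names `static_orbit` and `dynamic_orbit` are hypothetical choices made for illustration. None of this is SV3D's API.

```python
import math


def static_orbit(num_frames=21, elevation_deg=10.0):
    """Evenly spaced azimuths over one full turn at a fixed elevation.

    This resembles the plain orbital video described for SV3D_u.
    Default values are assumptions for this sketch.
    """
    step = 360.0 / num_frames
    return [((i * step) % 360.0, elevation_deg) for i in range(num_frames)]


def dynamic_orbit(num_frames=21, base_elevation_deg=10.0, swing_deg=15.0):
    """Same azimuth sweep, but the elevation rises and falls once per turn.

    This is one example of a custom path that a camera-conditioned
    variant such as SV3D_p could, in principle, be asked to follow.
    """
    step = 360.0 / num_frames
    path = []
    for i in range(num_frames):
        azimuth = (i * step) % 360.0
        elevation = base_elevation_deg + swing_deg * math.sin(math.radians(azimuth))
        path.append((azimuth, elevation))
    return path


if __name__ == "__main__":
    for azimuth, elevation in dynamic_orbit(num_frames=8):
        print(f"azimuth={azimuth:6.1f}  elevation={elevation:5.1f}")
```

The static orbit keeps the camera at one height, while the dynamic orbit varies the height as the camera circles the object. The second kind of path is where camera path control becomes useful.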

Frequently Asked Questions

