A Diffusion Transformer model for 3D video-like data was introduced in SkyReels-V2 by the Skywork AI.
The model can be loaded with the following code snippet.
from diffusers import SkyReelsV2Transformer3DModel
transformer = SkyReelsV2Transformer3DModel.from_pretrained("Skywork/SkyReels-V2-DF-1.3B-540P-Diffusers", subfolder="transformer", torch_dtype=torch.bfloat16)
[[autodoc]] SkyReelsV2Transformer3DModel
[[autodoc]] models.modeling_outputs.Transformer2DModelOutput