ConsisIDTransformer3DModel

A Diffusion Transformer model for 3D data from ConsisID was introduced in Identity-Preserving Text-to-Video Generation by Frequency Decomposition by Peking University & University of Rochester & etc.

The model can be loaded with the following code snippet.

from diffusers import ConsisIDTransformer3DModel

transformer = ConsisIDTransformer3DModel.from_pretrained("BestWishYsh/ConsisID-preview", subfolder="transformer", torch_dtype=torch.bfloat16).to("cuda")

ConsisIDTransformer3DModel

[[autodoc]] ConsisIDTransformer3DModel

Transformer2DModelOutput

[[autodoc]] models.modeling_outputs.Transformer2DModelOutput