The 3D variational autoencoder (VAE) model with KL loss used in HunyuanImage2.1 for its refiner pipeline.
The model can be loaded with the following code snippet.
from diffusers import AutoencoderKLHunyuanImageRefiner
vae = AutoencoderKLHunyuanImageRefiner.from_pretrained("hunyuanvideo-community/HunyuanImage-2.1-Refiner-Diffusers", subfolder="vae", torch_dtype=torch.bfloat16)
[[autodoc]] AutoencoderKLHunyuanImageRefiner
[[autodoc]] models.autoencoders.vae.DecoderOutput