
NVIDIA Open Sources Sana - An AI Model for Efficient 4K Image Generation

NVIDIA has open-sourced Sana, a text-to-image generation model designed to generate images efficiently at resolutions up to 4096×4096. The optimized model can also run on a consumer laptop GPU with 16GB of VRAM.

Key Features

  • Ultra Efficiency: The Sana-0.6B model generates a 1024×1024 image in less than a second
  • Light Deployment: Runs on laptop GPUs with 16GB of VRAM
  • High Resolution: Supports image generation up to 4K (4096×4096) resolution
  • Open Source Support: Includes official ComfyUI support and LoRA training tools

Technical Innovations

Sana’s performance comes from several technical innovations:

  1. Deep Compression Autoencoder: Achieves a 32x image compression ratio, significantly reducing the number of latent tokens
  2. Linear DiT: Replaces traditional attention with a linear attention mechanism, reducing complexity in the number of tokens N from O(N²) to O(N)
  3. Decoder-style Text Encoder: Uses Gemma as the text encoder for improved text understanding
  4. Efficient Training and Inference: The Flow-DPM-Solver reduces the number of inference steps for faster generation
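The first two points can be illustrated with a small sketch. This is not Sana's actual code: it assumes the 32x ratio applies to each side of the image, that each latent position becomes one token (`patch_size=1`), and that linear attention uses a ReLU feature map, which is one common choice and is not specified in this article. The 8x autoencoder with 2×2 patches used for comparison is likewise an assumed baseline. All function names are hypothetical.

```python
def latent_tokens(image_size, compression, patch_size=1):
    """Token count for a square image (assumes per-side compression)."""
    side = image_size // (compression * patch_size)
    return side * side


def relu(x):
    return x if x > 0.0 else 0.0


def quadratic_attention(Q, K, V):
    """ReLU-kernel attention the O(N^2) way: every query scores every key."""
    d_v = len(V[0])
    out = []
    for q in Q:
        scores = [sum(relu(a) * relu(b) for a, b in zip(q, k)) for k in K]
        norm = sum(scores) + 1e-6  # small epsilon avoids division by zero
        out.append([sum(s * v[j] for s, v in zip(scores, V)) / norm
                    for j in range(d_v)])
    return out


def linear_attention(Q, K, V):
    """Same result in O(N): summarize keys and values once, reuse per query."""
    d, d_v = len(K[0]), len(V[0])
    kv = [[0.0] * d_v for _ in range(d)]  # sum over tokens of relu(k) * v^T
    k_sum = [0.0] * d                     # sum over tokens of relu(k)
    for k, v in zip(K, V):
        for i in range(d):
            ki = relu(k[i])
            k_sum[i] += ki
            for j in range(d_v):
                kv[i][j] += ki * v[j]
    out = []
    for q in Q:
        phi_q = [relu(a) for a in q]
        norm = sum(a * b for a, b in zip(phi_q, k_sum)) + 1e-6
        out.append([sum(phi_q[i] * kv[i][j] for i in range(d)) / norm
                    for j in range(d_v)])
    return out


if __name__ == "__main__":
    # Token counts under the stated assumptions.
    print(latent_tokens(1024, 32))     # 32x autoencoder, 1024x1024 image
    print(latent_tokens(4096, 32))     # 32x autoencoder, 4096x4096 image
    print(latent_tokens(1024, 8, 2))   # assumed 8x baseline with 2x2 patches

    # Both attention forms give the same output on a toy input.
    Q = [[0.5, -1.0], [1.0, 2.0], [0.3, 0.1]]
    K = [[1.0, 0.2], [-0.5, 0.7], [0.4, 0.9]]
    V = [[1.0, 0.0, 2.0], [0.5, 1.5, -1.0], [2.0, 1.0, 0.0]]
    print(quadratic_attention(Q, K, V))
    print(linear_attention(Q, K, V))
```

Under these assumptions a 1024×1024 image yields 1,024 tokens instead of 4,096 for the baseline, and a 4096×4096 image yields 16,384. The linear form never builds the N×N score matrix, so its cost grows in proportion to the token count rather than its square, which matters most at 4K.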

Online Demo

Try Sana through the online demo:

Sana Online Demo

ComfyUI Integration

Sana now has official ComfyUI support, so users can run the model through ComfyUI workflows. Official LoRA training tools are also provided for training personalized models.

Future Development

The NVIDIA team has completed the training code, inference code, model library release, ComfyUI support, and LoRA training. Future plans include ControlNet support, 8-bit/4-bit device optimization, larger-scale models, and Sana 1.5, which will focus on human bodies and faces, text rendering, photorealism, and efficiency.