SVI 2.0 Pro Released - Infinite-Length Video Generation with Wan 2.2 Support

12/29/2025

Wan2.2 Fun Control ComfyUI Workflow Complete Usage Guide, Official + Community Versions (Kijai, GGUF)

This tutorial will comprehensively introduce various implementation methods and usage of the Wan2.2 Fun Control video control generation model in ComfyUI. Wan2.2 Fun Control is a new generation of video generation and control model launched by Alibaba Cloud. By introducing an innovative control code (Control Codes) mechanism combined with deep learning and multimodal conditional input, it can generate high-quality videos that meet preset control conditions.

Versions and Content Covered in This Tutorial

Completed Versions:

✅ ComfyUI Official Native Version - Complete workflow provided in the official ComfyOrg documentation
✅ Wan2.2 Fun Control 14B Video Control Version - High-quality multimodal control video generation

Versions in Preparation:

🔄 Kijai WanVideoWrapper Version - Community-developed convenient wrapper
🔄 GGUF Quantized Version - Optimized version for low-configuration devices

Model Technical Features

Wan2.2 Fun Control is based on the Wan2.2 architecture and has been specifically optimized for video control generation, with the following core features:

Core Advantages:

Multimodal Control: Supports multiple control conditions, including Canny (line drawing), Depth (depth), OpenPose (human pose), MLSD (geometric edges), etc., while also supporting trajectory control
High-Quality Video Generation: Based on the Wan2.2 architecture, outputting cinema-level quality videos
Multilingual Support: Supports multilingual prompt input including Chinese and English
Multi-Resolution Support: Supports generating videos at resolutions such as 512×512, 768×768, 1024×1024, adapting to different scenario requirements

Open Source License Description

The Wan2.2 Fun Control series models are based on the Apache2.0 open source license, supporting commercial use. The Apache2.0 license allows you to freely use, modify and distribute these models, including for commercial purposes, as long as you retain the original copyright notice and license text.

Wan2.2 Fun Control Open Source Model Version Overview

Model Type	Model Name	Parameters	Main Function	Model Repository
Video Control	Wan2.2-Fun-A14B-Control	14B	Supports different control conditions such as Canny, Depth, Pose, MLSD, etc., while also supporting trajectory control	🤗 Wan2.2-Fun-A14B-Control

VideoX-Fun GitHub Repository - Official complete implementation code
Wan2.2 Fun Control Official Documentation - Detailed model description and usage guide

Wan2.2 Fun Control ComfyUI Official Native Version Workflow Usage Guide

Version Description

The ComfyUI official native version is provided by the ComfyOrg team, using repackaged model files to ensure optimal compatibility with ComfyUI. This version supports both standard mode and Lightx2v 4-step LoRA acceleration mode.

Performance Comparison Test

Below are the test results using RTX4090D 24GB VRAM GPU for 640*640 resolution and 81-frame length:

Model Type	Resolution	VRAM Usage	First Generation Time	Second Generation Time
fp8_scaled	640×640	83%	≈ 524 seconds	≈ 520 seconds
fp8_scaled + 4-step LoRA acceleration	640×640	89%	≈ 138 seconds	≈ 79 seconds

Since 4-step LoRA is used, the user experience for first-time workflow users is better, but it may cause loss of video dynamics. We have enabled the accelerated LoRA version by default. If you need to enable another set of workflows, select it and use Ctrl+B to enable it.

1. Wan2.2 Fun Control Video Control Generation ComfyUI Workflow

Workflow Acquisition Method

Download the video or JSON file below and drag it into ComfyUI to load the corresponding workflow

Download JSON Format Workflow

Please download the images and videos below, which we will use as input.

Input Starting Image

Here we use a preprocessed video that can be directly used for control video generation

Model File Download

You can find the following models in Wan_2.2_ComfyUI_Repackaged

Diffusion Model

Wan2.2-Lightning LoRA (Optional, for acceleration)

VAE

wan_2.1_vae.safetensors

Text Encoder

umt5_xxl_fp8_e4m3fn_scaled.safetensors

ComfyUI/
├───📂 models/
│   ├───📂 diffusion_models/
│   │   ├─── wan2.2_fun_control_low_noise_14B_fp8_scaled.safetensors
│   │   └─── wan2.2_fun_control_high_noise_14B_fp8_scaled.safetensors
│   ├───📂 loras/
│   │   ├─── wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors
│   │   └─── wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors
│   ├───📂 text_encoders/
│   │   └─── umt5_xxl_fp8_e4m3fn_scaled.safetensors 
│   └───📂 vae/
│       └── wan_2.1_vae.safetensors

Detailed Operation Steps

Wan2.2 Fun Control Workflow Steps

💡

This workflow uses LoRA, please ensure the corresponding Diffusion model and LoRA are consistent. High noise and low noise models and LoRA need to be used correspondingly.

High noise model and LoRA loading

Ensure the Load Diffusion Model node loads the wan2.2_fun_control_high_noise_14B_fp8_scaled.safetensors model
Ensure the LoraLoaderModelOnly node loads the wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors

Low noise model and LoRA loading

Ensure the Load Diffusion Model node loads the wan2.2_fun_control_low_noise_14B_fp8_scaled.safetensors model
Ensure the LoraLoaderModelOnly node loads the wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors

Ensure the Load CLIP node loads the umt5_xxl_fp8_e4m3fn_scaled.safetensors model
Ensure the Load VAE node loads the wan_2.1_vae.safetensors model
Upload the starting frame in the Load Image node
In the second Load video node, control the pose video. The provided video has been preprocessed and can be used directly
Since the video we provide is a preprocessed pose video, the corresponding video image preprocessing nodes need to be disabled. You can select them and use Ctrl + B` to disable them
Modify Prompt - both Chinese and English are supported
In Wan22FunControlToVideo, modify the corresponding video size. The default is set to 640*640 resolution to avoid excessive time consumption for low VRAM users using this workflow
Click the Run button, or use the shortcut Ctrl(cmd) + Enter to execute video generation

Additional Notes

Since in the built-in nodes of ComfyUI, the preprocessor nodes only have Canny preprocessors, you can use similar ComfyUI-comfyui_controlnet_aux to implement other types of image preprocessing

Wan2.2 Fun Control Kijai WanVideoWrapper ComfyUI Workflow

⚠️

This content is being prepared and will be updated soon.

This part of the tutorial will introduce the convenient method using Kijai/ComfyUI-WanVideoWrapper.

Related model repository: https://huggingface.co/Kijai/WanVideo_comfy_fp8_scaled

Wan2.2 Fun Control GGUF Quantized Version ComfyUI Workflow

⚠️

This content is being prepared and will be updated soon.

The GGUF version is suitable for users with limited VRAM, providing the following resources:

QuantStack/Wan2.2-Fun-A14B-Control-GGUF

Related Custom Nodes: City96/ComfyUI-GGUF

SVI 2.0 Pro Released - Infinite-Length Video Generation with Wan 2.2 Support

Wan2.2 Fun Control ComfyUI Workflow Complete Usage Guide, Official + Community Versions (Kijai, GGUF)