Wan2.2 Fun InP ComfyUI Workflow Complete Usage Guide, Official + Community Versions (Kijai, GGUF)
Tutorial Overview
This tutorial will comprehensively introduce various implementation methods and usage of the Wan2.2 Fun InP first-last frame video generation model in ComfyUI. Wan2.2 Fun InP is a professional first-last frame control video generation model launched by Alibaba Cloud, supporting input of first and last frame images to generate intermediate transition videos, bringing stronger creative control to creators.
Versions and Content Covered in This Tutorial
Completed Versions:
- โ ComfyUI Official Native Version - Complete workflow provided by ComfyOrg official
- โ Wan2.2 Fun InP 14B First-Last Frame Version - High-quality first-last frame control video generation
Versions in Preparation:
- ๐ Kijai WanVideoWrapper Version - Community-developed convenient wrapper
- ๐ GGUF Quantized Version - Optimized version for low-configuration devices
Model Technical Features
Wan2.2 Fun InP is based on the Wan2.2 architecture and has been specifically optimized for first-last frame control video generation, with the following core features:
Core Advantages:
- First-Last Frame Control: Supports input of first and last frame images to generate intermediate transition videos, enhancing video coherence and creative freedom
- High-Quality Video Generation: Based on the Wan2.2 architecture, outputting cinema-level quality videos
- Multi-Resolution Support: Supports generating videos at resolutions such as 512ร512, 768ร768, 1024ร1024, adapting to different scenario requirements
The Wan2.2 Fun InP series models are based on the Apache2.0 open source license, supporting commercial use. The Apache2.0 license allows you to freely use, modify and distribute these models, including commercial purposes, as long as you retain the original copyright notice and license text.
Wan2.2 Fun InP Open Source Model Version Overview
Model Type | Model Name | Parameters | Main Function | Model Repository |
---|---|---|---|---|
First-Last Frame Control | Wan2.2-Fun-A14B-InP | 14B | Supports input of first and last frame images to generate intermediate transition videos, bringing stronger creative control to creators | ๐ค Wan2.2-Fun-A14B-InP |
Video Control | Wan2.2-Fun-A14B-Control | 14B | Supports different control conditions such as Canny, Depth, Pose, MLSD, etc., while also supporting trajectory control | ๐ค Wan2.2-Fun-A14B-Control |
Official Learning Resources
Related Code Repositories
- VideoX-Fun GitHub Repository - Official complete implementation code
- Wan2.2 Fun Official Documentation - Detailed model description and usage guide
Wan2.2 Fun InP ComfyUI Official Native Version Workflow Usage Guide
Version Description
The ComfyUI official native version is provided by the ComfyOrg team, using repackaged model files to ensure optimal compatibility with ComfyUI. This version supports both standard mode and Lightx2v 4-step LoRA acceleration mode.
Performance Comparison Test
Below are the test results using RTX4090D 24GB VRAM GPU for 640*640 resolution and 81-frame length:
Model Type | Resolution | VRAM Usage | First Generation Time | Second Generation Time |
---|---|---|---|---|
fp8_scaled | 640ร640 | 83% | โ 524 seconds | โ 520 seconds |
fp8_scaled + 4-step LoRA acceleration | 640ร640 | 89% | โ 138 seconds | โ 79 seconds |
Since using accelerated LoRA provides significant speed improvement, although there is some loss in dynamics, it is more user-friendly for low VRAM users. Therefore, in the two provided workflows, the accelerated LoRA version is enabled by default. If you need to enable the other workflow, select it and use Ctrl+B to enable it.
1. Wan2.2 Fun InP First-Last Frame Video Generation ComfyUI Workflow
Workflow Acquisition Method
Use the materials below as first and last frames
Model File Download
Diffusion Model
- wan2.2_fun_inpaint_high_noise_14B_fp8_scaled.safetensors
- wan2.2_fun_inpaint_low_noise_14B_fp8_scaled.safetensors
Lightning LoRA (Optional, for acceleration)
- wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors
- wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors
VAE
Text Encoder
ComfyUI/
โโโโ๐ models/
โ โโโโ๐ diffusion_models/
โ โ โโโโ wan2.2_fun_inpaint_high_noise_14B_fp8_scaled.safetensors
โ โ โโโโ wan2.2_fun_inpaint_low_noise_14B_fp8_scaled.safetensors
โ โโโโ๐ loras/
โ โ โโโโ wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors
โ โ โโโโ wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors
โ โโโโ๐ text_encoders/
โ โ โโโโ umt5_xxl_fp8_e4m3fn_scaled.safetensors
โ โโโโ๐ vae/
โ โโโ wan_2.1_vae.safetensors
Detailed Operation Steps
This workflow uses LoRA, please ensure the corresponding Diffusion model and LoRA are consistent
- High noise model and LoRA loading
- Ensure the
Load Diffusion Model
node loads thewan2.2_fun_inpaint_high_noise_14B_fp8_scaled.safetensors
model - Ensure the
LoraLoaderModelOnly
node loads thewan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors
- Low noise model and LoRA loading
- Ensure the
Load Diffusion Model
node loads thewan2.2_fun_inpaint_low_noise_14B_fp8_scaled.safetensors
model - Ensure the
LoraLoaderModelOnly
node loads thewan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors
- Ensure the
Load CLIP
node loads theumt5_xxl_fp8_e4m3fn_scaled.safetensors
model - Ensure the
Load VAE
node loads thewan_2.1_vae.safetensors
model - First and last frame image upload, upload first and last frame image materials respectively
- Enter prompts in the Prompt group
WanFunInpaintToVideo
node size and video length adjustment- Adjust the
width
andheight
dimensions, default is640
, we have set a smaller size and you can modify it as needed - Adjust
length
, this is the total number of video frames. The current workflow fps is 16. If you need to generate a 5-second video, you should set 5*16 = 80
- Adjust the
- Click the
Run
button, or use the shortcutCtrl(cmd) + Enter
to execute video generation
Wan2.2 Fun InP Kijai WanVideoWrapper ComfyUI Workflow
This content is being prepared and will be updated soon.
This part of the tutorial will introduce the convenient method using Kijai/ComfyUI-WanVideoWrapper.
Related model repository: https://huggingface.co/Kijai/WanVideo_comfy_fp8_scaled
Wan2.2 Fun InP GGUF Quantized Version ComfyUI Workflow
This content is being prepared and will be updated soon.
The GGUF version is suitable for users with limited VRAM, providing the following resources:
QuantStack/Wan2.2-Fun-A14B-InP-GGUF
Related Custom Nodes: City96/ComfyUI-GGUF