Skip to content
Help Build a Better ComfyUI Knowledge Base Become a Patron
NewsStep1X-Edit: Open Source AI Image Editing Framework

Step1X-Edit: Open Source AI Image Editing Framework

Step1X-Edit Demo

Step1X-Edit is a newly released open-source image editing framework that uses multimodal large language models to process reference images and user editing instructions, extracting latent embeddings and integrating them with a diffusion image decoder to obtain the target image. This project aims to provide performance comparable to closed-source models like GPT-4o and Gemini2 Flash.

Key Features

  • Natural Language Editing: Supports image editing via natural language instructions, such as “beautify,” “change the background,” or “convert to pixel art style”
  • Open Source and Commercial Use: Released under the Apache 2.0 license, freely available for commercial use
  • Flexible Hardware Requirements: Offers FP8 quantized version to reduce memory requirements
  • Community Support: Already has multiple community versions, including FP8 quantized models

Technical Specifications

Step1X-Edit provides multiple versions to accommodate different hardware configurations:

Model VersionPeak GPU Memory (512/786/1024 resolution)28 Steps Generation Time (seconds)
Standard42.5GB / 46.5GB / 49.8GB5s / 11s / 22s
FP8 Quantized31GB / 31.5GB / 34GB6.8s / 13.5s / 25s
Standard+CPU Offload25.9GB / 27.3GB / 29.1GB49.6s / 54.1s / 63.2s
FP8 Quantized+CPU Offload18GB / 18GB / 18GB35s / 40s / 51s

While officially recommended to use a GPU with 80GB of memory for best performance and efficiency, Step1X-Edit can also run on graphics cards with less memory through FP8 quantization and CPU offloading technology.

Online Demo

You can directly experience Step1X-Edit through the following link:

Step1X-Edit

Future Plans

The Step1X-Edit team has already completed:

  • Inference code and model weights release
  • Online demo (Gradio)
  • FP8 quantized weights

Planned features include:

  • Diffusers integration
  • ComfyUI integration

The release of this open-source project will enable more users to access high-quality AI image editing capabilities without relying on closed-source models, while its commercializable nature provides more possibilities for developers.