wooguy
#53
by
wooguy
- opened
README.md
CHANGED
|
@@ -17,7 +17,7 @@ library_name: diffusers
|
|
| 17 |
[](https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo) 
|
| 18 |
[](https://huggingface.co/spaces/akhaliq/Z-Image-Turbo) 
|
| 19 |
[](https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo) 
|
| 20 |
-
[](https://www.modelscope.cn/aigc/imageGeneration?tab=advanced&versionId=469191&modelType=Checkpoint&sdVersion=Z_IMAGE_TURBO&modelUrl=modelscope%
|
| 21 |
[](assets/Z-Image-Gallery.pdf) 
|
| 22 |
[](https://modelscope.cn/studios/Tongyi-MAI/Z-Image-Gallery/summary) 
|
| 23 |
<a href="https://arxiv.org/abs/2511.22699" target="_blank"><img src="https://img.shields.io/badge/Report-b5212f.svg?logo=arxiv" height="21px"></a>
|
|
@@ -31,24 +31,21 @@ Welcome to the official repository for the Z-Image(造相)project!
|
|
| 31 |
|
| 32 |
## ✨ Z-Image
|
| 33 |
|
| 34 |
-
Z-Image is a powerful and highly efficient image generation model
|
| 35 |
|
| 36 |
- 🚀 **Z-Image-Turbo** – A distilled version of Z-Image that matches or exceeds leading competitors with only **8 NFEs** (Number of Function Evaluations). It offers **⚡️sub-second inference latency⚡️** on enterprise-grade H800 GPUs and fits comfortably within **16G VRAM consumer devices**. It excels in photorealistic image generation, bilingual text rendering (English & Chinese), and robust instruction adherence.
|
| 37 |
|
| 38 |
-
-
|
| 39 |
-
|
| 40 |
-
- 🧱 **Z-Image-Omni-Base** – The versatile foundation model capable of both **generation and editing tasks**. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development, providing the most "raw" and diverse starting point for the open-source community.
|
| 41 |
|
| 42 |
- ✍️ **Z-Image-Edit** – A variant fine-tuned on Z-Image specifically for image editing tasks. It supports creative image-to-image generation with impressive instruction-following capabilities, allowing for precise edits based on natural language prompts.
|
| 43 |
|
| 44 |
### 📥 Model Zoo
|
| 45 |
|
| 46 |
-
| Model |
|
| 47 |
-
| :---
|
| 48 |
-
| **Z-Image-
|
| 49 |
-
| **Z-Image** |
|
| 50 |
-
| **Z-Image-
|
| 51 |
-
| **Z-Image-Edit** | ✅ | ✅ | ❌ | 50 | ✅ | Editing | High | Medium | Easy | *To be released* | *To be released* | | *To be released* |
|
| 52 |
|
| 53 |
### 🖼️ Showcase
|
| 54 |
|
|
@@ -88,7 +85,7 @@ Install the latest version of diffusers, use the following command:
|
|
| 88 |
<details>
|
| 89 |
<summary><sup>Click here for details for why you need to install diffusers from source</sup></summary>
|
| 90 |
|
| 91 |
-
We have submitted two pull requests ([#12703](https://github.com/huggingface/diffusers/pull/12703) and [#12715](https://github.com/huggingface/diffusers/pull/
|
| 92 |
Therefore, you need to install diffusers from source for the latest features and Z-Image support.
|
| 93 |
|
| 94 |
</details>
|
|
@@ -177,24 +174,12 @@ HF_XET_HIGH_PERFORMANCE=1 hf download Tongyi-MAI/Z-Image-Turbo
|
|
| 177 |
If you find our work useful in your research, please consider citing:
|
| 178 |
|
| 179 |
```bibtex
|
| 180 |
-
@
|
| 181 |
title={Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer},
|
| 182 |
-
author={
|
| 183 |
-
|
| 184 |
-
|
| 185 |
-
}
|
| 186 |
-
|
| 187 |
-
@article{liu2025decoupled,
|
| 188 |
-
title={Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield},
|
| 189 |
-
author={Dongyang Liu and Peng Gao and David Liu and Ruoyi Du and Zhen Li and Qilong Wu and Xin Jin and Sihan Cao and Shifeng Zhang and Hongsheng Li and Steven Hoi},
|
| 190 |
-
journal={arXiv preprint arXiv:2511.22677},
|
| 191 |
-
year={2025}
|
| 192 |
-
}
|
| 193 |
-
|
| 194 |
-
@article{jiang2025distribution,
|
| 195 |
-
title={Distribution Matching Distillation Meets Reinforcement Learning},
|
| 196 |
-
author={Jiang, Dengyang and Liu, Dongyang and Wang, Zanyi and Wu, Qilong and Jin, Xin and Liu, David and Li, Zhen and Wang, Mengmeng and Gao, Peng and Yang, Harry},
|
| 197 |
-
journal={arXiv preprint arXiv:2511.13649},
|
| 198 |
-
year={2025}
|
| 199 |
}
|
| 200 |
```
|
|
|
|
| 17 |
[](https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo) 
|
| 18 |
[](https://huggingface.co/spaces/akhaliq/Z-Image-Turbo) 
|
| 19 |
[](https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo) 
|
| 20 |
+
[](https://www.modelscope.cn/aigc/imageGeneration?tab=advanced&versionId=469191&modelType=Checkpoint&sdVersion=Z_IMAGE_TURBO&modelUrl=modelscope%253A%252F%252FTongyi-MAI%252FZ-Image-Turbo%253Frevision%253Dmaster%7D%7BOnline) 
|
| 21 |
[](assets/Z-Image-Gallery.pdf) 
|
| 22 |
[](https://modelscope.cn/studios/Tongyi-MAI/Z-Image-Gallery/summary) 
|
| 23 |
<a href="https://arxiv.org/abs/2511.22699" target="_blank"><img src="https://img.shields.io/badge/Report-b5212f.svg?logo=arxiv" height="21px"></a>
|
|
|
|
| 31 |
|
| 32 |
## ✨ Z-Image
|
| 33 |
|
| 34 |
+
Z-Image is a powerful and highly efficient image generation model with **6B** parameters. Currently there are three variants:
|
| 35 |
|
| 36 |
- 🚀 **Z-Image-Turbo** – A distilled version of Z-Image that matches or exceeds leading competitors with only **8 NFEs** (Number of Function Evaluations). It offers **⚡️sub-second inference latency⚡️** on enterprise-grade H800 GPUs and fits comfortably within **16G VRAM consumer devices**. It excels in photorealistic image generation, bilingual text rendering (English & Chinese), and robust instruction adherence.
|
| 37 |
|
| 38 |
+
- 🧱 **Z-Image-Base** – The non-distilled foundation model. By releasing this checkpoint, we aim to unlock the full potential for community-driven fine-tuning and custom development.
|
|
|
|
|
|
|
| 39 |
|
| 40 |
- ✍️ **Z-Image-Edit** – A variant fine-tuned on Z-Image specifically for image editing tasks. It supports creative image-to-image generation with impressive instruction-following capabilities, allowing for precise edits based on natural language prompts.
|
| 41 |
|
| 42 |
### 📥 Model Zoo
|
| 43 |
|
| 44 |
+
| Model | Hugging Face | ModelScope |
|
| 45 |
+
| :--- |:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
| 46 |
+
| **Z-Image-Turbo** | [](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo) <br> [](https://huggingface.co/spaces/Tongyi-MAI/Z-Image-Turbo) | [](https://www.modelscope.cn/models/Tongyi-MAI/Z-Image-Turbo) <br> [](https://www.modelscope.cn/aigc/imageGeneration?tab=advanced&versionId=469191&modelType=Checkpoint&sdVersion=Z_IMAGE_TURBO&modelUrl=modelscope%3A%2F%2FTongyi-MAI%2FZ-Image-Turbo%3Frevision%3Dmaster) |
|
| 47 |
+
| **Z-Image-Base** | *To be released* | *To be released* |
|
| 48 |
+
| **Z-Image-Edit** | *To be released* | *To be released* |
|
|
|
|
| 49 |
|
| 50 |
### 🖼️ Showcase
|
| 51 |
|
|
|
|
| 85 |
<details>
|
| 86 |
<summary><sup>Click here for details for why you need to install diffusers from source</sup></summary>
|
| 87 |
|
| 88 |
+
We have submitted two pull requests ([#12703](https://github.com/huggingface/diffusers/pull/12703) and [#12715](https://github.com/huggingface/diffusers/pull/12704)) to the 🤗 diffusers repository to add support for Z-Image. Both PRs have been merged into the latest official diffusers release.
|
| 89 |
Therefore, you need to install diffusers from source for the latest features and Z-Image support.
|
| 90 |
|
| 91 |
</details>
|
|
|
|
| 174 |
If you find our work useful in your research, please consider citing:
|
| 175 |
|
| 176 |
```bibtex
|
| 177 |
+
@misc{z-image-2025,
|
| 178 |
title={Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer},
|
| 179 |
+
author={Tongyi Lab},
|
| 180 |
+
year={2025},
|
| 181 |
+
publisher={GitHub},
|
| 182 |
+
journal={GitHub repository},
|
| 183 |
+
howpublished={\url{https://github.com/Tongyi-MAI/Z-Image}}
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 184 |
}
|
| 185 |
```
|