Update README.md
Browse files
README.md
CHANGED
|
@@ -11,7 +11,7 @@ tags:
|
|
| 11 |
- unified-model
|
| 12 |
---
|
| 13 |
|
| 14 |
-
#
|
| 15 |
|
| 16 |
<div align="center">
|
| 17 |
<img src="skywork-logo.png" alt="Skywork Logo" width="500">
|
|
@@ -27,37 +27,22 @@ tags:
|
|
| 27 |
|
| 28 |
## π Introduction
|
| 29 |
|
| 30 |
-
**
|
| 31 |
|
| 32 |
-
|
| 33 |
-
- π¨ **Text-to-Image Generation**
|
| 34 |
-
- βοΈ **Image Editing**
|
| 35 |
-
|
| 36 |
-
Trained from scratch on a large-scale multimodal corpus, UniPic is designed to support a wide range of unified image-text tasks efficiently.
|
| 37 |
-
|
| 38 |
-
<div align="center">
|
| 39 |
-
<img src="teaser.png" alt="Model Teaser" width="700">
|
| 40 |
-
</div>
|
| 41 |
-
|
| 42 |
-
|
| 43 |
-
---
|
| 44 |
|
| 45 |
## π Benchmarks
|
| 46 |
|
| 47 |
-
|
| 48 |
|
| 49 |
| Task | Score |
|
| 50 |
|--------------------|--------|
|
| 51 |
-
| π§ **GenEval** | 0.
|
| 52 |
-
| πΌοΈ **DPG-Bench** |
|
| 53 |
-
| βοΈ **GEditBench-EN** |
|
| 54 |
-
| π§ͺ **ImgEdit-Bench** | 3.
|
| 55 |
|
| 56 |
-
<div align="center">
|
| 57 |
-
<img src="UniPic.png" alt="Benchmark Results" width="700">
|
| 58 |
-
</div>
|
| 59 |
|
| 60 |
-
---
|
| 61 |
|
| 62 |
## π§ Usage
|
| 63 |
|
|
|
|
| 11 |
- unified-model
|
| 12 |
---
|
| 13 |
|
| 14 |
+
# UniPic2-SD3.5M-Kontext-2B
|
| 15 |
|
| 16 |
<div align="center">
|
| 17 |
<img src="skywork-logo.png" alt="Skywork Logo" width="500">
|
|
|
|
| 27 |
|
| 28 |
## π Introduction
|
| 29 |
|
| 30 |
+
**UniPic2-SD3.5M-Kontext-2B** is a 2B-parameter post-trained model built on the SD3.5-Medium family. It focuses on text-to-image generation and image editing, delivering strong quality with a fast generation speed. It runs smoothly on a single 16 GB consumer GPU.
|
| 31 |
|
| 32 |
+
<div align="center"> <img src="teaser.png" alt="Model Teaser" width="720"> </div>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
|
| 34 |
## π Benchmarks
|
| 35 |
|
| 36 |
+
UniPic2-SD3.5M-Kontext-2B w/o GRPO achieves competitive results across a variety of vision-language tasks:
|
| 37 |
|
| 38 |
| Task | Score |
|
| 39 |
|--------------------|--------|
|
| 40 |
+
| π§ **GenEval** | 0.83 |
|
| 41 |
+
| πΌοΈ **DPG-Bench** | 83.7 |
|
| 42 |
+
| βοΈ **GEditBench-EN** | 6.31 |
|
| 43 |
+
| π§ͺ **ImgEdit-Bench** | 3.95 |
|
| 44 |
|
|
|
|
|
|
|
|
|
|
| 45 |
|
|
|
|
| 46 |
|
| 47 |
## π§ Usage
|
| 48 |
|