OrlandoHugBot commited on
Commit
8d686db
Β·
verified Β·
1 Parent(s): 82e7434

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -23
README.md CHANGED
@@ -11,7 +11,7 @@ tags:
11
  - unified-model
12
  ---
13
 
14
- # 🌌 Skywork/UniPic2-SD3.5M-Kontext-2B
15
 
16
  <div align="center">
17
  <img src="skywork-logo.png" alt="Skywork Logo" width="500">
@@ -27,37 +27,22 @@ tags:
27
 
28
  ## πŸ“– Introduction
29
 
30
- **Skywork-UniPic** is a **unified autoregressive multimodal model** with **1.5 billion parameters**, capable of handling three key vision-language tasks within a single architecture:
31
 
32
- - πŸ–ΌοΈ **Image Understanding**
33
- - 🎨 **Text-to-Image Generation**
34
- - ✏️ **Image Editing**
35
-
36
- Trained from scratch on a large-scale multimodal corpus, UniPic is designed to support a wide range of unified image-text tasks efficiently.
37
-
38
- <div align="center">
39
- <img src="teaser.png" alt="Model Teaser" width="700">
40
- </div>
41
-
42
-
43
- ---
44
 
45
  ## πŸ“Š Benchmarks
46
 
47
- Skywork-UniPic achieves competitive results across a variety of vision-language tasks:
48
 
49
  | Task | Score |
50
  |--------------------|--------|
51
- | 🧠 **GenEval** | 0.86 |
52
- | πŸ–ΌοΈ **DPG-Bench** | 85.5 |
53
- | βœ‚οΈ **GEditBench-EN** | 5.83 |
54
- | πŸ§ͺ **ImgEdit-Bench** | 3.49 |
55
 
56
- <div align="center">
57
- <img src="UniPic.png" alt="Benchmark Results" width="700">
58
- </div>
59
 
60
- ---
61
 
62
  ## 🧠 Usage
63
 
 
11
  - unified-model
12
  ---
13
 
14
+ # UniPic2-SD3.5M-Kontext-2B
15
 
16
  <div align="center">
17
  <img src="skywork-logo.png" alt="Skywork Logo" width="500">
 
27
 
28
  ## πŸ“– Introduction
29
 
30
+ **UniPic2-SD3.5M-Kontext-2B** is a 2B-parameter post-trained model built on the SD3.5-Medium family. It focuses on text-to-image generation and image editing, delivering strong quality with a fast generation speed. It runs smoothly on a single 16 GB consumer GPU.
31
 
32
+ <div align="center"> <img src="teaser.png" alt="Model Teaser" width="720"> </div>
 
 
 
 
 
 
 
 
 
 
 
33
 
34
  ## πŸ“Š Benchmarks
35
 
36
+ UniPic2-SD3.5M-Kontext-2B w/o GRPO achieves competitive results across a variety of vision-language tasks:
37
 
38
  | Task | Score |
39
  |--------------------|--------|
40
+ | 🧠 **GenEval** | 0.83 |
41
+ | πŸ–ΌοΈ **DPG-Bench** | 83.7 |
42
+ | βœ‚οΈ **GEditBench-EN** | 6.31 |
43
+ | πŸ§ͺ **ImgEdit-Bench** | 3.95 |
44
 
 
 
 
45
 
 
46
 
47
  ## 🧠 Usage
48