Update README.md
Browse files
README.md
CHANGED
|
@@ -22,6 +22,8 @@ On a single RTX 5090, the TPS (transactions per second) of Qwen3-8B-Eagle3 incre
|
|
| 22 |
| qwen3-30b_moe-eagle3 | 8*h200 | 325 |
|
| 23 |
| qwen3-30b_moe | 8*5090 | 164 |
|
| 24 |
| qwen3-30b_moe-eagle3 | 8*5090 | 268 |
|
|
|
|
|
|
|
| 25 |
## How to use
|
| 26 |
|
| 27 |
To use Eagle3 with SGLang, first replace the qwen3_moe.py file in SGLang’s directory (sglang/python/sglang/srt/models/) with the qwen3_moe.py file from this project.
|
|
|
|
| 22 |
| qwen3-30b_moe-eagle3 | 8*h200 | 325 |
|
| 23 |
| qwen3-30b_moe | 8*5090 | 164 |
|
| 24 |
| qwen3-30b_moe-eagle3 | 8*5090 | 268 |
|
| 25 |
+
|
| 26 |
+
Join our AI computing power cloud platform now and enjoy the best AI cloud service experience. The link is as follows: https://tenyunn.com/
|
| 27 |
## How to use
|
| 28 |
|
| 29 |
To use Eagle3 with SGLang, first replace the qwen3_moe.py file in SGLang’s directory (sglang/python/sglang/srt/models/) with the qwen3_moe.py file from this project.
|