Lil2J committed on
Commit 8bc2b57 · verified · 1 Parent(s): 014c9e9

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -22,6 +22,8 @@ On a single RTX 5090, the TPS (transactions per second) of Qwen3-8B-Eagle3 incre
  | qwen3-30b_moe-eagle3 | 8*h200 | 325 |
  | qwen3-30b_moe | 8*5090 | 164 |
  | qwen3-30b_moe-eagle3 | 8*5090 | 268 |
+
+ Join our AI computing power cloud platform now and enjoy the best AI cloud service experience. The link is as follows: https://tenyunn.com/
  ## How to use
 
  To use Eagle3 with SGLang, first replace the qwen3_moe.py file in SGLang’s directory (sglang/python/sglang/srt/models/) with the qwen3_moe.py file from this project.
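
The file replacement described in the "How to use" step could be scripted roughly as below. This is a minimal sketch, not part of the project: the function name `replace_model_file`, its parameters, and the `.bak` backup are assumptions added for illustration. It also assumes `sglang_root` points at a local SGLang source checkout laid out as `sglang/python/sglang/srt/models/`.

```python
import shutil
from pathlib import Path


def replace_model_file(project_file: Path, sglang_root: Path) -> Path:
    """Copy the project's qwen3_moe.py over SGLang's copy.

    Hypothetical helper: `project_file` is the qwen3_moe.py shipped with
    this project, `sglang_root` is the top-level SGLang checkout directory.
    """
    models_dir = sglang_root / "python" / "sglang" / "srt" / "models"
    target = models_dir / "qwen3_moe.py"

    if not project_file.is_file():
        raise FileNotFoundError(f"project file not found: {project_file}")
    if not models_dir.is_dir():
        raise FileNotFoundError(f"SGLang models directory not found: {models_dir}")

    # Keep the original file so the change can be reverted (an added
    # precaution, not something the README asks for).
    if target.exists():
        shutil.copy2(target, target.with_name(target.name + ".bak"))

    shutil.copy2(project_file, target)
    return target
```

A call such as `replace_model_file(Path("qwen3_moe.py"), Path("sglang"))` would overwrite the file inside the checkout. If SGLang was installed as a package rather than from source, the models directory will live elsewhere and the path has to be adjusted.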