Update README.md
Browse files
README.md
CHANGED
|
@@ -13,6 +13,14 @@ library_name: mlx
|
|
| 13 |
|
| 14 |
# unsloth-GLM-4.5-Air-qx64-mlx
|
| 15 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 16 |
This model [unsloth-GLM-4.5-Air-qx64-mlx](https://huggingface.co/unsloth-GLM-4.5-Air-qx64-mlx) was
|
| 17 |
converted to MLX format from [unsloth/GLM-4.5-Air](https://huggingface.co/unsloth/GLM-4.5-Air)
|
| 18 |
using mlx-lm version **0.26.4**.
|
|
|
|
| 13 |
|
| 14 |
# unsloth-GLM-4.5-Air-qx64-mlx
|
| 15 |
|
| 16 |
+
This is an experimental quant formula still under evaluation:
|
| 17 |
+
```bash
|
| 18 |
+
head, v_proj for first 4 layers set to 8 bit
|
| 19 |
+
v_proj for the lower layers set to 6 bit
|
| 20 |
+
all others set to 4 bit, quanted with group size 32
|
| 21 |
+
```
|
| 22 |
+
|
| 23 |
+
|
| 24 |
This model [unsloth-GLM-4.5-Air-qx64-mlx](https://huggingface.co/unsloth-GLM-4.5-Air-qx64-mlx) was
|
| 25 |
converted to MLX format from [unsloth/GLM-4.5-Air](https://huggingface.co/unsloth/GLM-4.5-Air)
|
| 26 |
using mlx-lm version **0.26.4**.
|