nightmedia
/

unsloth-GLM-4.5-Air-qx64-mlx

Text Generation

Model card Files Files and versions

nightmedia commited on Aug 28

Commit

01a7fa0

·

verified ·

1 Parent(s): 2f8d3f7

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -13,6 +13,14 @@ library_name: mlx
 # unsloth-GLM-4.5-Air-qx64-mlx
 This model [unsloth-GLM-4.5-Air-qx64-mlx](https://huggingface.co/unsloth-GLM-4.5-Air-qx64-mlx) was
 converted to MLX format from [unsloth/GLM-4.5-Air](https://huggingface.co/unsloth/GLM-4.5-Air)
 using mlx-lm version **0.26.4**.

 # unsloth-GLM-4.5-Air-qx64-mlx
+This is an experimental quant formula still under evaluation:
+```bash
+head, v_proj for first 4 layers set to 8 bit
+v_proj for the lower layers set to 6 bit
+all others set to 4 bit, quanted with group size 32
+```
 This model [unsloth-GLM-4.5-Air-qx64-mlx](https://huggingface.co/unsloth-GLM-4.5-Air-qx64-mlx) was
 converted to MLX format from [unsloth/GLM-4.5-Air](https://huggingface.co/unsloth/GLM-4.5-Air)
 using mlx-lm version **0.26.4**.