Commit ·
f1140b5
1
Parent(s): 651b12f
fix
Browse files
README.md
CHANGED
|
@@ -20,7 +20,7 @@ pipeline_tag: image-text-to-text
|
|
| 20 |
|
| 21 |
Optimized version of [gemma-3-4b-it-qat-4bit](https://huggingface.co/mlx-community/gemma-3-4b-it-qat-4bit) for Apple Silicon edge devices. Reduces model size from 2.8 GB to 2.3 GB with lower runtime memory and significantly reduced thermal output, while preserving text and image understanding quality.
|
| 22 |
|
| 23 |
-
For an even smaller version (2.1 GB) with weight splitting and neuron pruning, see [gemma-3-4b-it-qat-4bit-mobile](https://huggingface.co/
|
| 24 |
|
| 25 |
## Optimizations Applied
|
| 26 |
|
|
|
|
| 20 |
|
| 21 |
Optimized version of [gemma-3-4b-it-qat-4bit](https://huggingface.co/mlx-community/gemma-3-4b-it-qat-4bit) for Apple Silicon edge devices. Reduces model size from 2.8 GB to 2.3 GB with lower runtime memory and significantly reduced thermal output, while preserving text and image understanding quality.
|
| 22 |
|
| 23 |
+
For an even smaller version (2.1 GB) with weight splitting and neuron pruning, see [gemma-3-4b-it-qat-4bit-mobile](https://huggingface.co/AtomGradient/gemma-3-4b-it-qat-4bit-mobile).
|
| 24 |
|
| 25 |
## Optimizations Applied
|
| 26 |
|