AlexWuKing committed on
Commit
f1140b5
·
1 Parent(s): 651b12f
Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -20,7 +20,7 @@ pipeline_tag: image-text-to-text
 
  Optimized version of [gemma-3-4b-it-qat-4bit](https://huggingface.co/mlx-community/gemma-3-4b-it-qat-4bit) for Apple Silicon edge devices. Reduces model size from 2.8 GB to 2.3 GB with lower runtime memory and significantly reduced thermal output, while preserving text and image understanding quality.
 
- For an even smaller version (2.1 GB) with weight splitting and neuron pruning, see [gemma-3-4b-it-qat-4bit-mobile](https://huggingface.co/AtomGradientOpenSource/gemma-3-4b-it-qat-4bit-mobile).
+ For an even smaller version (2.1 GB) with weight splitting and neuron pruning, see [gemma-3-4b-it-qat-4bit-mobile](https://huggingface.co/AtomGradient/gemma-3-4b-it-qat-4bit-mobile).
 
  ## Optimizations Applied