AtomGradient/gemma-3-4b-it-qat-4bit-lite

Image-Text-to-Text

AlexWuKing committed 29 days ago

Commit f1140b5 · 1 Parent(s): 651b12f

fix

Files changed (1)

README.md +1 -1

@@ -20,7 +20,7 @@ pipeline_tag: image-text-to-text
 Optimized version of [gemma-3-4b-it-qat-4bit](https://huggingface.co/mlx-community/gemma-3-4b-it-qat-4bit) for Apple Silicon edge devices. Reduces model size from 2.8 GB to 2.3 GB with lower runtime memory and significantly reduced thermal output, while preserving text and image understanding quality.
-For an even smaller version (2.1 GB) with weight splitting and neuron pruning, see [gemma-3-4b-it-qat-4bit-mobile](https://huggingface.co/AtomGradientOpenSource/gemma-3-4b-it-qat-4bit-mobile).
+For an even smaller version (2.1 GB) with weight splitting and neuron pruning, see [gemma-3-4b-it-qat-4bit-mobile](https://huggingface.co/AtomGradient/gemma-3-4b-it-qat-4bit-mobile).
 ## Optimizations Applied