Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
inference-optimization
's Collections
NVIDIA-Nemotron-3-Nano-30B-A3B Quantized Models
Qwen3-Next-80B-A3B Quantized Models
Mixed Precision Models
KV Cache Quantization
NVIDIA-Nemotron-3-Nano-30B-A3B Quantized Models
updated
10 days ago
FP8-dynamic, FP8-block, NVFP4, INT4, versions of nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B
Upvote
-
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8-block
Updated
10 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8-dynamic
32B
•
Updated
9 days ago
•
35
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
Updated
10 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-quantized.w4a16
Updated
10 days ago
Upvote
-
Share collection
View history
Collection guide
Browse collections