mmproj precision

by BigWhoop - opened Oct 31, 2025

Discussion

BigWhoop

Oct 31, 2025

Hi, I saw that the Qwen3-VL GGUFs often come with a F32 mmproj. Is there any benefit in using the F32 over F16?

danielhanchen

Unsloth AI org Oct 31, 2025

Not really - F16 / BF16 is enough - F32 might work as well, but tbh BF16 is enough

TPH441

Nov 1, 2025

•

edited Nov 1, 2025

The vision tensors are almost always in BF16 so going to F32 doesn't add anything. BF16 -> F16 is a lossy conversion but it's debatable whether you'd notice the difference.

If your hardware doesn't support BF16 however then F32 may be used if you want to use a lossless mmproj.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment