Q3_K_M (112 GB) is bigger than Q3_K_XL (104 GB)?

by rtzurtz - opened Oct 2, 2025

Oct 2, 2025

as per title

Unsloth AI org Oct 2, 2025

Yes that;s correct. K_XL is usually smaller

Kreuz

Nov 9, 2025

But what are the implications?

Cos I have strix halo 128 GB, so can run Q3_K_XL at 100 GB or Q3_K_M at 115 GB, but what's the difference in perplexity or benchmarks?

Couldn't we have a UD K_XL which sits between them?

There's 20 GB+ of 'free real estate' on the device.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment