MatGPTQ
Collection
MatGPTQ quantized models • 7 items • Updated
YAML Metadata Warning: empty or missing yaml metadata in repo card
Check out the documentation for more information.
This is the official MatGPTQ checkpoint of microsoft/Phi-3-medium-128k-instruct, produced as described in the "MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization" paper.
This model can be run via vLLM. Checkout our integration at IST-DASLab/MatGPTQ