Salyut1/GLM-4.7-NVFP4

Text Generation

8-bit precision

How long did it take to quantize?

#6 opened 5 months ago by

TensorRT-LLM 1.2.0rc6 Endless Stream

#5 opened 5 months ago by mbatuhanunverdi

Could we create a 139B quantization with REAP for this model?

#2 opened 5 months ago by

Trying out later with 8 x RTX 5090

#1 opened 5 months ago by