Low-bit Quantized Open LLM Leaderboard 🏆 • Track, rank and evaluate open LLMs and chatbots • 194
INC: Testing Collection A collection of low precision models generated by Intel Neural Compressor including mxfp8, mxfp4 and nvfp4. • 13 items • Updated 4 days ago
Intel/LongCat-Flash-Thinking-2601-int4-mixed-AutoRound Text Generation • 2B • Updated 8 days ago • 35 • 2