Ministral 3 - Additional Checkpoints Collection Different formats and quantized versions of our Ministral 3 family: 14B/8B/3B Instruct/Reasoning GGUF, 3B Instruct ONNX, and 14B/8B/3B Instruct BF16. • 13 items • Updated 8 days ago • 13
Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 8 days ago • 120
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 8 days ago • 73
Trinity Collection Collection of Arcee AI models in the Trinity family • 6 items • Updated 9 days ago • 16
Article We’re open-sourcing our text-to-image model and the process behind it • 29 days ago • 74
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9 • 129
RLVE Collection Models for "RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments" - https://arxiv.org/abs/2511.07317 • 3 items • Updated 29 days ago • 5
C2S-Scale-Gemma-Models Collection C2S-Scale Gemma models trained using the Cell2Sentence framework, described in the C2S-Scale paper. • 2 items • Updated Oct 13 • 12
Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face • Feb 11 • 89
Dream-Coder 7B Collection https://hkunlp.github.io/blog/2025/dream-coder • 2 items • Updated Jul 15 • 6