Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
rohitg 's Collections
multimodal diffusion llm
Reasoning VideoLLMs
Multi-Modal Reasoning Models
Multi-modal Embedding Models

Multi-Modal Reasoning Models

updated Oct 13
Upvote
-

  • XiaomiMiMo/MiMo-VL-7B-RL-2508

    Image-Text-to-Text • 8B • Updated Aug 21 • 1.02k • 80

  • zai-org/GLM-4.1V-9B-Thinking

    Image-Text-to-Text • 10B • Updated Oct 25 • 189k • • 760

  • moonshotai/Kimi-VL-A3B-Thinking-2506

    Image-Text-to-Text • 16B • Updated Aug 18 • 171k • 329

  • Skywork/Skywork-R1V3-38B

    Image-Text-to-Text • 38B • Updated Jul 14 • 208 • 87

  • Qwen/QVQ-72B-Preview

    Image-Text-to-Text • 73B • Updated Jan 12 • 231 • 609

  • zai-org/GLM-4.5V

    Image-Text-to-Text • 108B • Updated Oct 25 • 43.8k • • 699

  • multimodal-reasoning-lab/Bagel-Zebra-CoT

    Any-to-Any • Updated Aug 6 • 60 • 8

  • multimodal-reasoning-lab/Anole-Zebra-CoT

    Any-to-Any • 7B • Updated Jul 23 • 25 • 3

  • Qwen/Qwen3-VL-30B-A3B-Thinking-FP8

    Image-Text-to-Text • 31B • Updated Nov 26 • 80.9k • 47
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs