Multi-Modal Reasoning Models - a rohitg Collection

Multi-Modal Reasoning Models

updated Oct 13, 2025

XiaomiMiMo/MiMo-VL-7B-RL-2508

Image-Text-to-Text • 8B • Updated Aug 21, 2025 • 1.57k • 92
zai-org/GLM-4.1V-9B-Thinking

Image-Text-to-Text • 10B • Updated Oct 25, 2025 • 435k • 779
moonshotai/Kimi-VL-A3B-Thinking-2506

Image-Text-to-Text • 16B • Updated Jan 30 • 5.48k • 369
Skywork/Skywork-R1V3-38B

Image-Text-to-Text • 38B • Updated Jul 14, 2025 • 43 • 89
Qwen/QVQ-72B-Preview

Image-Text-to-Text • 73B • Updated Jan 12, 2025 • 856 • 610
zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Oct 25, 2025 • 134k • 719
multimodal-reasoning-lab/Bagel-Zebra-CoT

Any-to-Any • Updated Aug 6, 2025 • 20 • 8
multimodal-reasoning-lab/Anole-Zebra-CoT

Any-to-Any • 7B • Updated Jul 23, 2025 • 15 • 4
Qwen/Qwen3-VL-30B-A3B-Thinking-FP8

Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 17.7k • 57