rayruiyang's Collections

VST

updated 12 days ago

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.

Upvote
6

  • rayruiyang/VST-3B-RL

    Image-Text-to-Text • 4B • Updated Nov 11, 2025 • 405 • 3

  • rayruiyang/VST-3B-SFT

    Image-Text-to-Text • 4B • Updated Nov 11, 2025 • 1.87k

  • rayruiyang/VST-7B-SFT

    Image-Text-to-Text • 8B • Updated Nov 11, 2025 • 2.4k

  • rayruiyang/VST-7B-RL

    Image-Text-to-Text • 8B • Updated Nov 11, 2025 • 206

  • Visual Spatial Tuning

    Paper • 2511.05491 • Published Nov 7, 2025 • 52

  • rayruiyang/vst_3d_grounding_benchmark

    Preview • Updated 12 days ago • 30