vikhyatk/moondream2
Image-Text-to-Text
•
2B
•
Updated
•
1.66M
•
1.35k
https://huggingface.co/papers/2501.03006
Detect and estimate human poses in images and videos
Generate text based on your input