wingrune/3DGraphLLM
Image-Text-to-Text
Transformers
3d-scene-understanding
scene-graph
multimodal
vlm
llama
vision-language-model
arxiv: 2412.18450
License: mit
79.1 kB | 3 contributors | History: 2 commits
wingrune: Upload ga.png (35ac252, verified), 12 months ago
File            Size      Last commit     Updated
.gitattributes  1.52 kB   initial commit  12 months ago
README.md       24 Bytes  initial commit  12 months ago
ga.png          77.5 kB   Upload ga.png   12 months ago