arxiv:2601.16208
Jihan Yang PRO
jihanyang
AI & ML interests
Computer Vision, Multimodality, Embodied AI
Recent Activity
upvoted a paper 6 days ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining
upvoted a paper 11 days ago
Solaris: Building a Multiplayer Video World Model in Minecraft
liked a dataset 21 days ago
nyu-visionx/scale-rae-data