LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published 12 days ago • 148
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following Paper • 2511.21662 • Published 11 days ago • 10
UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning Paper • 2510.13515 • Published Oct 15 • 11
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe Paper • 2511.16334 • Published 17 days ago • 91
Scaling Spatial Intelligence with Multimodal Foundation Models Paper • 2511.13719 • Published 20 days ago • 44
Scaling Spatial Intelligence with Multimodal Foundation Models Paper • 2511.13719 • Published 20 days ago • 44
Scaling Spatial Intelligence with Multimodal Foundation Models Paper • 2511.13719 • Published 20 days ago • 44