VideoVLA: Video Generators Can Be Generalizable Robot Manipulators Paper • 2512.06963 • Published 3 days ago • 2 • 2
SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling Paper • 2512.05343 • Published 5 days ago • 11 • 2
ProPhy: Progressive Physical Alignment for Dynamic World Simulation Paper • 2512.05564 • Published 5 days ago • 3 • 2
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published 5 days ago • 36 • 3
Self-Improving VLM Judges Without Human Annotations Paper • 2512.05145 • Published 8 days ago • 15 • 2
World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty Paper • 2512.05927 • Published 5 days ago • 10 • 2
SIMA 2: A Generalist Embodied Agent for Virtual Worlds Paper • 2512.04797 • Published 6 days ago • 17 • 2
TV2TV: A Unified Framework for Interleaved Language and Video Generation Paper • 2512.05103 • Published 6 days ago • 12 • 2
On GRPO Collapse in Search-R1: The Lazy Likelihood-Displacement Death Spiral Paper • 2512.04220 • Published 7 days ago • 11 • 2
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published 7 days ago • 146 • 5
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction Paper • 2512.04987 • Published 6 days ago • 71 • 2
RELIC: Interactive Video World Model with Long-Horizon Memory Paper • 2512.04040 • Published 7 days ago • 21 • 2
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 8 days ago • 193 • 4
GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning Paper • 2512.02423 • Published 8 days ago • 3 • 2