AdvEvo-MARL: Shaping Internalized Safety through Adversarial Co-Evolution in Multi-Agent Reinforcement Learning • Paper • 2510.01586 • Published Oct 2
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models • Paper • 2510.05034 • Published Oct 6
MetaSpatial: Reinforcing 3D Spatial Reasoning in VLMs for the Metaverse • Paper • 2503.18470 • Published Mar 24