FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning Paper • 2510.22543 • Published Oct 26 • 10
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published 27 days ago • 160
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published 28 days ago • 93
Back to Basics: Let Denoising Generative Models Denoise Paper • 2511.13720 • Published 24 days ago • 65
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10 • 189