GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models Paper • 2605.29398 • Published 6 days ago • 4
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem Paper • 2512.24873 • Published Dec 31, 2025 • 109
Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement Paper • 2601.01562 • Published Jan 4 • 24