WorldGen: From Text to Traversable and Interactive 3D Worlds Paper • 2511.16825 • Published 16 days ago • 21
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published 17 days ago • 104
Think-on-Graph 3.0: Efficient and Adaptive LLM Reasoning on Heterogeneous Graphs via Multi-Agent Dual-Evolving Context Retrieval Paper • 2509.21710 • Published Sep 26 • 18
The Denario project: Deep knowledge AI agents for scientific discovery Paper • 2510.26887 • Published Oct 30 • 6
Surfer 2: The Next Generation of Cross-Platform Computer Use Agents Paper • 2510.19949 • Published Oct 22 • 38
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning Paper • 2510.25992 • Published Oct 29 • 44
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published Oct 30 • 26
Exploring Conditions for Diffusion models in Robotic Control Paper • 2510.15510 • Published Oct 17 • 39
When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA Paper • 2510.04849 • Published Oct 6 • 113
DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively Paper • 2509.26603 • Published Sep 30 • 16
MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation Paper • 2509.26391 • Published Sep 30 • 21
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning Paper • 2509.25760 • Published Sep 30 • 55
CondAmbigQA: A Benchmark and Dataset for Conditional Ambiguous Question Answering Paper • 2502.01523 • Published Feb 3 • 2
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning Paper • 2509.08755 • Published Sep 10 • 56
A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10 • 189
LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering Paper • 2509.09614 • Published Sep 11 • 7
Can Understanding and Generation Truly Benefit Together -- or Just Coexist? Paper • 2509.09666 • Published Sep 11 • 34
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2 • 225