NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards Paper • 2511.14659 • Published 22 days ago • 12
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards Paper • 2511.14659 • Published 22 days ago • 12 • 2
10 Open Challenges Steering the Future of Vision-Language-Action Models Paper • 2511.05936 • Published Nov 8 • 5 • 2
10 Open Challenges Steering the Future of Vision-Language-Action Models Paper • 2511.05936 • Published Nov 8 • 5
Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics Paper • 2510.05137 • Published Oct 1 • 5
Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics Paper • 2510.05137 • Published Oct 1 • 5 • 3
Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned Paper • 2509.23250 • Published Sep 27 • 5
Training Vision-Language Process Reward Models for Test-Time Scaling in Multimodal Reasoning: Key Insights and Lessons Learned Paper • 2509.23250 • Published Sep 27 • 5 • 2
OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always! Paper • 2509.26495 • Published Sep 30 • 10 • 2
OffTopicEval: When Large Language Models Enter the Wrong Chat, Almost Always! Paper • 2509.26495 • Published Sep 30 • 10