arxiv:2412.09057
YUEXIN LI
yuexinlinus
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 6 hours ago
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs
upvoted
a
paper
about 6 hours ago
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
upvoted
a
paper
about 2 months ago
V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models