OpenRubrics/OpenRubric-Science
Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment