Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
NYCU-RL-Bandits-Lab
rl-bandits-lab
Follow
NYCU-RL-Bandits-Lab
AI & ML interests
Reinforcement learning
Organizations
None yet
models
4
Sort: Recently updated
rl-bandits-lab/ultrafeedback_rm
8B
•
Updated
Jul 30, 2025
•
2
rl-bandits-lab/helpsteer_rm
8B
•
Updated
Jun 10, 2025
•
3
rl-bandits-lab/hhrlhf_rm
8B
•
Updated
May 21, 2025
•
3
rl-bandits-lab/translation_rm
8B
•
Updated
May 21, 2025
•
4
datasets
2
Sort: Recently updated
rl-bandits-lab/SEGALE-WMT24
Viewer
•
Updated
Nov 5, 2025
•
137k
•
55
rl-bandits-lab/SEGALE-WMT24-Human-Eval
Viewer
•
Updated
Nov 5, 2025
•
27k
•
6