rlsamplingJF/Qwen2.5-3B-Instruct-finemath-highquality-part1-seed2028-initial 3B • Updated 6 days ago • 14
rlsamplingJF/myllama-1B-20BT-finemath-highquality-part1-seed2026-initial 0.9B • Updated 6 days ago • 14
rlsamplingJF/Llama-3.2-3B-finemath-highquality-rm-run2-lr3e-5-cosine-bs32-gc1.0-initial 3B • Updated Nov 1 • 31
rlsamplingJF/Llama-3.2-3B-finemath-highquality-rm-run2-lr3e-5-cosine-bs32-gc1.0 3B • Updated Nov 1 • 95
rlsamplingJF/posttraining_sentence_Qwen2.5-7B-Instruct-finemath-rm-run1-lr1e-6-constant-bs8-gc10.0-step84 7B • Updated Oct 16 • 4
rlsamplingJF/posttraining_sentence_Qwen2.5-7B-Instruct-finemath-rm-run1-lr1e-6-constant-bs8-gc10.0-step36 7B • Updated Oct 16 • 18