wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p5_fw0p5_ema0p9999_ep30 Text Generation • Updated about 15 hours ago
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p5_fw0p5_ema0p9999_ep30 Text Generation • Updated about 15 hours ago
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p5_fw0p5_ema0p99_ep30 Text Generation • Updated 3 days ago • 15
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B-Instruct-2507_bw0p5_fw0p5_ema0p99_ep30 Text Generation • Updated 3 days ago • 15
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B_bw0p5_fw0p5_ema0p999_ep30_rgcomq0p5 Text Generation • Updated 5 days ago • 11
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_ema_Qwen3-4B_bw0p5_fw0p5_ema0p999_ep30_rgcomq0p5 Text Generation • Updated 5 days ago • 11
wgcyeo/ci-feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-14B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 6 days ago • 16
wgcyeo/ci-feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-14B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 6 days ago • 16
wgcyeo/ci-feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-8B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 6 days ago • 12
wgcyeo/ci-feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-8B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 6 days ago • 12
wgcyeo/ci-grpo_Olmo-3-7B-Think_bs8_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30 Text Generation • Updated 7 days ago • 36
wgcyeo/ci-feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-1.7B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 7 days ago • 15
wgcyeo/ci-feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-1.7B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 7 days ago • 15
wgcyeo/feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-0.6B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 7 days ago • 14
wgcyeo/feedback_asym_bi_kl_hybrid_fixed_ema_Qwen3-0.6B_bw0p5_fw0p5_ema0p999_ep30 Text Generation • Updated 7 days ago • 14
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_interp_Qwen3-4B-Instruct-2507_bw0p5_fw0p5_stw0p3_ep30 Text Generation • Updated 9 days ago • 7
wgcyeo/ci-feedback_weighted_asym_bi_kl_fixed_interp_Qwen3-4B-Instruct-2507_bw0p5_fw0p5_stw0p3_ep30 Text Generation • Updated 9 days ago • 7
wgcyeo/ci-fb_w_asym_bi_kl_fixed_ema_interp_Qwen3-4B-Instruct-2507_bw0p5_fw0p5_ema0p999_stw0p3_ep30 Text Generation • Updated 9 days ago • 13