Llama8B_Star1K_GRPO_Epoch3_bsz24 / model-00003-of-00004.safetensors

Commit History

Upload folder using huggingface_hub
b62f09c
verified

dingyue1011 committed on